Who we are: We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are: You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo. Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck. Experience using configuration management platforms like Ansible, Puppet, or Chef. Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer. More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo. Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck. Experience using configuration management platforms like Ansible, Puppet, or Chef. Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer. More ❯
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, Datadog, or New Relic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
such as Azure, AWS or GCP. Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills More ❯
the equivalent with Azure and GCP. Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools: Kibana, OpenSearch, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFx, Instana, Splunk, Honeycomb, Jaeger. Hands-on experience with Infrastructure as Code (Terraform/Ansible). Hands-on experience in technical integrations More ❯
the equivalent with Azure and GCP. Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools: Kibana, OpenSearch, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFx, Instana, Splunk, Honeycomb, Jaeger. Hands-on experience with Infrastructure as Code (Terraform/Ansible). Hands-on experience in technical integrations (OpenTelemetry/ More ❯
Preferred Qualifications: Experience in hybrid cloud environments and integration with on-premises systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
Experience with Event-Driven Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
and other relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at More ❯
Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., Datadog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent problem-solving and stakeholder management skills. Strong written and oral communication skills. Experience collaborating across multiple topics in parallel. Responsibilities: Collaborating with technical More ❯
development in general, with skills in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices. Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage them to support effective MTTR and mitigate high CFR. Our UK benefits: Stock Options; Annual Performance Bonus or Commissions; Pension matched up to More ❯
VPC, etc. IaC: AWS CDK, Terraform, or CloudFormation. CI/CD pipelines + scripting (Python, Bash, PowerShell). Containerized applications (Docker + ECS). Observability tooling like New Relic, CloudWatch, Prometheus, Datadog. Who we’re looking for: Proven SRE or platform engineering experience in a high-availability environment. Passion for reliability, automation, and system performance. Strong problem-solving mindset and solid communication More ❯
East London, London, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯