VPC, etc. IaC: AWS CDK, Terraform, or CloudFormation. CI/CD pipelines + scripting (Python, Bash, PowerShell). Containerized applications (Docker + ECS). Observability tooling like New Relic, CloudWatch, Prometheus, Datadog. Who we’re looking for: Proven SRE or platform engineering experience in a high-availability environment. Passion for reliability, automation, and system performance. Strong problem-solving mindset and solid communication More ❯
containerization and orchestration tools like Docker, Kubernetes, AKS, and Helm. Programming skills in Python, Java, PowerShell, or Go, with understanding of REST APIs. Experience with observability tools such as DataDog, Prometheus, Splunk, Elasticsearch, Grafana, Azure Monitor. Experience with CI/CD tools like Git, Terraform, Jenkins. Azure cloud expertise in mission-critical environments. Additional qualifications Azure cloud certification. Understanding of More ❯
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
East London, London, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid / WFH Options
Zettafleet
Experience in architecting and deploying in cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Leadership: A track record of leading complex projects. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down complex More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Roc Search
and serverless compute options. Build and maintain CI/CD pipelines using industry-standard tools (e.g., GitHub Actions, GitLab CI, Jenkins). Implement monitoring and logging using tools like DataDog, Serilog, CloudWatch, or equivalent. Use Docker and Kubernetes for containerisation and orchestration of applications. Manage deployments with Helm and configuration in YAML. Develop shell scripts and automation for deployment and More ❯
discipline (e.g., AWS, Kubernetes, etc.). Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others. Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform. Ability to contribute to large and collaborative teams by presenting information in a logical More ❯
London, England, United Kingdom Hybrid / WFH Options
Zettafleet
native technologies: Experience in deploying to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity: A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down More ❯
London, England, United Kingdom Hybrid / WFH Options
Ten Lifestyle Group
with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building agentic workflows (e.g., autonomous systems More ❯
Familiarity with Infrastructure as Code and DevOps practices. Knowledge of Hyper-V management. Understanding of networking, security, and system administration (Linux/Windows). Experience with monitoring tools (e.g., DataDog, CloudWatch, Azure Monitor). Strong communication and collaboration skills. Responsibilities: Deploying and managing Kubernetes clusters, including networking, storage, and security. Collaborating with development and platform teams to deliver scalable, secure More ❯
DevOps and SRE best practices and standards, and supporting the implementation and adoption of these standards. Experience using and enabling monitoring and alerting tools and services: Dynatrace, Datadog, CloudWatch, Splunk, Grafana, Prometheus. Hands-on experience with Git, Bitbucket, Jenkins, Sonar, Maven, CI/CD tools, Linux and Solaris, relational SQL and non-SQL DB technologies, streaming system – Kafka More ❯
gating into the SDLC. Ensure pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting: Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. Security, Audit & Compliance: Partner with More ❯
Familiarity with Infrastructure as Code and DevOps practices. Knowledge of Hyper-V management. Understanding of networking, security, and system administration (Linux/Windows). Experience with monitoring tools (e.g., DataDog, CloudWatch, Azure Monitor). Strong communication and collaboration skills. Responsibilities Deploying and managing Kubernetes clusters, including networking, storage, and security. Collaborating with development and platform teams to deliver scalable, secure More ❯
such as AWS Certified Solutions Architect, Azure Solutions Architect Expert, or Google Professional Cloud Architect. Experience in Agile development environments. Familiarity with monitoring and logging tools (Prometheus, ELK Stack, Datadog). Contribution to open-source projects or community involvement. Seniority level: Mid-Senior level. Employment type: Full-time. Job function: Engineering and Information Technology More ❯