London, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Portsmouth, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
Tes
environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI More ❯
Grays, England, United Kingdom Hybrid / WFH Options
TES
environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python. Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI. Practical experience with Grafana, Prometheus and/or other monitoring tools. Solid understanding of networking, security, and compliance principles. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills, with the ability to More ❯
London, England, United Kingdom Hybrid / WFH Options
Quaisr Limited
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing More ❯
London, England, United Kingdom Hybrid / WFH Options
Global Screening Services
Take strategic direction and own end-to-end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and CloudWatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems Strong experience with Python and More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI Practical experience with Grafana, Prometheus and/or other monitoring tools Solid understanding of networking, security, and compliance principles Excellent problem-solving and troubleshooting skills Strong communication and collaboration skills, with the ability to More ❯
solutions using AWS, Kubernetes, and associated DevOps practices. Champion DevOps culture by integrating CI/CD pipelines using Jenkins, GitLab, or similar tools. Leverage monitoring and observability tools like Grafana and Prometheus for system reliability. Enhance security practices and ensure compliance with stringent security and accreditation standards. What You’ll Bring: Active DV or eDV clearance is essential due to More ❯
London, England, United Kingdom Hybrid / WFH Options
CAPGEMINI ENGINEERING
Strong scripting skills in Python, Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, security groups. Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your security More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Energy
Science, Management Information Systems, or related is desirable but not essential. Nice to have but not essential: Container Orchestration (Kubernetes, Docker Swarm) Service monitoring and graphing tools (Prometheus + Grafana, Nagios + Munin) Elastic stack Infrastructure as Code (Terraform) Repository solutions (JFrog Artifactory, JFrog Bintray, Reprepro) Let's Encrypt/ACME OpenVPN Apache Tomcat Messaging streams or communication platforms (RabbitMQ, Postfix More ❯
language), Bash/Shell, YAML including any Development frameworks Extensive experience and in-depth knowledge of the Linux operating system for effective troubleshooting activities Experience with Observability tools like Grafana, Prometheus, ELK, OCI Observability We highly value ownership and initiative with capabilities to drive projects independently Dealing with changes on a daily basis in a very dynamic work environment Good More ❯
London, England, United Kingdom Hybrid / WFH Options
Capgemini
Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your security More ❯
London, England, United Kingdom Hybrid / WFH Options
SAP SE
practices, RBAC, IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration & Problem-Solving : Ability to work closely with More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python. Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI. Practical experience with Grafana, Prometheus and/or other monitoring tools. Solid understanding of networking, security, and compliance principles. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills, with the ability to More ❯
Docker, Kubernetes etc. Extensive experience using Infrastructure as Code for configuration management and code implementation - Terraform etc. Experience setting up and using monitoring and alerting tools such as Dynatrace, Grafana, CloudWatch etc. Experience using configuration management tools like Puppet, Ansible, Packer, Chef. Experience with various testing tooling - Selenium, Cucumber etc. Experience in scripting - bash/shell Ability to be flexible More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
Capgemini
Docker, Kubernetes etc. Extensive experience using Infrastructure as Code for configuration management and code implementation - Terraform etc. Experience setting up and using monitoring and alerting tools such as Dynatrace, Grafana, CloudWatch etc. Experience using configuration management tools like Puppet, Ansible, Packer, Chef. Experience with various testing tooling - Selenium, Cucumber etc. Experience in scripting - bash/shell Ability to be flexible More ❯
City of London, London, United Kingdom Hybrid / WFH Options
SoTalent
teams to ensure robust and scalable integrations. Drive continuous improvement, automation, and cost-optimization across engineering platforms. Provide advanced troubleshooting and 3rd-line production support using tools like Prometheus, Grafana, and ELK Stack . Maintain detailed technical documentation, system diagrams, and operational runbooks. Ensure compliance with data security and regulatory standards (e.g., GDPR, ISO 27001). Contribute to disaster recovery More ❯
South East London, England, United Kingdom Hybrid / WFH Options
SoTalent
teams to ensure robust and scalable integrations. Drive continuous improvement, automation, and cost-optimization across engineering platforms. Provide advanced troubleshooting and 3rd-line production support using tools like Prometheus, Grafana, and ELK Stack . Maintain detailed technical documentation, system diagrams, and operational runbooks. Ensure compliance with data security and regulatory standards (e.g., GDPR, ISO 27001). Contribute to disaster recovery More ❯