Southampton, Hampshire, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
Hedge End, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
service level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration … such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through More ❯
London, England, United Kingdom Hybrid / WFH Options
Tes
environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI More ❯
Grays, England, United Kingdom Hybrid / WFH Options
TES
environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI More ❯
Integration services such as messaging and streams. Building RESTful API Services. Containerization, Kubernetes, serverless functions. Microservices and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (eg, ELK, Splunk, Prometheus, Grafana). Automation Scripting (using Scripting languages such as Terraform, Ansible, etc.). Experience working with Continuous Integration (CI), Continuous Delivery (CD), and continuous testing tools. Experience working within an Agile More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python. Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI. Practical experience with Grafana, Prometheus and/0r other monitoring tools. Solid understanding of networking, security, and compliance principles. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills, with the ability to More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
Undisclosed
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices, and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using scripting languages such as Terraform, Ansible etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an More ❯
London, England, United Kingdom Hybrid / WFH Options
Quaisr Limited
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing More ❯
or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). Exposure to big data processing More ❯
London, England, United Kingdom Hybrid / WFH Options
Global Screening Services
Take strategic direction and own end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems Strong experience with Python and More ❯
London, England, United Kingdom Hybrid / WFH Options
ZigZag Global
scripting and automation using languages like PowerShell, Bash, or Python Hands-on experience with CI/CD tools like Azure DevOps, GitHub Actions or GitLab CI Practical experience with Grafana, Prometheus and/0r other monitoring tools Solid understanding of networking, security, and compliance principles Excellent problem-solving and troubleshooting skills Strong communication and collaboration skills, with the ability to More ❯
Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your security More ❯
City of London, Greater London, UK Hybrid / WFH Options
LHH
Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your security More ❯
language), Bash/Shell, YAML including any Development frameworks Extensive experience and in-depth knowledge of the Linux operating system for effective troubleshooting activities Experience with Observability tools like Grafana, Prometheus, ELK, OCI Observability We highly value ownership and initiative with capabilities to drive projects independently Dealing with changes on a daily basis in a very dynamic work environment Good More ❯
London, England, United Kingdom Hybrid / WFH Options
Capgemini
Strong scripting skills in Python, Bash, or PowerShell for automation. • Understanding of AWS networking concepts, including VPCs, subnets, security groups. • Experience with monitoring and logging solutions, such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. • Familiarity with Zero Trust security models and best practices for securing cloud workloads. • Ability to troubleshoot complex infrastructure issues and optimize cloud deployments. Your security More ❯
Eastbourne, England, United Kingdom Hybrid / WFH Options
AxisOps
but not required) Maintain and evolve microservice architecture built in Python and PHP, with deployment via GitLab CI/CD and runtime orchestration via Andromeda Deliver observability using Prometheus, Grafana, and the ELK stack, supporting metrics, logs, and alerting workflows Support and maintain internal ML infrastructure and pipelines , helping ensure that our AI and data workloads run securely and efficiently More ❯
London, England, United Kingdom Hybrid / WFH Options
NPAworldwide
GitLab CI, or similar. Automate deployment and configuration of applications and services using scripting languages and configuration management tools. Monitor infrastructure health and performance using tools like CloudWatch, Prometheus, Grafana, etc. Provide ongoing post-deployment support to customers, ensuring reliability, scalability, and performance. Collaborate with software engineering, product, and support teams to streamline DevOps processes and improve the customer experience. … container orchestration. Proficiency in scripting (Bash, Python, or similar). Experience with Infrastructure as Code (Terraform, CloudFormation). Familiarity with monitoring tools and log aggregation (e.g., CloudWatch, ELK stack, Grafana). Excellent communication and problem-solving skills. Desirable: Certification in AWS, Azure, or Kubernetes. Experience supporting SaaS or enterprise-scale applications. Understanding of DevSecOps principles and secure cloud practices. Why More ❯
Gloucester, Gloucestershire, South West Hybrid / WFH Options
CGI
who can adapt to client problems as required. The Tech Stack used is: Java, Python, Javascript (Typescript), Vue, Bash, Jenkins, Ansible ,Cucumber, NiFi, Go, AWS, Gitlab, ELK stack, Terraform, Grafana, Sonarqube, Openshift, Linux Required qualifications to be successful in this role Proven experience in Site Reliability Engineering or a similar DevOps/SRE role supporting cloud-based applications. Strong scripting … tools like Terraform. Solid understanding of AWS services and cloud-native architecture. Strong troubleshooting skills with experience in Linux-based environments. Experience with monitoring and logging tools such as Grafana, ELK Stack, and SonarQube. Familiarity with container orchestration using OpenShift (or Kubernetes equivalent). Ability to support, maintain, and improve deployment environments, ensuring reliability and scalability. Comfortable with live service More ❯
London, England, United Kingdom Hybrid / WFH Options
Amber Labs
to ensure continuous integration, delivery, and deployment of applications. Collaborate with the development team to optimise pipeline efficiency and ensure code quality. Implement monitoring solutions using AWS CloudWatch, Prometheus, Grafana, or similar tools to ensure visibility into application performance, health, and security. Troubleshoot production issues and provide resolution. Ensure the security of cloud infrastructure by implementing best practices like IAM … Experience automating infrastructure tasks using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Monitoring & Logging Tools: Experience with monitoring and logging tools such as AWS CloudWatch, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana). Benefits: Join a rapidly expanding start-up where personal growth is a part of our DNA. Benefit from a flexible work environment focused on More ❯
London, England, United Kingdom Hybrid / WFH Options
Tripadvisor
building and working with and monitoring microservice architectures in large distributed cloud environments (ideally AWS). Experience with Observability tooling – having proficiency using tools like Elasticsearch, Kibana, APM, Sentry, Grafana, Prometheus, Overops, or similar The ability to guide and mentor other members within the team and improve the way we collaborate, learn, and share ideas Documentation and internal team members … IaC – Terraform, CloudFormation, VPC, IAM, EC2, EKS, Lambda, RDS, S3, CloudWatch, puppet, docker Experience building and running monitoring infrastructure at a large scale. For example, Elasticsearch clusters, Prometheus, Kibana, Grafana, etc Web applications and HTTP servers – Java, apache, nginx Load balancers – ELB, HAProxy, nginx Experience in running SQL/NoSQL data stores – RDS, DynamoDB, ElastiCache, Solr Perks of Working at More ❯
Agile teams using tools like Git , Jira , and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault , Consul , Packer Monitoring and observability with Grafana , Prometheus , or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team More ❯