Observability Jobs in England

1 to 25 of 2,507 Observability Jobs in England

DevSecOps Engineer

London, England, United Kingdom
Hybrid / WFH Options
Tes
microservices design patterns and deployment strategies in a cloud-native environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI …

DevSecOps Engineer

Grays, England, United Kingdom
Hybrid / WFH Options
TES
microservices design patterns and deployment strategies in a cloud-native environment. Security Best Practices: Strong understanding of security frameworks and compliance standards for cloud infrastructure and DevOps processes. Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking. Skills CI/CD Tools: Hands-on experience with Jenkins, GitLab CI …

Senior DevOps Engineer - Monitoring & Observability

London, England, United Kingdom
Lumenalta
As a Senior DevOps Engineer at Lumenalta, you will be pivotal in architecting and managing cloud-based systems on AWS, implementing CI/CD pipelines, and automating infrastructure deployment using tools like Terraform and AWS CDK. You will … to automate application builds, testing, and deployments. Infrastructure as Code (IaC): Use Terraform, AWS CDK, or CloudFormation to automate cloud resource provisioning, enabling consistent and repeatable infrastructure deployments. Monitoring & Observability: Implement monitoring, logging, and alerting solutions using tools like Prometheus, Grafana, Loki, Datadog, or CloudWatch to ensure system health and performance. Security & Compliance: Implement security best practices for cloud infrastructure …

Mid-Senior DevOps / Site Reliability Engineer (m/f/*)

London, England, United Kingdom
Hybrid / WFH Options
Quaisr Limited
such as Kubernetes, Docker Swarm, or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). …

Senior DevOps Engineer

London, England, United Kingdom
Darktrace
like ArgoCD and Helm). Experience in migrating monolithic applications into microservices architectures. In-depth Linux/Unix experience, emphasizing system performance tuning and automation. Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Loki, OTel, ELK stack) to ensure system reliability and performance. Experience in developing and working with backend application technologies (e.g., Express, Django). Benefits we offer …

Senior DevOps Engineer

Liverpool, United Kingdom
Hybrid / WFH Options
Acorn Group
with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform Associate About Acorn Insurance …
Employment Type: Permanent
Salary: GBP Annual

Senior DevOps Engineer

Liverpool, Lancashire, United Kingdom
Hybrid / WFH Options
The Acorn Group
with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform Associate About Acorn Insurance …
Employment Type: Permanent
Salary: GBP Annual

SAP Sovereign Cloud Expert DevOps Engineer

London, United Kingdom
SAP SE
IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging: Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation: Strong scripting skills in PowerShell, Bash, and Python, along with automation frameworks like Ansible. Collaboration & Problem-Solving: Ability to work closely with development, security …
Employment Type: Permanent
Salary: GBP Annual

SAP Sovereign Cloud Expert DevOps Engineer

London, England, United Kingdom
Hybrid / WFH Options
SAP SE
IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging: Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation: Strong scripting skills in PowerShell, Bash, and Python, along with automation frameworks like Ansible. Collaboration & Problem-Solving: Ability to work closely with development, security …

SAP Sovereign Cloud Expert DevOps Engineer

London, England, United Kingdom
SAP
IAM, networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging: Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation: Strong scripting skills in PowerShell, Bash, and Python, along with automation frameworks like Ansible. Collaboration & Problem-Solving: Ability to work closely with development, security, and …

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management … principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless …

Junior DevOps Engineer

City of London, London, United Kingdom
Sparta Global
plus) A strong interest in automation, infrastructure best practices, and continuous learning Good communication skills and a collaborative mindset Desirable Skills Experience with Terraform, Ansible, or Helm Familiarity with observability tools such as ELK Stack, CloudWatch, or New Relic Understanding of security considerations in cloud and CI/CD environments …

Junior DevOps Engineer

London Area, United Kingdom
Sparta Global
plus) A strong interest in automation, infrastructure best practices, and continuous learning Good communication skills and a collaborative mindset Desirable Skills Experience with Terraform, Ansible, or Helm Familiarity with observability tools such as ELK Stack, CloudWatch, or New Relic Understanding of security considerations in cloud and CI/CD environments …

Junior DevOps Engineer

Slough, England, United Kingdom
JR United Kingdom
is a plus) A strong interest in automation, infrastructure best practices, and continuous learning Good communication skills and a collaborative mindset Experience with Terraform, Ansible, or Helm Familiarity with observability tools such as ELK Stack, CloudWatch, or New Relic Understanding of security considerations in cloud and CI/CD environments …

Lead DevOps/SRE Engineer

Bristol, England, United Kingdom
Hybrid / WFH Options
Canada Life Assurance Europe plc
infrastructure to the cloud and understanding the challenges involved Familiarity with cloud security best practices, identity and access management (IAM), and encryption techniques Microsoft Azure certifications are a plus Observability: Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using Datadog, App Insights or similar tools. Designing … applications and infrastructure for observability, security, and reliability. Networking & Security: Monitor and enhance network performance, ensuring high levels of security and scalability across all cloud environments. Enforce security best practices in AKS, including network policies, RBAC (Role-Based Access Control), and integration with Azure Active Directory Core Services: Azure core services such as Azure Storage, including Blob, Azure VMs, Azure …

Junior DevOps Engineer

London, England, United Kingdom
Sparta Global
is a plus) A strong interest in automation, infrastructure best practices, and continuous learning Good communication skills and a collaborative mindset Experience with Terraform, Ansible, or Helm Familiarity with observability tools such as ELK Stack, CloudWatch, or New Relic Understanding of security considerations in cloud and CI/CD environments Additional Details Seniority level: Entry level Employment type: Full-time …

GCP Technical Lead

London Area, United Kingdom
TEKsystems
Storage, Compute G Cloud CLI, VPC, IAM, GCE, GCS, GKE, Pub/Sub, Cloud Run, Cloud SQL, BigQuery, Dataflow, Bigtable, Firestore GCP – Networking, Security tool/Best Practices Observability - Operations suite, Logging, Monitoring, Alerting. Additional Skills: Good understanding of Linux OS. Bash, Scripting, Automation, Ansible, Networking, Security. Hands-on experience with DevOps Principles and Tools. Hands-on with Terraform …

GCP Technical Lead

City of London, London, United Kingdom
TEKsystems
Storage, Compute G Cloud CLI, VPC, IAM, GCE, GCS, GKE, Pub/Sub, Cloud Run, Cloud SQL, BigQuery, Dataflow, Bigtable, Firestore GCP – Networking, Security tool/Best Practices Observability - Operations suite, Logging, Monitoring, Alerting. Additional Skills: Good understanding of Linux OS. Bash, Scripting, Automation, Ansible, Networking, Security. Hands-on experience with DevOps Principles and Tools. Hands-on with Terraform …

GCP Technical Lead

South East London, England, United Kingdom
TEKsystems
Storage, Compute G Cloud CLI, VPC, IAM, GCE, GCS, GKE, Pub/Sub, Cloud Run, Cloud SQL, BigQuery, Dataflow, Bigtable, Firestore GCP – Networking, Security tool/Best Practices Observability - Operations suite, Logging, Monitoring, Alerting. Additional Skills: Good understanding of Linux OS. Bash, Scripting, Automation, Ansible, Networking, Security. Hands-on experience with DevOps Principles and Tools. Hands-on with Terraform …

Senior Site Reliability Engineer

England, United Kingdom
Hybrid / WFH Options
Stratospherec Limited
one or more public cloud providers such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and …

Senior Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Stratospherec Limited
one or more public cloud providers such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and …

Senior Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
TieTalent
one or more public cloud providers such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and …

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Global Screening Services
impact! About The Role This is an exciting opportunity to join our growing Operations team managing Kubernetes clusters in Production and, through a DevOps culture, empower development teams with observability insights they can use to innovate faster. We are looking for a Site Reliability Engineer, or production-experienced DevOps Engineer, who has working experience building observability for cloud-native SaaS … products and driving operational excellence. You will be responsible for delivering our monitoring infrastructure, shaping observability, and responding to incidents as well as ensuring the platform is performant and reliable. You will be a key member of the team, liaising with product teams, embedding SRE principles and building the observability platform for the next stage of growth at GSS. You … new features are maintainable, have well-defined SLIs, achievable SLOs, are properly monitored, and evaluated for failure scenarios Enabling development teams through DevOps culture and the effective use of observability tools. Promote best practice, present KT sessions, help troubleshoot and resolve business-affecting issues Building on our existing monitoring tools to deliver a comprehensive, optimised observability platform for logging, metrics …

Site Reliability Engineer

Southampton, Hampshire, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management … principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless …
Employment Type: Permanent

Site Reliability Engineer

Southampton, Hampshire, South East, United Kingdom
Hybrid / WFH Options
Spectrum It Recruitment Limited
level goals You'll Stand Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management … principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of containerisation (e.g., Docker, Kubernetes) and microservices architecture Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless …
Employment Type: Permanent, Work From Home

Observability (England)
10th Percentile: £57,500
25th Percentile: £65,000
Median: £77,500
75th Percentile: £97,500
90th Percentile: £117,500