Observability Jobs in London

1 to 25 of 448 Observability Jobs in London

Senior DevOps Engineer [UAE Based]

London Area, United Kingdom
AI71
or Argo Workflows) for containerized microservices, ML model training, and inference workloads. Integrate automated testing, security scans, and policy checks into the release process. Observability & Reliability Engineering Implement comprehensive monitoring, logging, and tracing stacks (Prometheus/Grafana, Loki, ELK, OpenTelemetry). Define SLOs/SLA dashboards; lead incident response, root More ❯
Posted:

Senior DevOps Engineer [UAE Based]

london, south east england, United Kingdom
AI71
or Argo Workflows) for containerized microservices, ML model training, and inference workloads. Integrate automated testing, security scans, and policy checks into the release process. Observability & Reliability Engineering Implement comprehensive monitoring, logging, and tracing stacks (Prometheus/Grafana, Loki, ELK, OpenTelemetry). Define SLOs/SLA dashboards; lead incident response, root More ❯
Posted:

SAP Sovereign Cloud Expert DevOps Engineer

London, United Kingdom
SAP SE
and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration & Problem-Solving : Ability to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer

London Area, United Kingdom
Hybrid / WFH Options
Digital Skills ltd
level experience in AWS Networking/TCP/Firewalls/Certs Advanced proficiency with containers and container orchestration tools such as Docker and Kubernetes Observability champion, experience in designing and building monitoring and logging tools such as CloudWatch, ELK, and Grafana Strong scripting skills in Bash, JavaScript or similar Knowledge More ❯
Posted:

Senior DevOps Engineer

london, south east england, United Kingdom
Hybrid / WFH Options
Digital Skills ltd
level experience in AWS Networking/TCP/Firewalls/Certs Advanced proficiency with containers and container orchestration tools such as Docker and Kubernetes Observability champion, experience in designing and building monitoring and logging tools such as CloudWatch, ELK, and Grafana Strong scripting skills in Bash, JavaScript or similar Knowledge More ❯
Posted:

Solace Messaging Administrator

London Area, United Kingdom
BGC Group
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). More ❯
Posted:

Solace Messaging Administrator

london, south east england, United Kingdom
BGC Group
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). More ❯
Posted:

Lead DevOps

Greater London, England, United Kingdom
Hybrid / WFH Options
Focus on SAP
VPC, EC2, EBS, Route 53, WAF, ALB/ELB, Network ACLs, Security Groups, KMS and S3—to meet performance, security and compliance requirements. Monitoring & Observability: Implement application and infrastructure monitoring with Prometheus & Grafana; manage centralized logging with the ELK stack. Web & Reverse Proxy: Configure and tune Nginx for traffic management More ❯
Posted:

Lead DevOps

london, south east england, United Kingdom
Hybrid / WFH Options
Focus on SAP
VPC, EC2, EBS, Route 53, WAF, ALB/ELB, Network ACLs, Security Groups, KMS and S3—to meet performance, security and compliance requirements. Monitoring & Observability: Implement application and infrastructure monitoring with Prometheus & Grafana; manage centralized logging with the ELK stack. Web & Reverse Proxy: Configure and tune Nginx for traffic management More ❯
Posted:

Vice President, DevOps Engineer (NE)

London, United Kingdom
Hybrid / WFH Options
ENGINEERINGUK
available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will: Use your site reliability expertise to design, operate and support Preqin's infrastructure, middleware and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Azure DevOps / Platform Engineer

Greater London, England, United Kingdom
Response Informatics
tools, such as Terraform, CloudFormation, ARM, or Pulumi. Expertise in building secure applications and infrastructure, with strong knowledge of security practices. SRE skills, including observability and telemetry monitoring. Hands-on experience with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerisation using Docker, Kubernetes, OpenShift, and Helm. More ❯
Posted:

Azure DevOps / Platform Engineer

london, south east england, United Kingdom
Response Informatics
tools, such as Terraform, CloudFormation, ARM, or Pulumi. Expertise in building secure applications and infrastructure, with strong knowledge of security practices. SRE skills, including observability and telemetry monitoring. Hands-on experience with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerisation using Docker, Kubernetes, OpenShift, and Helm. More ❯
Posted:

SAP Sovereign Cloud DevOps Engineer Azure

London, United Kingdom
SAP SE
Cloud Automation & Tooling (SAT) Team drives automation, security, and compliance for Sovereign Cloud across AWS, Azure, and OpenStack, leveraging IaC, CI/CD, and observability and develops Operations Control Plane (OCP) which orchestrates provisioning, monitoring, and lifecycle management, integrating with our SAP internal tools like SPC, CRM, and cloud automation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
P2P
end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Native DevOps Engineer (AWS)

London Area, United Kingdom
Hybrid / WFH Options
LHH
or CloudFormation. Implement CI/CD pipelines, enabling continuous integration and continuous deployment for mission-critical applications. Monitor system performance, availability, and security, implementing observability best practices. Work in an Agile environment, engaging with stakeholders to understand requirements and deliver iterative improvements. Your skills and experience Essential: Experience deploying and More ❯
Posted:

Cloud Native DevOps Engineer (AWS)

london, south east england, United Kingdom
Hybrid / WFH Options
LHH
or CloudFormation. Implement CI/CD pipelines, enabling continuous integration and continuous deployment for mission-critical applications. Monitor system performance, availability, and security, implementing observability best practices. Work in an Agile environment, engaging with stakeholders to understand requirements and deliver iterative improvements. Your skills and experience Essential: Experience deploying and More ❯
Posted:

Senior Network Security Engineer

London, United Kingdom
CFP Energy (UK) Ltd
streamline IT operations and business processes. Monitoring and Maintenance: Manage and maintain network security systems through system patches and periodic maintenance tasks. Establish comprehensive observability and proactive issue-resolution strategies using tools like SNMP, Syslog, Netflow, Elasticsearch (ELK Stack), and Grafana. Collaboration and Communication: Work with CyberEnergiateams to identify functional More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer - Full Stack

London, United Kingdom
Disney Cruise Line - The Walt Disney Company
frameworks (e.g., Hibernate), messaging tools (Kafka, Kinesis, Redis), and cloud infrastructure technologies (AWS, Docker, Kubernetes, Terraform). Strong understanding of CI/CD pipelines, observability tools (e.g., DataDog), and Agile and Lean methodologies. Demonstrated ability to adapt to new technologies, align technical decisions with business goals, and champion quality engineering More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

Central London, UK
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

West London, UK
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

London Area, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

City of London, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

East London, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Central London / West End, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

london, south east england, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:
Observability
London
10th Percentile
£64,509
25th Percentile
£67,500
Median
£82,500
75th Percentile
£99,063
90th Percentile
£116,250