Observability Jobs in London

126 to 150 of 374 Observability Jobs in London

Head of Data Engineering (London)

London, UK
Hybrid / WFH Options
Zego
About Zego At Zego, we understand that traditional motor insurance holds good drivers back. It's too complicated, too expensive, and it doesn't reflect how well you actually drive. Since 2016, we have been on a mission to change More ❯
Employment Type: Full-time
Posted:

Head of Data Engineering

London, United Kingdom
Hybrid / WFH Options
Zego
About Zego At Zego, we understand that traditional motor insurance holds good drivers back. It's too complicated, too expensive, and it doesn't reflect how well you actually drive. Since 2016, we have been on a mission to change More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Azure Infrastructure Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
DGH Recruitment
managing cloud infrastructures, with expertise in Infrastructure as Code (IaC), particularly using Terraform, proficiency in designing and implementing CI/CD pipelines, and a deep understanding of monitoring and observability practices. Core responsibilities: - Architect, deploy, and manage Azure-based infrastructure to ensure high availability, scalability, and security. - Develop and maintain Infrastructure as Code (IaC) using Terraform for automated and consistent … Code (IaC) tools, especially Terraform. - Experience in designing and managing CI/CD pipelines using tools such as Azure DevOps, Jenkins, or AWS CodePipeline. - Strong understanding of monitoring and observability tools and practices, including experience with Azure Monitor, SCOM, SolarWinds or similar technologies. Senior Azure Infrastructure Engineer (Azure/Terraform/IaC/CI/CD/AWS) In accordance More ❯
Employment Type: Permanent, Work From Home
Salary: £85,000
Posted:

Head of Platform & Infrastructure (London)

London, UK
Hybrid / WFH Options
ZipRecruiter
with organisational goals. Ensure all services are secure by design, working closely with the information security team to proactively manage risks. Drive service improvement and operational resilience through automation, observability, and DevOps best practices. Experience Required: Proven experience in leading platform/infrastructure and DevOps teams in a hands-on capacity. Strong technical foundation in both traditional infrastructure and modern … CI/CD, GitOps, IaC (e.g., Terraform, ARM), and automation scripting (e.g., PowerShell, Bash, Python). Cloud experience (ideally Azure) and hybrid infrastructure environments. Familiarity with monitoring, alerting, and observability platforms. Package: £100,000 - £120,000 Basic Salary Up to 25% Bonus 15% Pension Remote Working Head of Platform & Infrastructure Engineering – Financial Services- London (Hybrid/Remote More ❯
Employment Type: Full-time
Posted:

Cloud Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Picture More
project ownership, this is a greenfield role with space to shape and lead Work with modern Azure technologies in a mature, enterprise setting Exposure to CI/CD, security, observability, and containerised environments Be a mentor and thought leader, influence others and grow professionally Enjoy a collaborative, diverse, and inclusive team culture A chance to work with global stakeholders in … to AWS. You'll: Architect and deploy scalable infrastructure using Terraform and IaC Design and enhance CI/CD pipelines (e.g. Azure DevOps, Jenkins) Implement robust monitoring, logging and observability tools (Azure Monitor, SolarWinds) Work with DevOps and Security teams to enforce cloud governance and zero-trust models Stay close to the technology – supporting the business, guiding junior engineers, and More ❯
Employment Type: Full-Time
Salary: £82,000 - £86,000 per annum
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
ZILO
upgrades, and maintenance of AWS and EKS infrastructure Define and implement resilience and failover strategies for microservices and core platforms Continuously monitor and improve system performance, cost-efficiency, and observability (LGTM stack/Datadog) Partner with security teams on compliance and vulnerability remediation ️ Chaos Engineering & Resilience Design and execute Chaos Engineering experiments. Develop and track SLOs, SLIs, and error budgets … Strong familiarity to be able to read code and trace failures in one or more of the following application languages Java GoLang React .NET Python Solid understanding of modern observability tooling (e.g., Datadog, Loki, Grafana) Comfortable working on a shared on-call rotation Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care including family cover Life More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
Omnea
with React & Material UI, Postgres, Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and Github Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From product, design and architectural decisions … ideally AWS). You focus on having a high impact . You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaaC, observability and quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like . You understand what it … takes to build highly reliable & well architected products. You build with quality, observability & redundancy at the forefront. You're ready to get a lot done. You enjoy all aspects of building a product and are comfortable moving across the stack when necessary. You enjoy problem solving and thinking from first principals You're ready to pick up new skills and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (London)

Wandsworth, Greater London, UK
Omnea
with React & Material UI, Postgres, Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and Github Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From product, design and architectural decisions … ideally AWS). You focus on having a high impact . You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaaC, observability and quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like . You understand what it … takes to build highly reliable & well architected products. You build with quality, observability & redundancy at the forefront. You’re ready to get a lot done. You enjoy all aspects of building a product and are comfortable moving across the stack when necessary. You enjoy problem solving and thinking from first principals.. You’re ready to pick up new skills and More ❯
Employment Type: Full-time
Posted:

Engineering Excellence Lead

London, United Kingdom
Hybrid / WFH Options
Trili
Collaborate with People/HR and engineering leadership on career pathing, training, and coaching for engineering staff. Technology Enablement: Evaluate and deploy tools - especially AI - that support engineering productivity, observability, and collaboration. Work closely with DevOps, QA, and SRE teams to align infrastructure and operational excellence with engineering needs. Own key vendor relationships, evaluation of partnerships and represent technology on … scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous long-term incentive plan (LTIP) tez token More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Restaurant Technology Problem Manager

London, United Kingdom
Hybrid / WFH Options
McDonald's Corporation
as follows: Own ITIL Problem & Change Management Take ownership of ITIL Problem Management activities, proactively identifying, addressing and fixing root causes of incidents and recurring issues within the system. Observability lead, promoting stability across the estate by collaborating with cross-functional teams to implement preventive measures. Actively take part in ITIL Change Management processes, ensuring that changes to the system … efficiently. Experience in implementing changes while following ITIL change management processes. Understanding of basic security principles and best practices for securing infrastructure. Optional but advantageous technical skills: Proficient using observability tools (NewRelic and Thousand Eyes), BI platform and data visualisation tools (such as Tableau and Power BI) and technology tools (Jira, Confluence). System Administration: Proficiency in Linux/Unix More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior to Principal DevOps Engineer (London)

London, UK
Hybrid / WFH Options
ZipRecruiter
Code (IaC) using Terraform to automate infrastructure provisioning and management. Establish and maintain robust security controls across all cloud environments, ensuring compliance with relevant standards and regulations. Utilise advanced observability tools to monitor and optimise the performance of production services, proactively identifying and resolving issues. Design and optimise CI/CD pipelines using platforms such as GitLab or Jenkins, enabling More ❯
Employment Type: Full-time
Posted:

Elasticsearch Platform Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Tec Partners
focus on security, resilience, and continuous improvement. Key Responsibilities: Manage and maintain Elastic Cloud Enterprise (ECE) environments, ensuring high availability and performance. Design and deploy scalable Elasticsearch solutions for Observability and Search use cases. Implement robust security, privacy, and compliance controls across Elasticsearch systems. Optimise system configurations and queries to enhance performance and reduce latency. Collaborate with cross-functional teams More ❯
Employment Type: Permanent
Salary: £77000 - £116000/annum
Posted:

SRE

London, United Kingdom
Teksystems
AWS services at the DevOps Engineer level Incident, change & problem management experience. This role is heavily operational-oriented, including on-call requirements Strong background in setup & operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL Proficient in one or more languages of Python, Go, Bash, SQL Familiar with GitHub/GitOps/container orchestration/ More ❯
Employment Type: Contract
Posted:

Senior Software Engineer

London, United Kingdom
Hybrid / WFH Options
Our Future Health Limited
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (London)

London, UK
Hybrid / WFH Options
Our Future Health Limited
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme – We invest in your future More ❯
Employment Type: Full-time
Posted:

Senior Software Engineer (Core Data Services)

London, United Kingdom
Hybrid / WFH Options
Our Future Health
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (Core Data Services) (London)

Wandsworth, Greater London, UK
Hybrid / WFH Options
Our Future Health
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme – We invest in your future More ❯
Employment Type: Full-time
Posted:

Senior Staff Software Engineer

London, United Kingdom
JDA Software
across complex systems. Solid knowledge of database systems, data modeling, and query optimization. Experience with Maven artifact deployment, Android XML, and Compose layout systems. Familiarity with monitoring, logging, and observability tools. Experience with performance optimization and security best practices. Understanding of agile development methodologies. History of mentoring junior developers and providing technical leadership. Knowledge of Dagger, Retrofit 2, RxJava, Room More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Staff Software Engineer (London)

London, UK
JDA Software
across complex systems. Solid knowledge of database systems, data modeling, and query optimization. Experience with Maven artifact deployment, Android XML, and Compose layout systems. Familiarity with monitoring, logging, and observability tools. Experience with performance optimization and security best practices. Understanding of agile development methodologies. History of mentoring junior developers and providing technical leadership. Knowledge of Dagger, Retrofit 2, RxJava, Room More ❯
Employment Type: Full-time
Posted:

Machine Learning Engineering Manager (London)

London, UK
Hybrid / WFH Options
Compare the Market
prototypes into high-quality production systems Platform & Engineering Standards • Contribute to the design and evolution of our internal ML platform and tooling • Champion best practices in CI/CD, observability, reproducibility, and infrastructure-as-code for ML • Ensure all deployed systems meet requirements for resilience, testing, security, and performance • Influence and contribute to shared frameworks, libraries, and deployment pipelines Strategy More ❯
Employment Type: Full-time
Posted:

Site Reliability Engineer (DV Security Clearance)

London
CGI
automation scripts, infrastructure as code, creating tooling or frameworks and feature development, ideally using Java and/or python. • Experience of engineering enablement products such as CI/CD, Observability and Alerting • Experience creating designs and documentation, including 'how to user guides' • Experience of investigating and resolving incidents and problems aligned to the SLAs • Continuously seeking opportunities for system performance More ❯
Employment Type: Permanent
Posted:

Senior Software Engineer (London)

London, UK
Hybrid / WFH Options
Orgvue
and overall product quality, including familiarity with test automation, TDD, or BDD methodologies Understanding of DevOps tools, processes, and concepts such as Docker, Kubernetes, CI/CD pipelines, and observability Strong product development skills and customer empathy to drive how you solve problems for our users Benefits Hybrid working - 1+ days a week in the London office Wellbeing: Sanctus Coaching More ❯
Employment Type: Full-time
Posted:

Site Reliability Engineer II, RTO London, England, United Kingdom London, England, United Kingdom

London, United Kingdom
Axon Enterprise
Experience using managed languages such as Python, Go, C#, Java, or similar. Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases. Experience using observability tools such as APM, logging, and metrics to assist with debugging issues. Experience designing tooling to simplify the operational management of SaaS/PaaS systems. Familiarity with building flexible and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Engineer

London, South East, England, United Kingdom
Holland & Barrett International Limited
and CI/CD workflows (GitLab CI). Write clean, production-grade code in Python (Scala is a bonus). Build infrastructure using Terraform, AWS CloudFormation, or SAM. Drive observability across the platform using Datadog or CloudWatch. Actively mentor Data Engineers and Associates, and lead technical discussions and design sessions. Key requirements: Must-Have: Strong experience with AWS services: Glue More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

Principal Cloud Consultant (London)

London, UK
Hybrid / WFH Options
MMT
chaos into elegant, self-healing systems that deploy flawlessly Knowledge of cloud security frameworks and compliance requirements Understanding of cost optimization strategies and cloud financial management Familiarity with monitoring, observability, and incident response best practices Communication & Business Skills Excellent presentation skills with experience speaking to technical and executive audiences Strong written communication abilities, especially for proposals and technical documentation Natural More ❯
Employment Type: Full-time
Posted:
Observability
London
10th Percentile
£65,000
25th Percentile
£73,125
Median
£82,500
75th Percentile
£108,125
90th Percentile
£120,000