Observability Jobs in the UK

1 to 25 of 607 Observability Jobs in the UK

Cloud Centre of Excellence (CCoE) Engineer

Birmingham, Staffordshire, United Kingdom
Oldcastle Inc
Develop a baseline monitoring and tooling concept for cloud to address the need for compliance infrastructure reporting within agile deliveries as part of our Observability strategy. Develop concepts and tools for chargeback and showback (Financial Instrumentation) in a multicloud context. Implement and mature a cloud forecasting and capacity management solution More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Platform Engineer

London, United Kingdom
Experis
Experience: Proven experience working in cloud-native environments at scale. Exposure to high-load, high-performance systems and large-scale microservices architectures. Experience with observability and monitoring frameworks (OpenTelemetry, Grafana, Prometheus). Knowledge of Graph Databases and AI integration in platform operations is a plus. Experience mentoring junior engineers and More ❯
Employment Type: Contract
Rate: £518/day
Posted:

Site Reliability Engineer / DevOps Engineer

United Kingdom
Hybrid / WFH Options
Stratospherec Ltd
providers such as Azure, AWS or GCP. Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Machine Learning Ops Engineer

Belfast, United Kingdom
DailyPay
CD Pipeline Development: Develop and maintain robust CI/CD pipelines for continuous integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer

London Area, United Kingdom
Hybrid / WFH Options
Premier Group
GitLab CI/Jenkins) Automate deployments and monitoring for multiple environments Implement Infrastructure as Code using Terraform Manage containerised environments with Docker & Kubernetes Enhance observability with tools like Prometheus , Grafana , and Datadog Collaborate closely with developers, testers, and platform teams 🧰 Tech Stack You'll Use: Cloud: AWS (core services: EC2 More ❯
Posted:

DevOps Engineer

london, south east england, united kingdom
Hybrid / WFH Options
Premier Group
GitLab CI/Jenkins) Automate deployments and monitoring for multiple environments Implement Infrastructure as Code using Terraform Manage containerised environments with Docker & Kubernetes Enhance observability with tools like Prometheus , Grafana , and Datadog Collaborate closely with developers, testers, and platform teams 🧰 Tech Stack You'll Use: Cloud: AWS (core services: EC2 More ❯
Posted:

Platform Engineer - Azure

City of London, London, United Kingdom
Adecco
secure applications and infrastructure Strong communication skills, with the ability to convey and or understand complex technical concepts clearly and concisely SRE skills including observability and telemetry monitoring HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul) Containerisation using Docker, Kubernetes, OpenShift & Helm Programming skills using languages such as Python, Go, Java More ❯
Employment Type: Permanent
Salary: £70000 - £80000/annum
Posted:

Director, DevOps

London, United Kingdom
Group M Worldwide Inc
Actions, CircleCI ) and orchestration technologies (e.g., Kubernetes, Docker). Proficiency in scripting and programming languages (e.g., Python, Bash, Go). Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog). Solid understanding of security best practices, compliance standards, and DevSecOps . Proven ability to manage and deliver complex projects More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Vice President, DevOps Engineer (NE)

London, United Kingdom
Hybrid / WFH Options
ENGINEERINGUK
available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will: Use your site reliability expertise to design, operate and support Preqin's infrastructure, middleware and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer

london, south east england, united kingdom
Sanderson
CD tools and workflows (e.g., GitHub Actions, Jenkins, GitLab CI). Expertise in Infrastructure-as-Code using Terraform (or similar tools). Experience with observability tools (e.g., Prometheus, Grafana, ELK, Datadog). Strong communication and collaboration skills. Bonus Points For Experience in containerization and orchestration (e.g., Docker, Kubernetes). Background More ❯
Posted:

Python Tech Lead

London, United Kingdom
N Consulting Limited
skills. Preferred Skills: Experience with TDD, BDD, and automated testing frameworks (PyTest, Selenium). Familiarity with security best practices in software development. Knowledge of observability tools like Prometheus, Grafana, and ELK stack. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer - (Networks, AWS & Kubernetes)

London, United Kingdom
Source Technology
balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Native DevOps Engineer (AWS)

London Area, United Kingdom
Hybrid / WFH Options
LHH
or CloudFormation. Implement CI/CD pipelines, enabling continuous integration and continuous deployment for mission-critical applications. Monitor system performance, availability, and security, implementing observability best practices. Work in an Agile environment, engaging with stakeholders to understand requirements and deliver iterative improvements. Your skills and experience Essential: Experience deploying and More ❯
Posted:

Cloud Native DevOps Engineer (AWS)

london, south east england, united kingdom
Hybrid / WFH Options
LHH
or CloudFormation. Implement CI/CD pipelines, enabling continuous integration and continuous deployment for mission-critical applications. Monitor system performance, availability, and security, implementing observability best practices. Work in an Agile environment, engaging with stakeholders to understand requirements and deliver iterative improvements. Your skills and experience Essential: Experience deploying and More ❯
Posted:

Solace Messaging Administrator

London Area, United Kingdom
BGC Group
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). More ❯
Posted:

Solace Messaging Administrator

london, south east england, united kingdom
BGC Group
high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). More ❯
Posted:

Senior DevOps Engineer

United Kingdom
London Stock Exchange Group
Terraform and ARM templates. Hands-on experience and understanding of containerization and orchestration with Azure Kubernetes and Docker . Design and implement monitoring and observability solutions to ensure the health and performance of cloud resources and applications. Identify opportunities to optimize cloud resources, improve performance, and reduce costs through monitoring More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer (Azure or GCP) Engineering Newcastle upon Tyne, Leeds

Leeds, Yorkshire, United Kingdom
Hedgehog Lab
enhance internal DevOps culture, tooling, and CI/CD processes. Collaborate cross-functionally to continuously innovate and improve development workflows and system operations. Foster observability and reliability across live systems through best-in-class monitoring and automation. Day to Day: Collaborate with engineers and architects to define and implement cloud More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

Central London, UK
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

West London, UK
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

East London, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

London Area, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

City of London, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Central London / West End, London, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:

Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

Leigh, Greater Manchester, United Kingdom
Hybrid / WFH Options
Future Talent Group
Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise More ❯
Posted:
Observability
10th Percentile
£51,250
25th Percentile
£63,630
Median
£75,000
75th Percentile
£93,778
90th Percentile
£112,125