Permanent Observability Jobs in the UK

1 to 25 of 506 Permanent Observability Jobs in the UK

DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/ )

United Kingdom
Hybrid / WFH Options
Crane Venture Partners
such as Kubernetes, Docker Swarm, or HashiCorp Nomad. Excellent problem-solving, communication, and collaboration skills. Nice to have: Experience managing distributed systems, microservices, and event-driven architectures. Knowledge of observability tools such as Prometheus, Grafana, ELK Stack, or Datadog. Experience with security best practices, monitoring, and incident response. Familiarity with DevSecOps and compliance frameworks (ISO 27001, SOC 2, GDPR). …
Employment Type: Permanent
Salary: GBP Annual

Senior DevOps Engineer

Liverpool, Lancashire, United Kingdom
Hybrid / WFH Options
The Granite Group
with GitOps tools (e.g., ArgoCD, Flux). CI/CD - Skilled in building and managing pipelines using Azure DevOps, GitHub Actions, etc. Monitoring - Experience with Prometheus, Grafana, and other observability tools. Application Stack - Familiarity with .NET, Node.js, React, and web server technologies like Nginx. Relevant certifications or the ability to demonstrate equivalent experience, such as: Terraform Associate About Acorn Insurance …
Employment Type: Permanent
Salary: GBP Annual

Monitoring & Observability Engineer

South East London, London, United Kingdom
COMPUTACENTER (UK) LIMITED
GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater … visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms … with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring & Observability SME within customer delivery teams Support incident response activities and postmortems by identifying patterns, root causes, and optimisation opportunities Work collaboratively with cross-functional teams to define and implement best practices in observability and monitoring Attend customer and …
Employment Type: Permanent

Monitoring & Observability Engineer

London, United Kingdom
Computacenter AG & Co. oHG
Monitoring & Observability Engineer Life on the team At Computacenter, you'll be joining a world-class team of over 1,000 skilled professionals within Group Professional Services (GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and … modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention. What you'll … do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring …
Employment Type: Permanent
Salary: GBP Annual

Monitoring & Observability Engineer

Lakenheath, Suffolk, United Kingdom
Computacenter AG & Co. oHG
Monitoring & Observability Engineer Life on the team At Computacenter, you'll be joining a world-class team of over 1,000 skilled professionals within Group Professional Services (GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and … modern operations. As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention. What you'll … do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows Act as a Monitoring …
Employment Type: Permanent
Salary: GBP Annual

Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
Python (or other language), Bash/Shell, YAML including any Development frameworks Extensive experience and in-depth knowledge of the Linux operating system for effective troubleshooting activities Experience with Observability tools like Grafana, Prometheus, ELK, OCI Observability We highly value ownership and initiative with capabilities to drive projects independently Dealing with changes on a daily basis in a very dynamic …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Stratospherec Ltd
one or more public cloud providers such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and …
Employment Type: Permanent
Salary: £85,000 - £90,000/annum, excellent benefits package

Senior Site Reliability Engineer (SRE) / Unix

London, United Kingdom
Morgan Hunt UK Limited
Objective (RPO) of zero. Conduct DR testing (3 scheduled tests per financial year, potentially outside core hours). Maintain CommVault backup administration (Oracle DB, RHEL, MongoDB). Monitoring & Observability Support logging & observability stacks (InfluxDB, Grafana, Prometheus, Nagios). Enhance monitoring via REST APIs, time-series databases, and full-stack tools (TICK, Elasticsearch, OpenSearch). Promote SLO/SLI measurement …/Perl). Load balancers (HAProxy, Keepalived). Containers & Orchestration: Docker, Kubernetes, OpenShift. Cloud & IaC: AWS (VPC, EC2, S3, NLB). Terraform/CDK for automation. Monitoring & Observability: Prometheus, Grafana, InfluxDB, Nagios. Full-stack admin (Elasticsearch, Fluentd, OpenSearch). Methodologies: Agile (Scrum/Kanban), CI/CD, IaC principles. Risk-aware, customer-focused, proactive problem-solving …
Employment Type: Permanent
Salary: GBP Annual

Senior Platform Developer

Edinburgh, United Kingdom
Hybrid / WFH Options
Registers of Scotland
WAF, CloudFront, API GW, AWS Organizations, S3, ECS, EKS, Route 53, ELBs, OpenShift, Kubernetes, Docker Languages: TypeScript, Python Security & Scanning: AWS Guardrails, Checkov, Prisma Cloud, OSV Scanner, SonarQube, Renovate Observability & Logging: CloudWatch, OpenSearch Operating System Management: RedHat Satellite, AMI lifecycle management, Ubuntu Landscape Testing Tools: Pytest, Jest, Cypress APIs/Microservices: RESTful APIs, API Gateway, containerised services Version Control: GitLab … to as Senior DevOps Engineer. On a typical day you will Design, build, and maintain scalable, high-quality software and platform systems Implement and manage CI/CD pipelines, observability, security automation, automated testing, and engineering standards Lead feature development from concept to production with focus on quality and performance Troubleshoot issues, ensuring resilience, reliability, and minimal user disruption Contribute …
Employment Type: Permanent
Salary: GBP Annual

Senior Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Nordcloud group
to L3 networking Programming languages, such as C#, Python, Perl, Java, C++ CI/CD tools such as Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity Scripting languages such as PowerShell, Bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem …
Employment Type: Permanent
Salary: GBP Annual

Platform Engineer III - Mongo DB

Belfast, United Kingdom
Smarsh, Inc
and their associated data services. Hands-on experience with continuous integration and deployment systems (e.g. Jenkins, Tekton). Practical experience with containerization and orchestration technologies, particularly Kubernetes. Familiarity with observability tools such as Prometheus and Grafana, the ELK stack, or similar managed service. Strong problem-solving skills and attention to detail. Experience in MongoDB, including sharded clusters, replica sets, and …
Employment Type: Permanent
Salary: GBP Annual

Senior DevOps Engineer, Clinical Software

United Kingdom
Waters Corporation
to maintain a CI build environment capable of running automation tests for effective feedback. Assist in designing, developing and implementing automation test frameworks. Develop and improve our monitoring and observability tooling. Coach and mentor team mates to improve their own DevOps skills and experience Research emerging tools, trends and methodologies Assist in managing checked in source code from check-in through to …
Employment Type: Permanent
Salary: GBP Annual

Junior Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
operating system for effective troubleshooting activities Awareness of any cloud infrastructure principles (like AWS, GCP or OCI), understanding basic principles of secure software delivery is a plus Familiar with Observability tools like Grafana or Prometheus, understanding the importance of giving the correct visibility to our platforms and environments We highly value ownership and initiative with capabilities to drive projects independently …
Employment Type: Permanent
Salary: GBP Annual

Cloud Technical Architect / Data DevOps Engineer

Bristol, Gloucestershire, United Kingdom
Hewlett Packard Enterprise Development LP
etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, Filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages The following …
Employment Type: Permanent
Salary: GBP Annual

Manual Tester (DV Security Clearance)

Basingstoke, Hampshire, South East
CGI
Manual Tester (DV Security Clearance) Position Description Are you an experienced Test Analyst with a background in secure or classified programmes, ready to contribute to projects of national importance? Step into a role where you'll challenge the complex to …
Employment Type: Permanent

Site Reliability Engineer

Glasgow, United Kingdom
Morgan Hunt UK Limited
Objective (RPO) of zero. Conduct DR testing (3 scheduled tests per financial year, potentially outside core hours). Maintain CommVault backup administration (Oracle DB, RHEL, MongoDB). Monitoring & Observability Support logging & observability stacks (InfluxDB, Grafana, Prometheus, Nagios). Enhance monitoring via REST APIs, time-series databases, and full-stack tools (TICK, Elasticsearch, OpenSearch). Promote SLO/SLI measurement …/Perl). Load balancers (HAProxy, Keepalived). Containers & Orchestration: Docker, Kubernetes, OpenShift. Cloud & IaC: AWS (VPC, EC2, S3, NLB). Terraform/CDK for automation. Monitoring & Observability: Prometheus, Grafana, InfluxDB, Nagios. Full-stack admin (Elasticsearch, Fluentd, OpenSearch). Methodologies: Agile (Scrum/Kanban), CI/CD, IaC principles. Risk-aware, customer-focused, proactive problem-solving …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

Edinburgh, United Kingdom
Morgan Hunt UK Limited
Objective (RPO) of zero. Conduct DR testing (3 scheduled tests per financial year, potentially outside core hours). Maintain CommVault backup administration (Oracle DB, RHEL, MongoDB). Monitoring & Observability Support logging & observability stacks (InfluxDB, Grafana, Prometheus, Nagios). Enhance monitoring via REST APIs, time-series databases, and full-stack tools (TICK, Elasticsearch, OpenSearch). Promote SLO/SLI measurement …/Perl). Load balancers (HAProxy, Keepalived). Containers & Orchestration: Docker, Kubernetes, OpenShift. Cloud & IaC: AWS (VPC, EC2, S3, NLB). Terraform/CDK for automation. Monitoring & Observability: Prometheus, Grafana, InfluxDB, Nagios. Full-stack admin (Elasticsearch, Fluentd, OpenSearch). Methodologies: Agile (Scrum/Kanban), CI/CD, IaC principles. Risk-aware, customer-focused, proactive problem-solving …
Employment Type: Permanent
Salary: GBP Annual

Lead Site Reliability Engineer - Cloud

Bristol, Avon, England, United Kingdom
Hybrid / WFH Options
Robert Walters
of cloud infrastructure and applications on Google Cloud Platform. You will work collaboratively with engineering and infrastructure teams to implement site reliability engineering (SRE) principles, focusing on system reliability, observability, automation, and operational excellence. This role follows a hybrid working model, requiring attendance at the Bristol office for at least two days per week or 40% of the working time. … objectives (SLOs), indicators (SLIs), and monitoring practices Hands-on experience with infrastructure as code (e.g., Terraform) and CI/CD tools (e.g., Jenkins, Azure DevOps) Desirable Knowledge Familiarity with observability and performance tools such as Dynatrace, Stackdriver, Cloud Monitoring, or similar Exposure to cost monitoring, logging frameworks, and cloud consumption analytics Personal Attributes Ability to mentor and support engineers in …
Employment Type: Full-Time
Salary: £90,000 - £110,000 per annum

Solace Messaging Administrator

London, Clerkenwell, United Kingdom
Eligo Recruitment Ltd
our enterprise messaging infrastructure, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, network optimization, and system observability using industry-standard monitoring tools. Required Skills & Qualifications: 3+ years of experience administering enterprise-grade messaging systems. Strong background in production support, preferably in a 24x7 enterprise environment. Experience working …
Employment Type: Permanent

Solace Messaging Administrator

London, South East, England, United Kingdom
Eligo Recruitment
our enterprise messaging infrastructure, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, network optimization, and system observability using industry-standard monitoring tools. Required Skills & Qualifications: 3+ years of experience administering enterprise-grade messaging systems. Strong background in production support, preferably in a 24x7 enterprise environment. Experience working …
Employment Type: Full-Time
Salary: Competitive salary

Lead Site Reliability Engineer

Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
for someone who has: Strong .NET framework knowledge (C#, ASP.NET Core, etc.) Expertise in Windows Server administration Database administration (SQL Server primarily) Ability to instrument and consume monitoring and observability tools (Application Insights, Prometheus, Grafana) Experience using PowerShell, Azure CLI, and Bash for automation tasks Previous experience with Azure DevOps, Jenkins, GitHub Actions, or similar tools Containerisation and orchestration (Docker …
Employment Type: Full-Time
Salary: Competitive salary

Senior Software Architect - Switzerland

Buchs, St. Gallen, Switzerland
Proactive Global
scaffolding Collaborate with engineering teams, QA, DevOps, and product managers to deliver integrated solutions Mentor engineers in architectural thinking and AI-assisted development Ensure architectural alignment across systems with observability using Prometheus, Grafana, ELK Stack Required Skills & Qualifications: Master's degree in Computer Science, Software Engineering, or related field 8+ years of software engineering experience, with 3+ years in architectural …
Employment Type: Permanent
Salary: £138,118 - £164,016/annum

Cloud Centre of Excellence (CCoE) Engineer

England, United Kingdom
Oldcastle Inc
meet business needs and objectives. Develop a baseline monitoring and tooling concept for cloud to address the need for compliance infrastructure reporting within agile deliveries as part of our Observability strategy. Develop concepts and tools for chargeback and showback (Financial Instrumentation) in a multicloud context. Implement and mature a cloud forecasting and capacity management solution for the enterprise. Collaborate with …
Employment Type: Permanent
Salary: GBP Annual

Cloud Centre of Excellence (CCoE) Engineer

Birmingham, Staffordshire, United Kingdom
Oldcastle Inc
meet business needs and objectives. Develop a baseline monitoring and tooling concept for cloud to address the need for compliance infrastructure reporting within agile deliveries as part of our Observability strategy. Develop concepts and tools for chargeback and showback (Financial Instrumentation) in a multicloud context. Implement and mature a cloud forecasting and capacity management solution for the enterprise. Collaborate with …
Employment Type: Permanent
Salary: GBP Annual

Staff DevOps Engineer Brighton, England, United Kingdom; London, England, United Kingdom - Hybrid

United Kingdom
Hybrid / WFH Options
Cision Global
experience. • Implement and evolve CI/CD pipelines, deployment strategies, and GitOps workflows. • Work closely with software engineers to embed infrastructure and operational thinking early in the SDLC. • Champion observability, reliability, and performance through metrics, logging, and alerting best practices. • Lead incident response, postmortems, and ongoing resilience improvements. • Contribute to a 24/7 on-call rotation for critical systems. Minimum … platforms (AWS, GCP) • Solid grasp of networking, Linux internals, and security best practices. • Deep understanding of CI/CD tools and practices (GitHub Actions, Jenkins, ArgoCD, etc.). • Strong observability mindset, with experience of tools like Prometheus, Grafana, Loki, etc. • Experience with hybrid service meshes, multi-cluster Kubernetes, or edge computing, preferred. • Knowledge of Kafka, Redis, Elasticsearch, or RDBMS (MySQL/ …
Employment Type: Permanent
Salary: GBP Annual

Observability
10th Percentile: £54,875
25th Percentile: £65,000
Median: £80,000
75th Percentile: £97,500
90th Percentile: £120,000