Observability Jobs in the UK excluding London

76 to 100 of 1,210 Observability Jobs in the UK excluding London

Messaging Administrator - Solace

South East London, England, United Kingdom
Marlin Selection Recruitment
For: 3+ years’ hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well across technical teams Bonus More ❯
Posted:

Site Reliability Engineer

Edinburgh, Scotland, United Kingdom
Hybrid / WFH Options
JR United Kingdom
ideally with Terraform or CloudFormation. Hands-on experience with CI/CD pipelines and automation tooling. Background in containerisation and orchestration – e.g., Docker, Kubernetes. Familiarity with monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, CloudWatch). Proven ability to troubleshoot and resolve complex infrastructure issues. Experience working in cross-functional engineering teams, ideally in a DevOps or SRE capacity. Strong More ❯
Posted:

Remote Senior Site Reliability Engineer Manager (Remote)

Cambourne, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Remotestar
strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer

Cheltenham, England, United Kingdom
Mane Contract Services
Ansible, Jenkins). Experience working within secure or regulated environments (e.g. Defence, Government, Critical National Infrastructure). Familiarity with cloud platforms such as AWS, Azure, or OpenStack. Experience with observability tooling (e.g. Prometheus, Grafana, ELK stack). Exposure to infrastructure security principles and compliance frameworks. What’s in It for You: Salary from £80,000+ depending on experience Work alongside More ❯
Posted:

DevOps Engineer

Macclesfield, England, United Kingdom
Revolent Group
a minimum of two years working with us post training Nice to have: Domain knowledge: Banking, Financial Services, Lending (Very nice to have – understanding the wholesale lending lifecycle) Monitoring & Observability: Experience in managing Tools like APPD, ELK stack, Grafana Security Practices: DevSecOps principles, vulnerability scanning, compliance automation, Certificate/vault/user role management. Strong attention to detail a passion More ❯
Posted:

Lead Site Reliability Engineer

Crawley, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
VPC, etc.). Strong skills in automation and configuration management-especially Ansible (Terraform experience a plus). Solid grasp of SRE and DevOps principles including CI/CD, GitOps, observability, and infrastructure as code. A strategic mindset with excellent communication and stakeholder engagement skills. Bonus Skills (Desirable but not essential): Exposure to Microsoft Azure. Kubernetes and container orchestration. Knowledge of More ❯
Posted:

Software Engineer

Cheltenham, England, United Kingdom
Hybrid / WFH Options
Argo DevOps Solutions Ltd
BDD approaches (e.g., Cucumber, Gherkin) for test automation Containerisation & Microservices Container Technologies: Practical understanding of Docker or equivalent solutions Microservice Patterns: Experience architecting microservice-based systems with built-in observability and security Cloud Services & Environments Cloud Providers: Demonstrable experience with AWS or Azure Security & Configuration: Ability to build, configure, and secure cloud environments effectively Security & CI/CD Security Integration More ❯
Posted:

AWS Engineer

Manchester, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Engineer

Manchester, Lancashire, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps UtilisingCI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks A More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Observability Engineer

Belfast, United Kingdom
Hybrid / WFH Options
Thales Group
working, or the ability to flex your start and finish times. Where possible, we support a working pattern that suits your lifestyle and helps you reach your ambitions. Title: Observability Engineer Base location: Belfast/Remote UK About the company: Imperva, a Thales company, is an analyst-recognized cybersecurity leader-championing the fight to secure data and applications wherever they … pops and core infrastructure with new modern technologies, embracing Infrastructure as code at all levels with automation as a core requirement for all projects. We are looking for an Observability Engineer to work within our SRE teams to design, build and iterate on our O11Y platform. This engineer will have to work both hands on and strategically with our architects … global service delivery and product teams to plan an observability road map and then execute on those plans. Responsibilities: Assess & Enhance Observability: Review the current observability platform, identify areas for improvement, and guide the team in enhancing monitoring, logging, tracing, and alerting capabilities. Design & Implement Solutions: Develop and optimize observability solutions that provide deep insights into system and service health. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Hereford, Herefordshire, West Midlands, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Work Scheme Key Responsibilities of the Site Reliability Engineer: Partner with developers to improve performance and reliability across systems Automate toil and reduce unnecessary alerts with smart tooling Evolve observability so we can prevent issues before they become incidents Improve CI/CD pipelines and support development teams in delivering quality faster Explore new technologies, tools, and services that improve … plus) Experience with Terraform and modern IaC practices Hands-on with Docker and orchestration tools (Kubernetes, OpenShift, or Docker Swarm) CI/CD experience (Jenkins or equivalent) Monitoring/observability tools: Grafana , Prometheus , or InfluxDB Event-driven messaging: RabbitMQ or similar Strong Linux skills, scripting, and understanding of network security protocols Experience with AWS: EC2, S3, RDS, Lambda Desirable: Experience … coding in Python, Java, or Go Exposure to cross-domain solutions Experience in a service management environment Observability best practices and metric-driven reliability improvement Security Requirements Due to the sensitive nature of our work, candidates must be eligible for Developed Vetting (DV) clearance. All offers are subject to security screening. Ready to Engineer Systems That Matter? If youre a More ❯
Employment Type: Permanent, Work From Home
Posted:

Site Reliability Engineer

Chesterfield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
automation and internal tools for deployment, monitoring, and incident response Tune performance across OS, network, and cloud layers — this role is hands-on and detail-oriented Improve system resilience, observability, and security in a high-stakes production environment Requirements: Fluent in Linux — not just using it, but understanding how it works under the hood Advanced terminal skills — manipulating systems efficiently … time environments Hands-on with Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at the lowest levels More ❯
Posted:

Senior Site Reliability Engineer

Manchester, United Kingdom
Hybrid / WFH Options
Embarcaderomediagroup
ll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and more cost-efficiently. What you … ll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through platform tools, reusable Terraform modules, and self-service infrastructure Enhancing CI/CD pipelines (Azure DevOps, YAML-based) with security … knowledge (AKS, Functions, SQL, Cosmos DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices - including security scanning, IAM, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Sheffield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
automation and internal tools for deployment, monitoring, and incident response Tune performance across OS, network, and cloud layers — this role is hands-on and detail-oriented Improve system resilience, observability, and security in a high-stakes production environment Requirements: Fluent in Linux — not just using it, but understanding how it works under the hood Advanced terminal skills — manipulating systems efficiently … time environments Hands-on with Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at the lowest levels More ❯
Posted:

Platform Engineer

Crewe, Cheshire, United Kingdom
Hybrid / WFH Options
Manchester Digital
platform security, reliability, and performance across systems deployed in Canada, the UK, and AWS cloud environments Contribute to key projects, platform optimizations, and ongoing maintenance initiatives Help drive scalability, observability, and operational excellence If you're passionate about infrastructure, cloud, and systems engineering-and want to help shape the future of mobility-we want to hear from you! Requirements We … configurations (Azure AD , Ory, Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, New Relic - Able to troubleshoot/work under pressure, meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/… understanding of Infrastructure as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., New Relic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: -Experience with Linux system administration and troubleshooting. -Basic knowledge of AWS cloud technologies such as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Python Developer

Northern Ireland, United Kingdom
Hybrid / WFH Options
Ocho
cross-functional teams to design and deliver full-featured software components • Drive a “security-first” mindset across development practices, including OAuth2 and IAM policies • Lead operational efforts using modern observability frameworks to monitor and debug production systems • Mentor junior engineers and contribute to a culture of continuous improvement Essential Criteria: • Strong commercial experience in Golang and Python • Proven track record … secure application design principles • Hands-on experience designing and consuming RESTful and GraphQL APIs • Strong SQL skills and familiarity with data warehouses like Snowflake • Day-2 operations experience including observability, debugging, and triage Desirable Skills: • Experience with Auth0 , AWS Cognito , or similar identity platforms • Familiarity with Helm , Prometheus , Grafana , or OpenTelemetry • Exposure to other cloud platforms (GCP, Azure) • CI/ More ❯
Posted:

Lead Machine Learning Engineer (Agentic Infrastructure)

Slough, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
with the founding team to integrate models into internal and external user flows Write clean, production-ready code - often improving or refactoring existing prototypes Think holistically about agent lifecycle , observability, failure handling, and scalability Help define the tech stack and architecture for core components of the platform Contribute to novel research and publish at top conferences when opportunities arise What …/LLM libraries (e.g., Transformers, LangChain, LangGraph, OpenAI APIs) Experience with cloud platforms (AWS, GCP, or Azure), deployment, and CI/CD pipelines Familiarity with containerization (Docker, Kubernetes) and observability (e.g., Prometheus, Grafana) A builder mindset: you're comfortable with ambiguous specs, early-stage infrastructure, and iterating fast Excellent communication and self-management skills Nice To Have Familiarity with agentic More ❯
Posted:

Engineering Acceleration Engineer

Wantage, England, United Kingdom
Motorsport Network
and other people in the business who develop software Offer best-practice recommendations on IDEs and developer tooling, build systems, package management and CI/CD systems, monitoring and observability Implement and maintain standard templates, automations and infrastructure that support the development process at Atlassian Williams Racing Adopt or create shared libraries/components that benefit multiple Software Engineering teams … testing in languages such as C#, Go, Java, C++, Python, Typescript Containerization, DevOps, and Cloud Platforms such as Azure or AWS K8s provisioning, configuration and operation Logging, monitoring, and observability tooling CI/CD best practices, Release Engineering Git best practices Cloud-native migration or adoption projects Building developer-facing platforms and tooling Strong desire to build impactful solutions for More ❯
Posted:

Senior Site Reliability Engineer

Bath, England, United Kingdom
JR United Kingdom
SLOs, SLIs, and SLAs, with experience in monitoring, alerting, and logging. Familiarity with Infrastructure as Code (Terraform) and CI/CD pipelines (Jenkins, Azure DevOps, etc.). Experience with observability tools like Dynatrace, Stackdriver, Cloud Operations Suite, Cloud Monitoring, and Cloud Logging. Ability to mentor engineers, troubleshoot complex system issues, and improve automation to reduce manual effort. Up to £106k More ❯
Posted:

Loan IQ DevOps Engineer

Manchester Area, United Kingdom
Hybrid / WFH Options
Revolent Group
related processes like data migrations and environment setup. ✅ Preferred (Nice to Have): Banking/Financial Services knowledge — especially around wholesale lending and Loan IQ . Experience with monitoring and observability tools such as APPD, ELK Stack, or Grafana. Understanding of DevSecOps principles , including vulnerability scanning, secrets management, and compliance automation. Further experience with CI/CD integration and pipeline automation More ❯
Posted:

DevOps Engineer

Glasgow, Scotland, United Kingdom
Hybrid / WFH Options
iO Associates - UK/EU
applications into various environments. Objectives of this role Develop and manage CI/CD pipelines to automate deployment processes. Monitor system performance and troubleshoot issues using CloudWatch and other observability tools. Manage and optimise Kafka clusters for real-time data streaming. Oversee and maintain containerized workloads using EKS (Kubernetes on AWS) . Support data infrastructure, including Amazon Redshift for analytics More ❯
Posted:

Azure Site Reliability Engineer III

Glasgow, Scotland, United Kingdom
ZipRecruiter
in Azure. Proficiency with containerization and orchestration tools like Docker, Kubernetes, AKS, and Helm. Programming skills in Python, Java, PowerShell, or Go, with understanding of REST APIs. Experience with observability tools such as DataDog, Prometheus, Splunk, Elasticsearch, Grafana, Azure Monitor. Experience with CI/CD tools like Git, Terraform, Jenkins. Azure cloud expertise in mission-critical environments. Additional qualifications Azure More ❯
Posted:

Platform Engineer - DevOps Specialist

Knutsford, Cheshire, United Kingdom
Square One Resources
this role, you will assist in upgrading the Elastic DP estate to Kubernetes, moving away from obsolete technology (Cloudera), upgrading to RHEL 8, and contributing to improving stability and observability of the platform. You will provide advanced analytics tooling and services for modeling analytics, working across continuous integration, development, build, and deployment using automation and cloud technologies to support the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Bristol, Gloucestershire, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual
Posted:

Site Reliability Engineer

BS1, Bristol, City of Bristol, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum Hybrid, Great Benefits
Posted:
Observability
the UK excluding London
10th Percentile
£49,563
25th Percentile
£61,563
Median
£74,500
75th Percentile
£85,000
90th Percentile
£98,500