Observability Jobs in the UK

951 to 975 of 2,210 Observability Jobs in the UK

Support Engineer

London, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
code, networking and databases with enough knowledge to be able to fault find and identify the root cause. Core Responsibilities involved: Incident management Application design and development Site reliability (observability, alerting, high-availability, self-healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Engineer

Greater London, England, United Kingdom
Fit Collective
as the bridge between product, ML, and analytics, ensuring their needs are met through scalable, transparent, and well-documented data models. Governance & Monitoring: Implement dbt tests, freshness checks, and observability practices to guarantee trust in downstream analytics and ML applications for governance and audits. What You Bring: Experience : 5+ years in data engineering or backend development with a focus on More ❯
Posted:

Site Reliability Engineer

London, United Kingdom
Duffel
us to silently drop spans. - An enthusiasm for both software development and systems engineering. - A high bar for code and configuration quality and readability. - A good understanding of current observability and reliability practices. - Experienced and comfortable in running incident response. - Big picture thinking - you can make trade offs on technical work streams against business impact. - Fantastic communication skills. You're More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Digital Transformation Director

London Area, United Kingdom
SoftServe
digital trends, challenges, solutions, market dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are essential for guiding teams to More ❯
Posted:

Digital Transformation Director

City of London, London, United Kingdom
SoftServe
digital trends, challenges, solutions, market dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are essential for guiding teams to More ❯
Posted:

Senior Software Engineer (Frontend)

London, England, United Kingdom
Raft
error handling, and data transformation on the client side. Proficient in debugging and optimizing performance in complex single-page applications (SPAs), with strong skills in testing frameworks and frontend observability tools. Familiar with modern CI/CD workflows, and has experience with containerization tools like Docker in a frontend deployment context (e.g., local dev environments, static site hosting). Understands More ❯
Posted:

Digital Transformation Lead

City of London, London, United Kingdom
SoftServe
digital trends, challenges, solutions, market dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are essential for guiding teams to More ❯
Posted:

Senior TypeScript Back-End Engineer (£80k - £100k + Bonus)

London, England, United Kingdom
Practi
JavaScript Strong understanding of cloud-based application architecture and stack, preferably including AWS Good understanding of Docker and experience with CI/CD tooling Good understanding of security and observability best practices and tooling What else about you would excite us? Experience building and maintaining high-traffic server-side web applications Experience with infrastructure-as-code tools such as Terraform More ❯
Posted:

Search - Search Inference - Senior MLOps Engineer

London, United Kingdom
Elasticsearch B.V
results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is The Role The Search Inference team is responsible for bringing performant, ergonomic, and cost effective machine learning (ML) model inference More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Manager, Site Reliability Engineering Paris, France

London, England, United Kingdom
Hybrid / WFH Options
Algolia
in their growth and development You know how to deploy an application from laptop to production, are able to fully automate it, and you are comfortable with Production requirements (Observability, Alerting, ...) You are knowledgeable in DevOps principles and CI/CD pipelines You are knowledgeable in Configuration Management and Infrastructure as Code such as Chef and Terraform You are More ❯
Posted:

Senior Software Engineer I

London, United Kingdom
Quality Control Specialist - Pest Control
ramp up in highly technical, ambiguous domains. Strong knowledge of REST APIs , distributed system design, and performance optimization. Experience with both SQL and NoSQL data stores , caching layers, and observability tooling (e.g., Prometheus, Datadog). Nice to have: Experience deploying or integrating LLMs or NLP models in production systems. Comfortable balancing short-term execution with long-term architectural thinking . More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Data Scientist

London, England, United Kingdom
Hybrid / WFH Options
Ravelin Technology
as Data Engineering and Product, to build a more effective and cohesive ML ecosystem Deep expertise in data science and engineering best practices (version control, CI/CD, testing, observability) and a history of applying them to build robust, scalable machine learning systems Exceptional analytical and problem-solving skills, with a demonstrated ability to define and solve highly ambiguous, complex More ❯
Posted:

Senior Software Engineer (Robotics & Perception)

Oxford, England, United Kingdom
Hybrid / WFH Options
Oxa
and messaging systems (E.g ROS) Containerisation (E.g. docker) CI/CD experience CUDA, Triton Ability to code in Python Experience of cloud technologies (e.g. AWS, GCP, Azure) Experience with observability platforms such as Grafana The Candidate Journey: Multi-Step and Two-Way No-one wants to feel like a square peg in a round hole, so this process is designed More ❯
Posted:

Digital Transformation Director

London, England, United Kingdom
SoftServe
digital trends, challenges, solutions, market dynamics, competition, and peer group activities. Understanding and ability to articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps), and operations (e.g., observability, automated response, SRE etc.), and articulate a path toward a target operating model (people, process, and tools). Required Skills Leadership: Strong leadership skills are essential for guiding teams to More ❯
Posted:

Senior Principal Network Architect

London, England, United Kingdom
Equinix
networking technologies and ecosystems, such as Routing Daemons (FRR, Bird, GoBGP), Linux Networking (eBPF, VPP, XDP), and SONiC, or other Linux-based open Network Operating Systems Involvement with modern observability platforms (Prometheus/PromQL, Grafana, gNMI, etc) Experience with network flow export (Netflow, IPFIX, sFlow) and analysis Solid understanding of full networking stack (routing, switching and optical networking), including key More ❯
Posted:

Apache Kafka Engineer

London, England, United Kingdom
Stott and May
Apache Kafka Engineer My client is looking for an Senior Apache Kafka Engineer to lead the design, development, and management of our enterprise event streaming platform. This role requires deep Kafka expertise, strong system design skills, and hands-on experience More ❯
Posted:

AWS Cloud Engineer

London, England, United Kingdom
Hybrid / WFH Options
TryHackMe Ltd
TryHackMe is the fastest-growing online cyber security training platform. Our mission is to make learning and teaching cyber security easier by providing gamified security exercises and challenges. Having only been around for handful of years, we've grown to More ❯
Posted:

DevOps Engineer, Research Platform

London, England, United Kingdom
Hybrid / WFH Options
Mistral AI
About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized More ❯
Posted:

Cloud Engineer

United Kingdom
developrec
AWS Cloud Engineer We’re seeking a Cloud Engineer to own and scale our AWS-based infrastructure, powering a platform used by millions of cybersecurity individuals. You’ll ensure performance, security, scalability, and cost-efficiency, while enabling fast, reliable deployments More ❯
Posted:

Lead Software Engineer- Java and Observability

Glasgow, Scotland, United Kingdom
ZipRecruiter
secure, high-quality production code; review and debug code written by others. Drive decisions influencing product design, application functionality, and technical operations. Serve as a subject matter expert on Observability, Alerting, and Event Processing in specific focus areas. Contribute to the engineering community by advocating firmwide frameworks, tools, and SDLC practices. Influence peers and decision-makers to adopt leading-edge … software applications and technical processes, including cloud, AI, machine learning, mobile, etc. Practical cloud experience. Educational background in Computer Science, Engineering, Mathematics, or related fields. Additional Skills Experience with observability tools and interest in AIOps (e.g., Splunk, Grafana, Dynatrace). Knowledge of automation and scripting (e.g., Python). Experience with IBM Netcool/OMNIbus and related products. About Us J.P. More ❯
Posted:

Observability Engineering Manager

Glasgow, Scotland, United Kingdom
Hybrid / WFH Options
Canonical
four times yearly in person, in interesting locations around the world, to align on strategy and execution. The company is founder-led, profitable, and growing. We are hiring an Observability Engineering Manager who will lead the development of the distributed tracing or service mesh products as part of our Observability group. Engineering managers at Canonical are always coders who are … review and lead both architecture and code. They should also be astute judges of character, and comfortable setting expectations and holding colleagues accountable to them. We are building an observability stack that is easy to deploy and operate on Kubernetes. This is part of a broader initiative to deliver the world's best suite of open source tools, where we … provide deep integration and automation for best-of-breed open source offerings that cover metrics, logging, telemetry, alerting, tracing and profiling. Our goal is to make observability tools integral and pervasive across software practices . Our implementation of Kubernetes operators is opinionated, resilient, and scalable, providing deep insights out of the box. The user experience is polished and seamless for More ❯
Posted:

Site Reliability Engineer (Stoke)

Stoke-on-Trent, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
Job Description Who we are looking for A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using … practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your … of Site Reliability Engineering principles, including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and Pager Duty. Excellent knowledge of programming including Python, Golang and JavaScript. Knowledge and experience of modern software development techniques More ❯
Posted:

Linux System Engineer

London, England, United Kingdom
JR United Kingdom
Administer GitLab infrastructure for CI/CD processes. Operate and maintain Kafka clusters for real-time data pipelines. Diagnose and resolve issues across systems, networks, containers, and applications. Use observability tools (Grafana, Prometheus, Kibana, Elasticsearch) to monitor system health. Automate system management tasks using Ansible. Participate in an on-call rotation to support global operations. Required Skills & Experience: Strong hands … Rocky) administration, including kernel tuning and system optimization. Proficiency with GitLab for version control and CI/CD workflows. Solid understanding of Kafka in high-throughput environments. Experience with observability tools such as Grafana, Prometheus, Kibana, and Elasticsearch. Expertise in Ansible for automation and configuration management. Strong problem-solving skills across infrastructure layers (compute, network, OS, containers). #J More ❯
Posted:

Linux System Engineer

City of London, London, United Kingdom
NineTech
Administer GitLab infrastructure for CI/CD processes. Operate and maintain Kafka clusters for real-time data pipelines. Diagnose and resolve issues across systems, networks, containers, and applications. Use observability tools (Grafana, Prometheus, Kibana, Elasticsearch) to monitor system health. Automate system management tasks using Ansible. Participate in an on-call rotation to support global operations. Required Skills & Experience: Strong hands … system optimization. Production-level experience managing Kubernetes clusters. Proficiency with GitLab for version control and CI/CD workflows. Solid understanding of Kafka in high-throughput environments. Experience with observability tools such as Grafana, Prometheus, Kibana, and Elasticsearch. Expertise in Ansible for automation and configuration management. Strong problem-solving skills across infrastructure layers (compute, network, OS, containers). More ❯
Posted:

Linux System Engineer

London Area, United Kingdom
NineTech
Administer GitLab infrastructure for CI/CD processes. Operate and maintain Kafka clusters for real-time data pipelines. Diagnose and resolve issues across systems, networks, containers, and applications. Use observability tools (Grafana, Prometheus, Kibana, Elasticsearch) to monitor system health. Automate system management tasks using Ansible. Participate in an on-call rotation to support global operations. Required Skills & Experience: Strong hands … system optimization. Production-level experience managing Kubernetes clusters. Proficiency with GitLab for version control and CI/CD workflows. Solid understanding of Kafka in high-throughput environments. Experience with observability tools such as Grafana, Prometheus, Kibana, and Elasticsearch. Expertise in Ansible for automation and configuration management. Strong problem-solving skills across infrastructure layers (compute, network, OS, containers). More ❯
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£65,000
Median
£80,000
75th Percentile
£97,500
90th Percentile
£120,000