351 to 375 of 419 Observability Jobs in the UK excluding London

Site Reliability Engineer

Hiring Organisation: Connells Limited
Location: Milton Keynes, Buckinghamshire, UK
Employment Type: Full-time

Job Description: We are seeking an experienced Site Reliability Engineer (SRE) to join our Group Technology Team in Milton Keynes. ConnellsX is Connells Group Technology's internal developer platform, built on Microsoft Azure. It simplifies cloud hosting ...

Field CTO EMEA

Hiring Organisation: Jobleads-UK
Location: Maidenhead, England, United Kingdom

Engineering, platform teams, and business stakeholders. Translate customer business goals into compelling transformation strategies powered by Dynatrace. Lead high-impact technical discovery and executive conversations around observability, cloud modernization, AI adoption, security, automation, and business outcomes. Shape account strategy with Sales and Solution Engineering teams for complex, multi-stakeholder deals. Develop board-level … executive-level narratives that connect platform capabilities to risk reduction, operational excellence, digital experience, and growth. Guide customers on modern observability and security operating models, including platform engineering, SRE, DevSecOps, and AI-assisted operations. Support large opportunities by validating architecture direction, differentiation, value realization, and long-term platform vision. Influence go-to-market ...

Observability Engineering Manager

Hiring Organisation: Jobleads-UK
Location: Douglas, Northern Ireland, United Kingdom

interesting locations around the world, to align on strategy and execution. The company is founder‐led, profitable, and growing. We are hiring an Observability Engineering Manager who will lead the development of the distributed tracing or service mesh products as part of our Observability group. Engineering managers at Canonical … review and lead both architecture and code. They are astute judges of character, set expectations, and hold colleagues accountable. We are building an observability stack that is easy to deploy and operate on Kubernetes. This is part of a broader initiative to deliver the world's best suite of open ...

SRE - Site Reliability Engineer - Observability & Performance

Hiring Organisation: Sanderson Recruitment
Location: Bristol, Somerset, United Kingdom
Employment Type: Contract
Contract Rate: GBP 550 - 600 Daily

Observability and Performance. Up to £600 per day, outside IR35. 6-month initial contract. Bristol - largely remote. I'm currently working with a client who is looking for an SRE to implement and enhance observability across Java applications, middleware and Linux infrastructure using Grafana ...

GCP DevOps

Hiring Organisation: Pracyva ltd
Location: Bristol, City of Bristol, United Kingdom
Employment Type: Contract
Contract Rate: £400 - £425/day

Actions, Harness, Jenkins). Networking & Security: Experience with GCP Cloud Armor, GCP Networking, and embedding secure-by-design controls from design to runtime. Automation & Observability: Implementing actionable observability, performance tuning, and automation to reduce toil. Defining and operating against SLOs/SLIs. Scripting & Tooling: Scripting in Bash, PowerShell, or Python. ...

DevOps Engineer

Hiring Organisation: Fruition Group
Location: Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type: Contract

Contract: Inside IR35. We're seeking an experienced Senior DevOps Engineer to join a small, highly skilled engineering team delivering a large-scale enterprise observability platform as they move away from Splunk. This is an opportunity to work on a critical cloud platform supporting the migration of numerous services onto … modern monitoring and logging solution. What you'll be doing: * Support and enhance a large-scale observability platform. * Help engineering teams onboard and migrate their services. * Build and maintain dashboards, log pipelines and alerting. * Develop and manage cloud infrastructure using Terraform across Azure and AWS. * Produce technical documentation and operational ...

Remote Principal Software Engineer, Docker Agents (London)

Hiring Organisation: Docker
Location: Stevenage, Hertfordshire, UK
Employment Type: Full-time

Testing: Define evaluation frameworks to measure agent quality, reliability, and production readiness, plus the deployment effectiveness of containerized runtimes. Reliability & Operability: Establish standards for observability, performance, and operational excellence; lead critical production decision-making and incident learnings as needed. Rapid Prototyping: Iterate quickly on new agent capabilities and deployment patterns … Deep understanding of Docker, containerization best practices, and container orchestration. Cloud/Platform Depth: Experience building and operating platform services with strong foundations in observability, CI/CD, and security principles. Operational Excellence: Experience operating and evolving high-availability production systems with a focus on reliability and performance. Influence & Communication ...

Senior Python Backend Engineer Fully Remote, UK

Hiring Organisation: Interact Consulting Limited
Location: South West London, London, United Kingdom
Employment Type: Permanent, Work From Home

APNs/FCM), user notification preferences, audience segmentation, and delivery tracking. Integrate with third-party data providers and external services, ensuring robust failure handling, observability, and system resilience. Design and support secure internal tooling APIs, including role-based access controls, audit trails, change history, and safe administrative workflows. Build … shape technical direction and the expectation to take real ownership of what you build. Scope: backend services, infra, event-driven systems, CI/CD, observability, all built for live-event traffic. Python-first, Postgres, Redis. You'll own your services fully: building them and keeping them running in production. Heavily ...

Technical Lead Edge Platform

Hiring Organisation: VoCoVo
Location: Oxfordshire, United Kingdom
Employment Type: Full Time
Salary: 80000 to 85000 GBP Annually

MicroK8s). Experience with image build tooling and immutable OS concepts, familiarity with tools such as Kairos, OSTree is highly desirable. Practical exposure to observability at scale, including metrics, logging, alerting (Prometheus, Grafana, Loki) and hands-on experience with OpenTelemetry. Experience operating or building infrastructure to manage, monitor and update … implement secure, reliable over-the-air (OTA) update mechanisms for OS and workload delivery at scale. Take ownership of the edge platform's observability, reliability and security, including driving adoption of OpenTelemetry across the edge estate. Contribute to the technical roadmap, researching new approaches and producing demonstrations and proofs ...

Principal Observability & Cloud Platform Engineer

Hiring Organisation: 17918
Location: Cambridge, Cambridgeshire, United Kingdom

Principal Observability & Cloud Platform Engineer Most observability engineers run someone else's stack. This role is for the person who builds it. Our client is re-architecting observability and cloud infrastructure at a scale very few engineers ever touch: a 3,000-node Kubernetes estate, 50TB of logs ...

Network Monitoring & Observability Architect

Hiring Organisation: Pontoon
Location: Chester, Cheshire, United Kingdom
Employment Type: Contract

Join Our Team as a Network Monitoring & Observability Architect! Contract Length: 12 months. Location: Chester. Working Pattern: 3 days per week in the office, via Umbrella Company. Are you ready to take your skills to the next level? We're looking for a talented Monitoring Architect to join our dynamic ...

Senior Infrastructure Engineer — Cloud, IaC & Observability

Hiring Organisation: Jobleads-UK
Location: Tipton, England, United Kingdom

through strong infrastructure practices. The ideal candidate has over 7 years of experience in platform and DevOps roles, strong skills in IaC, networking, and observability, and a passion for AI safety. ...

Remote Senior GenAI Platform Engineer

Hiring Organisation: Pleo
Location: Bournemouth, Dorset, UK
Employment Type: Full-time

engineering. You will help design, build, and operate the shared AI infrastructure used by product teams across Pleo, with a strong focus on reliability, observability, security, and developer experience. Who you'll be working with and reporting to: You'll be reporting to the Engineering Manager for the GenAI Platform … GenAI platform components used by product teams at Pleo, including LLM routing gateway, vector search and RAG infrastructure, tool registry and MCP gateway, AI observability and evaluation tooling (tracing LLM calls, supporting human and automated evaluation, detecting drift, and tracking costs) and infrastructure for multi-step, long-running agentic workflows. ...

Site Reliability Engineer

Hiring Organisation: Lorien
Location: Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type: Contractor
Contract Rate: Salary negotiable

production incidents, taking ownership through to resolution. Focus on incident response, service restoration and operational excellence (approximately 70% of the role). Improve system observability, monitoring and alerting capabilities. Work closely with development teams to enhance the reliability and operability of applications. Analyse production issues and identify opportunities for automation … Production Engineering or a similar operational engineering role. Strong hands-on experience supporting live production environments. Excellent troubleshooting and incident management skills. Experience with observability and monitoring platforms, including: Grafana, OpenTelemetry, Splunk. Good understanding of cloud platforms (AWS experience preferred). Strong knowledge of APIs and API troubleshooting. Experience ...

Senior Developer - Perm - Birmingham

Hiring Organisation: INFUSED SOLUTIONS LIMITED
Location: Birmingham, West Midlands, United Kingdom
Employment Type: Permanent
Salary: £80,000

recurring technical problems and implementing long-term solutions. Improving platform reliability, resilience, and overall product quality. Performing application profiling, performance tuning, and optimisation. Enhancing observability, monitoring, alerting, and diagnostic capabilities. Working with engineering teams to improve development practices and technical standards. Reducing technical debt and identifying opportunities for platform improvement. … Strong communication skills and the ability to collaborate effectively across engineering teams. Desirable: Experience working on SaaS platforms or cloud-based applications. Exposure to observability and monitoring tools. Experience with performance profiling and optimisation techniques. Knowledge of scalability, resilience, and reliability engineering principles. Familiarity with CI/CD pipelines ...

Senior Full Stack Developer - Birmingham - Perm

Hiring Organisation: INFUSED SOLUTIONS LIMITED
Location: Birmingham, West Midlands, United Kingdom
Employment Type: Permanent
Salary: £80,000

DevSecOps Capability Manager

Hiring Organisation: WRK DIGITAL LTD
Location: Skipton, North Yorkshire, Yorkshire, United Kingdom
Employment Type: Permanent

improvement. Strategy, Governance & Technical Direction: Set DevSecOps strategy across pipelines and security automation. Establish governance for CI/CD, IaC, and cloud delivery. Define observability standards (SLOs, tracing, dashboards). Embed security into pipelines (SAST, SCA, DAST, secrets, IaC scanning). Govern "Golden Path" templates and adoption. Operational Oversight & Risk Management: Oversee …/CD, DevSecOps, and security integration. Strong cloud, containerisation, and IaC knowledge. Proven ability to improve DORA and engineering performance metrics. Experience with observability and monitoring frameworks. Strong background in security tooling (SAST, SCA, DAST, scanning tools). Solid understanding of cloud security, IAM, and zero-trust principles. Experience working ...

DevSecOps Capability Manager

Hiring Organisation: WRK DIGITAL LTD
Location: Humber, Devon, UK
Employment Type: Full-time

Direction. Set DevSecOps strategy across pipelines and security automation. Establish governance for CI/CD, IaC, and cloud delivery. Define observability standards (SLOs, tracing, dashboards). Embed security into pipelines (SAST, SCA, DAST, secrets, IaC scanning). Govern "Golden Path" templates and adoption. Operational … DevSecOps, and security integration. Strong cloud, containerisation, and IaC knowledge. Proven ability to improve DORA and engineering performance metrics. Experience with observability and monitoring frameworks. Strong background in security tooling (SAST, SCA, DAST, scanning tools). Solid understanding of cloud security, IAM, and zero-trust principles ...

Digital Senior Full Stack Engineer

Hiring Organisation: Leeds Building Society
Location: Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £75,000

services. You'll lead complex technical delivery, champion modern engineering practices and help shape high-quality solutions through clean architecture, automation, CI/CD, observability and secure-by-default development. Just as importantly, you'll coach and mentor other engineers, raise standards across the squad and define ways of working. … leading code/design reviews; uplifting test automation and quality gates. Ability to influence stakeholders across Product, Architecture, InfoSec, Risk and Operations; governance experience. Observability experience: metrics, logs, traces; operational ownership of services. Experience of supporting UI/UX Design would be beneficial. And in return ...

Data Platform Engineer

Hiring Organisation: Connells Limited
Location: Milton Keynes, Buckinghamshire, UK
Employment Type: Full-time

Job Description: We are seeking an Azure Platform Engineer to join our Group Technology team in Milton Keynes on a 6-month contract basis. You will play a key role in delivering the Connells Group ...

Senior Automation Engineer

Hiring Organisation: Raytheon
Location: Glenrothes, Fife, Scotland, United Kingdom
Employment Type: Permanent, Work From Home

commissioning of robotic cells and assembly systems; perform First Article Inspection (FAI) and ensure compliance with safety standards i.e. ISO 9001 or AS9100. Observability & Support: Maintain platform observability and respond to incidents through Root Cause Analysis (RCA) to improve service efficiency. System Integration: Designing and implementing interfaces between MES (e.g. ...

Operations Engineer

Hiring Organisation: Ascent Resourcing Limited
Location: Birmingham, West Midlands, England, United Kingdom
Employment Type: Full-Time
Salary: £55,000 - £60,000 per annum

continuity. Key Responsibilities: Provide operational support for enterprise platforms, applications, integrations, and associated technologies. Monitor system health, availability, and performance using monitoring, alerting, and observability tools. Analyse, troubleshoot, and resolve incidents affecting services and platforms. Perform root cause analysis and contribute to implementing permanent solutions to prevent recurring issues. Coordinate … within IT operations, support engineering, or service management environments. Experience supporting business-critical production services and operational platforms. Knowledge of monitoring, logging, alerting, and observability practices. Experience working with incident, problem, change, and release management processes. Excellent communication skills with the ability to collaborate effectively across multiple technical and business ...

AI Engineer (Infrastructure SRE & Automation)

Hiring Organisation: Sky
Location: West Lothian, Central Scotland, United Kingdom

repetitive operational workflows across compute, storage, and networking layers. Integrate AI capabilities into CI/CD pipelines and infrastructure-as-code ecosystems. Data Engineering & Observability: Build and maintain pipelines for high-volume telemetry data (metrics, logs, traces). Ensure data quality, labelling, and feature engineering for ML models. Leverage observability ...

Senior SRE: Reliability & Observability Leader

Hiring Organisation: Jobleads-UK
Location: Nottingham, England, United Kingdom

London Stock Exchange Group (LSEG) is seeking a Senior SRE to strengthen reliability, observability, security, and operational excellence across its Risk Intelligence division. You will be a hands-on technical leader shaping foundations of reliability across projects. You will collaborate with Architecture, Engineering, Security, and Platform teams, owning observability strategies ...

Remote Senior RubyOnRails Engineer

Hiring Organisation: Hulcan
Location: Edinburgh, UK
Employment Type: Full-time

About MILE: MILE is the new members-only shopping destination redefining luxury commerce. We offer access to a curated, seasonless catalogue of the most sought-after products from globally renowned fashion houses, all at unmatched ...