626 to 650 of 709 Observability Jobs in England

DevSecOps Capability Manager

Hiring Organisation
WRK DIGITAL LTD
Location
Skipton, North Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
improvement Strategy, Governance & Technical Direction Set DevSecOps strategy across pipelines and security automation Establish governance for CI/CD, IaC, and cloud delivery Define observability standards (SLOs, tracing, dashboards) Embed security into pipelines (SAST, SCA, DAST, secrets, IaC scanning) Govern "Golden Path" templates and adoption Operational Oversight & Risk Management Oversee …/CD, DevSecOps, and security integration Strong cloud, containerisation, and IaC knowledge Proven ability to improve DORA and engineering performance metrics Experience with observability and monitoring frameworks Strong background in security tooling (SAST, SCA, DAST, scanning tools) Solid understanding of cloud security, IAM, and zero-trust principles Experience working ...

Digital Senior Full Stack Engineer

Hiring Organisation
Leeds Building Society
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
services. You'll lead complex technical delivery, champion modern engineering practices and help shape high-quality solutions through clean architecture, automation, CI/CD, observability and secure-by-default development. Just as importantly, you'll coach and mentor other engineers, raise standards across the squad and define ways of working. … leading code/design reviews; uplifting test automation and quality gates. Ability to influence stakeholders across Product, Architecture, InfoSec, Risk and Operations; governance experience. Observability experience: metrics, logs, traces; operational ownership of services. Experience of supporting UI/UX Design would be beneficial And in return ...

Senior AI Engineer

Hiring Organisation
Adria Solutions Ltd
Location
Manchester, United Kingdom
Employment Type
Permanent
Salary
£75000 - £110000/annum
solutions securely within enterprise environments. Ensure solutions leverage Private Endpoints, secure networking, identity management, and enterprise-grade governance controls. Establish monitoring, evaluation, and observability frameworks for AI systems, including hallucination detection, model drift monitoring, performance tracking, and cost optimisation. Partner with operational and commercial stakeholders to identify high-value … evaluation. Experience applying Data Science methodologies to solve complex business problems and identify opportunities for AI adoption. Experience with GenAIOps, LLMOps, MLOps, and AI observability platforms. Exposure to Computer Vision, OCR, Voice AI, Conversational AI, or multimodal AI solutions. Experience working within operational, retail, automotive, logistics, or customer-centric organisations. ...

Enterprise Head of AI Engineering (Founding)

Hiring Organisation
Pyxos
Location
City of London, London, United Kingdom
position You will own the technical direction of our agent surface, the proprietary build environment behind it (our Agentic Studio), and the evaluation, observability, and safety layers that make the system trustworthy enough for regulated enterprise deployment. We build with AI: agentic development tooling is core to how Pyxos ships … then-execute patterns, output validation, tool-use restrictions, policy enforcement. • Production engineering rigor. Strong Python; cloud fluency (AWS, GCP, or Azure); CI/CD, observability, cost attribution. • Engineering leadership at startup pace. You have hired, managed, and grown teams — not just been an individual contributor. Nice to have: regulated-industry ...

Principal Engineer - Platform Enablement

Hiring Organisation
Centrica - CHP
Location
Maidenhead, Berkshire, UK
Employment Type
Full-time
enhance safety, compliance, customer experience, and productivity Establish engineering excellence across teams: Champion high engineering standards including clean architecture, CI/CD automation, observability, testing strategies, release processes, telemetry, performance tuning, and secure-by-design principles Lead platform performance, reliability & offline capability: Ensure the environment performs reliably in challenging field … Quality and Platform-wide capabilities: Shape quality, resilience, and security strategies across teams-ensuring teams adopt shift-left testing, strong security hygiene, consistent observability, and reliable operational processes Improve how work is done: Continuously identify opportunities to automate, simplify, reduce cycle time, improve developer experience, adopt new tools ...

Enterprise Data Architect: AI-Ready Lakehouse & Governance

Hiring Organisation
Jobleads-UK
Location
Fleet, England, United Kingdom
Quantios is a leading provider of software solutions for the trust administration and corporate services industry. With over 30 years of experience, we empower our clients with innovative technology that enhances governance, operations, and investment ...

Splunk Lead Engineer

Hiring Organisation
VIQU IT
Location
London, Bishopsgate, United Kingdom
Employment Type
Contract
Contract Rate
£550 - £700/day Inside IR35
client a leading finance house are looking for a Lead Splunk Engineer to take the lead in the design and implementation of monitoring and observability patterns and standards within the Observability Team. This role will act as a technical authority, ensuring best practices are followed and an automation-first approach is taken … mentoring the team to build sustainable capability, advocate monitoring and observability best practice to the wider technology domain. For this opportunity you will have proven skills in: · Attention to detail with the ability to craft concise, informational user documentation · Experience of researching and developing solutions that expand, modernise or improve ...

Splunk Lead Engineer

Hiring Organisation
VIQU IT Recruitment
Location
Street, Somerset, UK
client a leading finance house are looking for a Lead Splunk Engineer to take the lead in the design and implementation of monitoring and observability patterns and standards within the Observability Team. This role will act as a technical authority, ensuring best practices are followed and an automation-first approach is taken … mentoring the team to build sustainable capability, advocate monitoring and observability best practice to the wider technology domain. For this opportunity you will have proven skills in: · Attention to detail with the ability to craft concise, informational user documentation · Experience of researching and developing solutions that expand, modernise or improve ...

AI Engineer

Hiring Organisation
Hyre AI Limited
Location
Paddington, Warrington, United Kingdom
Employment Type
Permanent
Salary
GBP 60,000 - 80,000 Annual
tool-calling patterns Extend the MCP server with new tools and capabilities Enforce structured outputs and validation across LLM boundaries 2. LLM Quality, Evals & Observability Build the layer that lets the team ship LLM features with confidence. You will: Design and grow the eval platform - golden datasets, regression suites … judge Integrate observability and tracing across providers and prompt versions Track cost, latency, and quality per prompt, model, and client Build guardrails for prompt injection, PII, and output safety Drive prompt engineering practice - versioning, A/B testing, platform overlays 3. Cloud & Data Infrastructure Own the cloud substrate that runs ...

AI Engineer

Hiring Organisation
Hyre AI Limited
Location
City of Westminster, Greater London, Paddington, United Kingdom
Employment Type
Permanent
Salary
£60000 - £80000/annum Plus Equity
tool-calling patterns Extend the MCP server with new tools and capabilities Enforce structured outputs and validation across LLM boundaries 2. LLM Quality, Evals & Observability Build the layer that lets the team ship LLM features with confidence. You will: Design and grow the eval platform - golden datasets, regression suites … judge Integrate observability and tracing across providers and prompt versions Track cost, latency, and quality per prompt, model, and client Build guardrails for prompt injection, PII, and output safety Drive prompt engineering practice - versioning, A/B testing, platform overlays 3. Cloud & Data Infrastructure Own the cloud substrate that runs ...

DevOps Engineer

Hiring Organisation
Lorien
Location
West Drayton, England, United Kingdom
DevOps and platform delivery across the squad • Building and improving CI/CD pipelines, automation and infrastructure standards • Supporting operational stability through monitoring, observability and proactive maintenance • Working closely with architecture, cyber and platform teams to deliver pragmatic outcomes • Leading infrastructure improvements, environment optimisation and governance alignment • Creating reusable templates … platform engineering, DevOps and infrastructure automation • Experience with CI/CD tooling such as GitHub Actions, Jenkins or similar • Knowledge of monitoring and observability tools (Datadog, CloudWatch etc.) • Experience operating in large-scale, complex enterprise environments • Ability to balance technical delivery with stakeholder management Bonus experience: • Previous experience within consulting ...

Senior Director, Cloud Engineering

Hiring Organisation
Travelport
Location
Kington Langley, Wiltshire, UK
onboards development teams and products to this platform. You’ll be responsible for driving the move to infrastructure as code, account management, automated pipelines, observability, resilience, operating coverage and cost control. You’ll also help define how the platform supports AI, data engineering and high-scale product demand, including where … standards for infrastructure as code, CI/CD, AWS account management, platform guardrails and developer enablement. Improve the operational model for the platform, including observability, incident response, reliability and 24/7 support coverage. Partner with Product and Commercial teams to get ahead of major demand changes, customer commitments ...

Senior Director, Cloud Engineering

Hiring Organisation
Travelport
Location
Kington Langley, England, United Kingdom
onboards development teams and products to this platform. You’ll be responsible for driving the move to infrastructure as code, account management, automated pipelines, observability, resilience, operating coverage and cost control. You’ll also help define how the platform supports AI, data engineering and high-scale product demand, including where … standards for infrastructure as code, CI/CD, AWS account management, platform guardrails and developer enablement. Improve the operational model for the platform, including observability, incident response, reliability and 24/7 support coverage. Partner with Product and Commercial teams to get ahead of major demand changes, customer commitments ...

Site Reliability Engineer

Hiring Organisation
Autonomai Recruitment
Location
London Area, United Kingdom
intervention. Working extensively in Linux-based environments supporting production infrastructure. Monitoring, troubleshooting, and resolving issues across distributed systems and services. Improving incident response, alerting, observability, and system resilience. Partnering with engineering and infrastructure teams to deliver robust operational support. Contributing to performance tuning and support for low-latency environments. What … communication skills and the ability to work closely with technical teams. Nice to have Experience in latency-sensitive environments. Familiarity with monitoring, logging, and observability tooling. Exposure to cloud, containers, or infrastructure-as-code. Experience working in environments with strong automation and change control. Why the role stands ...

Data Platform Engineer

Hiring Organisation
Cognify Search
Location
London Area, United Kingdom
ideally AWS CDK A solid understanding of data platform design, scalability and performance optimisation Experience building reliable systems with CI/CD, testing and observability practices Knowledge of modern data storage technologies and large-scale data processing Why This Role Stands Out Most Data Engineering roles focus on moving data … critical commercial workflows. The challenges are genuinely engineering-focused: High-volume event processing Real-time decision support Low-latency data systems Platform reliability and observability Revenue-critical infrastructure You'll join a business with a genuine tech-for-good mission, solving complex problems with data while helping shape the future ...

Operations Engineer

Hiring Organisation
Ascent Resourcing Limited
Location
Birmingham, West Midlands, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £60,000 per annum
continuity. Key Responsibilities Provide operational support for enterprise platforms, applications, integrations, and associated technologies. Monitor system health, availability, and performance using monitoring, alerting, and observability tools. Analyse, troubleshoot, and resolve incidents affecting services and platforms. Perform root cause analysis and contribute to implementing permanent solutions to prevent recurring issues. Coordinate … within IT operations, support engineering, or service management environments. Experience supporting business-critical production services and operational platforms. Knowledge of monitoring, logging, alerting, and observability practices. Experience working with incident, problem, change, and release management processes. Excellent communication skills with the ability to collaborate effectively across multiple technical and business ...

Software Engineer III

Hiring Organisation
Expedia Group
Location
Greater London, England, United Kingdom
lines of business, ensuring scalability and preventing regression to existing services Test, debug, and resolve production issues within established SLAs, maintaining system reliability and observability Proactively collaborate with peers across the organization to identify cross-dependencies and engage in shared problem-solving Identify areas of inefficiency in code or system … with on-call responsibilities for tier 1 or business-critical services, including incident response, troubleshooting, and rollback procedures Experience setting up alerts, monitors, and observability tooling for critical production systems Demonstrated ability to architect services end-to-end and build scalable, resilient distributed systems Experience designing and building APIs ...

Senior Director, Data and Information Marketplace

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
ensuring intuitive experiences for both people and agents across discovery, access, sharing, understanding and use. Advise the development of capabilities for data access, lineage, observability and quality so that data assets are transparent, trusted and usable at scale. Shape enterprise approaches to data, information and knowledge lifecycle management, embedding governance … seamless experiences that are widely adopted by users and machines across multiple enterprise business units. Deep expertise in relevant capability areas, including data quality, observability, lineage, access management, lifecycle management and information governance. Measurable evidence of optimising the data P&L, covering cost/FinOps, value realisation and sustainability. ...

Senior Data Analyst - Product Reliability

Hiring Organisation
Wise
Location
Greater London, United Kingdom
Employment Type
Full Time
Salary
60000 to 85000 GBP Annually
Grafana, Lightdash, or Superset Ability to build and manage data pipelines that are modularised and scalable, using tools like DBT and Airflow Familiarity with observability/reliability concepts (SLIs, SLOs, incidents) Some experience with Python/data transformation (DBT, etc.) is helpful This is not a data engineering role, depth … KPIs Scaling our infrastructure at Wise: how we make it work Wise's Tech Stack 2025 Measuring meaningful availability/uptime of Wise Why Observability is a must for product engineering teams Grafana Mimir Compaction: From Bottleneck to Savings For everyone, everywhere. We're people building money without borders — without ...

Staff Reliability Engineer (Full Stack)

Hiring Organisation
Feeld
Location
Greater London, United Kingdom
Employment Type
Full Time
Salary
100000 to 130000 GBP Annually
Native). Lead technical problem-solving during incidents: coordinate response, diagnose root causes, communicate status, and drive to resolution. Build and evolve monitoring/observability (dashboards, alerts, tracing, logging) that enables fast detection and diagnosis. Drive post‐incident reviews (blameless) and ensure learnings become durable fixes (tech changes, runbooks, automation … comfort working across services and APIs. Proven incident response leadership: on-call participation, triage, mitigation, and root-cause analysis (RCA) with follow-through. Solid observability skills: practical experience with logging/metrics/tracing and turning signals into actionable alerts and dashboards. Experience collaborating with mobile teams and understanding mobile ...

Python Backend Engineer

Hiring Organisation
Daniel James Resourcing Ltd
Location
St. Albans, Hertfordshire, South East, United Kingdom
Employment Type
Contract
Contract Rate
Up to £550/ day Outside IR35
heavily focused on: Real-time event-driven processing High-throughput backend services Distributed microservice architectures Low-latency data ingestion pipelines Operational resilience and observability Production-scale concurrency and asynchronous processing You'll be joining a team working on systems that continuously ingest and process large volumes of operational and telemetry-style … asynchronous processing Strong practical experience with messaging/event-driven architectures Experience handling high-throughput or streaming workloads Strong operational mindset around resilience, retries, observability, scaling, and failure handling Engineers who still code heavily day-to-day Highly desirable: IoT, telemetry, or time-series style systems FastAPI Kafka/Service ...

Application Support - Commodities Trading Firm - Up to £70k + Bonus - Hybrid

Hiring Organisation
Saragossa
Location
London Area, United Kingdom
Europe's leading commodity trading firms, and gain exposure to a technology landscape that's constantly evolving. From cloud platforms and Kubernetes to automation, observability, databases, and emerging AI initiatives, you'll be surrounded by the kind of technology and business exposure that can significantly accelerate your career. Your … years’ application support experience within financial services or another regulated environment. You’ll be technically confident with strong SQL skills, experience using monitoring and observability tools, and exposure to scripting or automation (such as Python or PowerShell). Any interest in or experience with AI would be a strong plus. ...

Senior Backend / Full-Stack Engineer (E5/E6 Level) – AI-Native Startup – Strong Comp + Equity

Hiring Organisation
Mondrian Alpha
Location
London Area, United Kingdom
preferred • Experience with modern frontend frameworks (React/Next.js) is a plus for full-stack candidates • Strong understanding of system design, reliability, scalability, and observability • Experience in startups or fast-paced product environments is highly desirable • AI-native mindset — comfortable leveraging AI tooling and rapid iteration workflows • Strong communication skills … React, Next.js • AI-native workflows and internal LLM tooling • Distributed systems and real-time infrastructure • OpenSearch, SingleStore, Trigger.dev, Axiom • Modern cloud-native infrastructure and observability stack What they offer: • Excellent compensation + meaningful equity • High-ownership environment with direct impact on product and architecture • Small, elite engineering team • Direct collaboration ...

Staff Engineer

Hiring Organisation
Stepstone UK
Location
South East London, London, United Kingdom
Employment Type
Permanent
engineering standards, best practices and reusable patterns while partnering with Enterprise Architecture and influencing technical direction Drive engineering excellence by improving code quality, testing, observability, reliability and operational practices Support end-to-end delivery by guiding teams through complex technical challenges, improving decision-making, and contributing to planning and risk … data lakes/lakehouse architectures, Iceberg or similar table formats, as well as batch and streaming processing Knowledge of data quality, governance, cataloguing and observability tools (e.g. Datadog), with DBT or AI-assisted engineering practices as a plus Additional Information Your benefits We're a community here that cares as much about ...

Senior Network Engineer, Cingularity

Hiring Organisation
IMG
Location
London Area, United Kingdom
specialised DTM (Dynamic Synchronous Transfer Mode) network—while strategically introducing automation to enhance resilience and performance. While you will assist with the development of observability tools, 24/7 monitoring is managed by our Technical Operations Centre (TOC), supported by a joint effort between Systems, Broadcast, and Network Engineering teams. … transmission paths. NetOps & Monitoring Refinement Internal Tooling: Build and refine monitoring techniques where the primary "customers" are our internal TOC and Event Engineering teams. Observability Design: Utilise and assist in the development of modern monitoring and logging systems (e.g., Prometheus, Grafana, ELK/OpenSearch) and the Netbox source of truth. ...