376 to 400 of 570 Observability Jobs in England

Principal Software Architect

Hiring Organisation: Jobleads-UK
Location: Bristol, England, United Kingdom

/software ecosystem. Assess the architectural impact of new technologies. Be aware of the usability, performance, reliability, maintainability, testability, security and observability constraints on the software architecture. Prototyping and validating architectural concepts through proof-of-concept implementations. Contribute to future and/or related product definitions with a forward-looking ...

Engineering Manager - Platform Reliability

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Lakebase Platform Reliability team’s footprint spans multiple stacks, systems, and stakeholders. They include AI‐powered tooling and workflows for customer management, real‐time observability during incidents, monitoring and auditing systems that underpin compliance requirements, and customer‐facing operational APIs and maintenance workflows. You’ll contribute to the wider platform ...

Principal Product Manager

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

boundaries, controls, escalation paths and human-in-the-loop mechanisms Ensure agentic behaviour is understandable, predictable and trustworthy through strong guardrails, safety mechanisms and observability Contribute and partner on core platform capabilities, including agent orchestration and lifecycle management, planning, reasoning and tool use frameworks, and memory, context and state management ...

Enterprise Account Executive (UK)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

together. LangChain is a place where your contributions can shape how this technology shows up in the real world. Today, our platform includes LangSmith (Observability, Evaluation, Deployment, Fleet, and Sandboxes), our open source frameworks (LangChain, LangGraph, and Deep Agents), and the newly launched LangSmith Engine for autonomous agent improvement. ...

Senior Engineering Manager, Global Bank

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

defining scope, aligning with product leadership, and driving delivery across squads and tribes Establish and uplift tribe-wide engineering practices across areas such as observability, incident response, security, or AI workflows, setting standards that go beyond a single squad Act as a senior escalation point for production incidents and complex ...

Principal Product Manager, Data Platform

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

access via self‐service UIs and APIs. Build and evolve a data trust and certification framework, enabling users to assess dataset quality, ownership, observability, and SLAs with confidence. Embed AI‐driven discovery features such as semantic search, natural language query, and recommendations to improve data discoverability and reduce time ...

Architect/Staff Hardware Integration Engineer London, UK

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

shipped, across multiple generations in large and fast-moving organizations. Track record driving technical outcomes in organisations with high reliability expectations, including robust observability, incident management, and close collaboration with hardware and silicon teams on field issues. Outstanding technical communicator. You can articulate architectural decisions and their consequences clearly ...

Regional Vice President

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is The Role: Elastic, the Search AI company, is looking for a high-energy Regional Vice ...

Observability Engineer

Hiring Organisation: Experis
Location: Telford, Shropshire, United Kingdom
Employment Type: Contract
Contract Rate: £500 - £590/day

Observability Engineer Rate: £598 Clearance Required: SC Eligible Duration: 6 months Location: Telford - 2 days min per month IR35 Status: Inside Role Description: As a Dynatrace/Observability Engineer, you will be responsible for designing, implementing, and supporting monitoring solutions across a range of technologies and platforms, ensuring service stability … insight, and proactive incident management. Key Responsibilities: Translate high-level monitoring and non-functional requirements (NFRs) into actionable configurations in Dynatrace. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Collaborate with architects and project teams to integrate monitoring into solution ...

Senior AI Platform Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

developer tooling that enable self‐service AI development across engineering teams. Design secure, scalable deployment pipelines for AI models and applications. Build AI observability capabilities including monitoring, tracing, evaluation, cost optimisation, and production quality measurement. Collaborate closely with AI Engineers, Backend Engineers and Engineering Leadership to define platform architecture … containerised deployments. Experience with solutions such as AWS Bedrock and AgentCore. Understand how to deploy, monitor, and operate AI services in production. AI Operations & Observability Experience implementing monitoring, tracing, evaluation, and cost optimisation for AI systems. Experience with observability solutions such as Arize Phoenix, Langfuse, or Langsmith. Understand the operational ...

Principal Platform Engineer

Hiring Organisation: SF Partners Admin
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Permanent, Work From Home

capabilities. Design and operate production-grade Kubernetes platforms, including EKS, AKS or OpenShift. Define engineering standards, golden paths, reusable modules and platform patterns. Build observability strategies using Prometheus, Grafana, OpenTelemetry and modern APM tooling. Improve reliability through SLOs, incident reviews and Site Reliability Engineering (SRE) practises. Embed DevSecOps, supply-chain … Infrastructure as Code (IaC). CI/CD automation. GitOps tools such as ArgoCD or Flux. Internal Developer Platforms or self-service engineering. Observability tools including Prometheus, Grafana, OpenTelemetry, ELK, Datadog, Dynatrace or New Relic. DevSecOps and supply-chain security. SRE practises, SLOs, SLIs and incident management. Platform governance, cloud ...

Data Engineer (All Levels, Analytics & Platform) - UK Wide

Hiring Organisation: describe.me
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £40,000 - £120,000 per annum

science and ML all stand on. You'll work across the full data engineering lifecycle—from ingestion and modelling through to transformation, orchestration, quality, observability and platform operation. The role suits someone who pairs strong software engineering discipline with genuine interest in data modelling and a pragmatic view of when … equivalent) and the workflows around it Manage cloud data warehouses and lakehouses (Snowflake, BigQuery, Redshift, Synapse, Databricks) Implement data quality, testing, monitoring and observability across pipelines and models Build streaming pipelines where the use case warrants it (Kafka, Kinesis, Pub/Sub, Flink) Partner with analysts, scientists, BI developers ...

Senior DBA

Hiring Organisation: Morson Edge
Location: Manchester, North West, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £75,000

deployment and ongoing maintenance Support Infrastructure as Code and configuration management using tools such as Ansible and Terraform Collaborate with engineering teams to improve observability, resilience and operational efficiency Provide technical guidance and mentorship to junior team members Participate in incident management, root cause analysis and continuous improvement activities Contribute … Code tools, including Ansible or Terraform Strong troubleshooting and problem-solving skills across database and system layers Familiarity with CI/CD pipelines and observability tooling Desirables Experience with cloud-managed database services Knowledge of schema migration tools Understanding of disaster recovery, retention and data protection strategies Experience with monitoring ...

AI Platform/ DevOps Engineer

Hiring Organisation: The Portfolio Group
Location: City of London, London, Castle Baynard, United Kingdom
Employment Type: Permanent
Salary: £70000 - £100000/annum + Benefits

Bedrock Knowledge Bases) and embedding pipelines Build and maintain CI/CD pipelines for inference services, retrievers, ingestion workflows, and RAG components Implement observability across AI workloads using CloudWatch, MLflow, and OpenTelemetry - covering latency, throughput, cost, and system health Apply secure-by-design principles including IAM, encryption, network controls … Terraform experience for infrastructure-as-code, provisioning and managing cloud infrastructure at scale Experience operating containerised services, managing CI/CD pipelines, and owning observability and reliability Familiarity with vector databases or search infrastructure (OpenSearch, Algolia) is a strong advantage Python proficiency for scripting, automation, and deploying production services Solid ...

Principal Platform Engineer

Hiring Organisation: SF Partners Admin
Location: Bristol, UK
Employment Type: Full-time

self-service capabilities.Design and operate production-grade Kubernetes platforms, including EKS, AKS or OpenShift.Define engineering standards, golden paths, reusable modules and platform patterns.Build observability strategies using Prometheus, Grafana, OpenTelemetry and modern APM tooling.Improve reliability through SLOs, incident reviews and Site Reliability Engineering (SRE) practises.Embed DevSecOps, supply-chain security and secure … role could suit a technical lead, a hands-on architect, a senior platform engineer ready to progress, or a deep SME in Kubernetes, AWS, observability, cloud platforms or developer enablement.Why this role? Work on genuinely national-scale digital services.Join a strong Platform Engineering community.Solve complex cloud and reliability challenges.Influence engineering ...

ML Infrastructure Lead

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

versioning, reproducibility, experimentation, feature management and release management Own and improve the production environment for machine learning systems, ensuring strong standards for availability, performance, observability and resilience Define and implement monitoring across model and platform layers, including system health, data quality, drift, latency, throughput and cost efficiency Build or optimise … pipelines, infrastructure-as-code and workflow orchestration Experience with tools such as Airflow or similar platform and orchestration technologies Good understanding of model observability, data quality, feature pipelines, lineage and reproducibility Experience designing scalable infrastructure for ML workloads, including training, batch inference and real-time serving Strong appreciation of reliability ...

Devops Engineer

Hiring Organisation: Jackson Hogg Ltd
Location: Newcastle upon Tyne, Tyne & Wear, United Kingdom
Employment Type: Permanent
Salary: £50000 - £60000/annum

We're looking for a DevOps Engineer to join a growing technology team responsible for building, supporting and evolving critical cloud platforms. This role sits at the heart of a modern AWS environment, helping to ...

SRE - Site Reliability Engineer - Observability & Performance

Hiring Organisation: Sanderson Recruitment
Location: Bristol, UK
Employment Type: Full-time

Description SRE - Observability and PerformanceUp to £600 per day outside IR356 month initial contractBristol - Largely remoteI'm currently working with a client who is looking for an SRE to implement and enhance observability across Java applications, middleware and Linux infrastructure using Grafana. The role is focused on monitoring, performance analysis … monitoring, alerting and instrumentation. The environment is currently hosted on traditional infrastructure, with an AWS migration planned, offering the opportunity to develop cloud-ready observability, automation and operational capabilities as the platform evolves.Essential Skills:Strong hands-on experience in DevOps, SRE, Platform Engineering or Systems Engineering environments.Expertise in Grafana, observability ...

Lead Cloud & AI Platform Engineer

Hiring Organisation: Jobleads-UK
Location: Manchester, England, United Kingdom

data orchestration toolsets (e.g., dbt, Apache Airflow), ETL/ELT methodologies, real‐time streaming (e.g., AWS Kinesis, Apache Kafka), Vector databases, and RAG architectures. Observability & FinOps: Experience implementing modern observability tooling (OpenTelemetry) alongside automated cost‐control systems (such as Karpenter, Infracost, OpenCost, or Cloud Custodian). Domain & Sector Experience Regulated ...

Lead Cloud & AI Platform Engineer

Hiring Organisation: Jobleads-UK
Location: Leeds, England, United Kingdom

Lead Cloud & AI Platform Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

DevOps Engineer

Hiring Organisation: Fruition Group
Location: Leeds, Yorkshire, United Kingdom
Employment Type: Contract
Contract Rate: GBP Annual

Contract: Inside IR35 We're seeking an experienced Senior DevOps Engineer to join a small, highly skilled engineering team delivering a large-scale enterprise observability platform as they move away from Splunk This is an opportunity to work on a critical cloud platform supporting the migratio click apply for full ...

Principal Data Engineer – Crypto Market Data Platform

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

REST and streaming, and own core infrastructure in a 24/7 financial environment. You’ll mentor senior engineers, drive architectural decisions, and advance observability, tooling, and testing. If you can navigate ambiguity with pragmatic choices and communicate effectively with #J-18808-Ljbffr ...

Head of Product Engineering — SaaS & Data Platforms

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

senior engineering leader to own delivery across multiple squads focused on workforce intelligence and related domains. You will set standards for architecture, testing, and observability, partnering with product, design, and data science to turn customer problems into reliable, scalable solutions. You will drive the architectural roadmap for the SaaS platform ...

Lead AIOps Product Manager — Enterprise Reliability

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

reduction, anomaly detection, and automated remediation across IT operations and SRE teams. You will partner with technical and security teams to deliver enterprise-grade observability while defining KPIs, SLOs, and adoption metrics to improve reliability and reduce operational #J-18808-Ljbffr ...