76 to 100 of 503 Observability Jobs in London

Engineering Manager, MLOps, Marketplace, Ecommerce, | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Hiring Organisation
Owen Thomas | Pending B Corp™
Location
City of London, London, United Kingdom
Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical ...

Engineering Manager, MLOps, Marketplace, Ecommerce, | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Hiring Organisation
Owen Thomas | Pending B Corp™
Location
London Area, United Kingdom
Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical ...

Engineering Manager, MLOps, Marketplace, Ecommerce, | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Hiring Organisation
Owen Thomas | Pending B Corp™
Location
Central London / West End, London, United Kingdom
Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical ...

Senior AI Engineer

Hiring Organisation
Chambers and Partners
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
. Implement CI/CD (GitHub Actions/Azure DevOps), Infrastructure as Code (Bicep/Terraform), secrets via Azure Key Vault, private networking. Add observability: tracing/telemetry (OpenTelemetry, LangSmith), metrics, logs, cost and token usage monitoring, alerts. Apply evaluation & QA: regression suites, offline eval sets/golden data ...

Observability Developer/Engineer -

Hiring Organisation
Morela
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£40,000 - £75,000 per annum
Title: Observability Developer/Engineer Location: Hybrid (UK, with travel as required) Employment Type: Full-time This role is with Morela please respond to for further informaiton Do you want to be part of something special? Morela is proud to represent our exclusive client , a fast-growing start-up transforming … this company is redefining how enterprises monitor, manage, and optimise IT operations. This is your chance to join a team shaping the future of observability and operational intelligence from the ground up. We are seeking a skilled Observability Developer to design, build, and optimise observability solutions that help enterprise clients ...

Observability Developer/Engineer

Hiring Organisation
VIQU IT
Location
London, United Kingdom
Employment Type
Permanent
Salary
£40000 - £75000/annum
Title: Observability Developer/Engineer Location: Hybrid (UK, with travel as required) Employment Type: Full-time This role is with Morela please respond to (url removed) for further informaiton Do you want to be part of something special? Morela is proud to represent our exclusive client , a fast-growing start … this company is redefining how enterprises monitor, manage, and optimise IT operations. This is your chance to join a team shaping the future of observability and operational intelligence from the ground up. We are seeking a skilled Observability Developer to design, build, and optimise observability solutions that help enterprise clients ...

Senior Specialist Engineer (SRE)

Hiring Organisation
UK Health Security Agency
Location
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom
Employment Type
Permanent
Salary
£41983.00 - £52113.00 a year
identify bottlenecks with an engineering mindset. Ensure systems can handle current and future workloads through automation and capacity planning. Continuously improve services through observability, and identify ways to improve observability practices. Follow SRE principles. Guide and educate stakeholders to adopt implemented principles. Provide technical documentation for engineers. Providing training, where … production incidents, ensuring minimal downtime and quick restoration of services. Perform root cause analysis and postmortems, implementing lessons learned to prevent recurrence. Monitoring, Alerting & Observability Contribute to the design and implementation of effective monitoring and alerting systems using tools and dashboards. Improve observability of services, ensuring issues are identified ...

Observability Engineer (Custom Data Integration)

Hiring Organisation
Ascendion
Location
London, UK
Employment Type
Full-time
Title: Observability Engineer (Custom Data Integration) Location: Remote(UK) Key responsibilities Design and implement observability frameworks: Integrate and manage monitoring, logging, and tracing for cloud-native and on-premises systems. Data integration and ingestion: Build systems to collect and ingest data from various sources, often through APIs, and manage time … design, data modeling, and data pipelines. Expertise in scripting and backend development (e.g., Python, Go, Java). Experience: Proven experience designing and scaling observability stacks for production systems. Hands-on experience with cloud platforms (AWS, Azure). o Experience with containerization (e.g., Kubernetes) is often required. o Familiarity with dashboards ...

Back end Engineer

Hiring Organisation
Lorien
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
/or PCF on-prem. Responsibilities Own services end-to-end: design, implement, test, deploy, operate. Data modelling, migrations, performance tuning, resilient integrations. Observability (metrics/logs/traces), SLOs. Nice-to-have Experience with the DWS parent and NatWest platform standards is a strong plus. Experience working within NatWest ...

Site Reliability Engineer- eDV Cleared

Hiring Organisation
Searchability NS&D
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £95,000 per annum, Negotiable
Experience as in a Site Reliability Engineering role SITE RELIABILITY ENGINEER ESSENTIAL SKILLS- Reliability, incident response/incident management experience - Experience with Monitoring and Observability tools such as Prometheus, Grafana and OpenSearch- Automation tools (Go, Bash)- Experience with Linux- Cloud infrastructure experience (AWS, Azure, GCP)- DevOps Mindset and ability ...

Managing & Senior Consultant - DevOps & Cloud Architect (SC Eligible)

Hiring Organisation
Stealth iT Consulting
Location
London, UK
Employment Type
Full-time
models. Solid understanding of hybrid/multi-cloud environments (AWS, Azure, GCP), DevOps, CI/CD, SRE, DevSecOps models, DevX, build and deployment pipelines, observability, and ITIL. Proven experience leading/managing/mentoring a team of DevOps/SRE/Platform professionals. Presales/Business Development experience would also ...

Managing & Senior Consultant - DevOps & Cloud Architect (SC Eligible)

Hiring Organisation
Stealth iT Consulting
Location
South London, UK
Employment Type
Full-time
models. Solid understanding of hybrid/multi-cloud environments (AWS, Azure, GCP), DevOps, CI/CD, SRE, DevSecOps models, DevX, build and deployment pipelines, observability, and ITIL. Proven experience leading/managing/mentoring a team of DevOps/SRE/Platform professionals. Presales/Business Development experience would also ...

Artificial Intelligence Engineer

Hiring Organisation
Omnis Partners
Location
London, UK
Employment Type
Full-time
clear and practical terms Teaching frameworks such as LangGraph, LangChain, AutoGen and knowledge graph fundamentals Showing what "production-grade" really looks like - reliability, observability, safety, evaluation, failure handling During non-training weeks, working as a senior IC on internal AI engineering/ML projects What They're Looking ...

Backend Engineer

Hiring Organisation
Calyptus
Location
London, UK
Employment Type
Full-time
best practices and performance optimization. Experience with automated testing and CI/CD pipelines. Go development experience is a plus. Experience with monitoring and observability tools such as Prometheus and Grafana is a plus. Preferred Qualifications Previous experience in DeFi or trading platforms. Experience with Subgraph integration and GraphQL. Understanding ...

Platform Engineer

Hiring Organisation
Ncounter Limited
Location
EC2, Barbican, Greater London, Fox Holes, Wiltshire, United Kingdom
Employment Type
Permanent
Salary
£90000 - £100000/annum plus Bonus & Package
responsible for creating stable, repeatable, and compliant environments that support large scale digital systems. This includes shaping secure-by-design cloud patterns, introducing strong observability, and driving automation to remove manual effort. The work suits someone who is comfortable operating across infrastructure, pipelines, and application delivery while staying close ...

Software Engineer

Hiring Organisation
Digital Waffle
Location
City of London, London, United Kingdom
deliver iteratively Mentor junior engineers and conduct code reviews to maintain quality and consistency Help drive best practices across DevOps, CI/CD, observability, and secure coding Collaborate with other squads to ensure technical alignment across the platform What you’ll need: Strong full stack experience with a backend-first ...

Software Engineer

Hiring Organisation
Digital Waffle
Location
London Area, United Kingdom
deliver iteratively Mentor junior engineers and conduct code reviews to maintain quality and consistency Help drive best practices across DevOps, CI/CD, observability, and secure coding Collaborate with other squads to ensure technical alignment across the platform What you’ll need: Strong full stack experience with a backend-first ...

Technical Lead

Hiring Organisation
Prism Digital
Location
City of London, London, United Kingdom
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking ...

Technical Lead

Hiring Organisation
Prism Digital
Location
London Area, United Kingdom
management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking ...

SRE Expert/Coach

Hiring Organisation
eTeam Workforce Limited
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Daily
incident management. o Guide teams in implementing Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets. o Promote best practices in monitoring, observability, and blameless postmortems. Content Development & Continuous Improvement o Curate and create E-learning modules, assessments, and certification pathways. o Evaluate and iterate training materials based ...

Senior Data Engineer

Hiring Organisation
Troi
Location
London Area, United Kingdom
Lead data modelling, cataloguing, and documentation efforts. Optimise performance across data tools and technologies. Implement automated testing, anomaly detection, and validation frameworks. Support monitoring, observability, and incident-response processes. Ensure strong data governance, including access controls, encryption, and anonymisation. Manage cloud infrastructure through infrastructure-as-code tools. Collaborate with Data ...

Senior Data Engineer

Hiring Organisation
Troi
Location
City of London, London, United Kingdom
Lead data modelling, cataloguing, and documentation efforts. Optimise performance across data tools and technologies. Implement automated testing, anomaly detection, and validation frameworks. Support monitoring, observability, and incident-response processes. Ensure strong data governance, including access controls, encryption, and anonymisation. Manage cloud infrastructure through infrastructure-as-code tools. Collaborate with Data ...

Generative AI Engineer

Hiring Organisation
Anonymous
Location
Greater London, England, United Kingdom
scale You’ll bring experience in Python, vector databases, embeddings, transformers PyTorch or TensorFlow, API deployment Docker, Kubernetes, cloud (AWS/Azure/GCP) Observability, performance tuning, and model lifecycle management Experience working with enterprise security, compliance or regulated data environments is advantageous. If you’re excited by building ...

DevOps Engineer (SC Cleared) | Inside IR35

Hiring Organisation
SR2
Location
London, Chaucer, United Kingdom
Employment Type
Contract
Contract Rate
£500/day
experience, including modules and multi-environment setups Good AWS experience, including IAM, networking and container services Experience supporting services in production environments Familiarity with observability tooling (e.g. CloudWatch, Grafana, Dynatrace, Prometheus) Active SC clearance ...

Platform Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £75,000 per annum
role involves building and improving CI/CD pipelines using tools like Terraform, ArgoCD, and Jenkins to drive automation and efficiency. You'll enhance observability with Prometheus, Splunk, and Honeycomb, ensuring robust monitoring and troubleshooting capabilities. Strong collaboration is essential, as you'll work closely with engineering teams to deliver ...