Observability Jobs in London

76 to 100 of 432 Observability Jobs in London

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

West London, UK
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Central London, UK
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

City of London, London, United Kingdom
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

East London, London, United Kingdom
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

London Area, United Kingdom
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Engineering Manager, MLOps, Marketplace, Ecommerce | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Central London / West End, London, United Kingdom
Hybrid/Remote Options
Owen Thomas | Pending B Corp™
with CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Familiarity with distributed computing frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed). Strong understanding of monitoring, logging, and observability for large-scale ML systems. Experience in cost optimisation for compute/GPU workloads. Excellent people leadership and communication skills, able to influence technical and non-technical stakeholders. Comfortable working

Senior AI Engineer

London, South East, England, United Kingdom
Chambers and Partners
/Azure Functions, API Management). Implement CI/CD (GitHub Actions/Azure DevOps), Infrastructure as Code (Bicep/Terraform), secrets via Azure Key Vault, private networking. Add observability: tracing/telemetry (OpenTelemetry, LangSmith), metrics, logs, cost and token usage monitoring, alerts. Apply evaluation & QA: regression suites, offline eval sets/golden data, RAG evals (faithfulness, answer relevance, citation
Employment Type: Full-Time
Salary: Competitive salary

Azure DevOps Engineer x 2 - Remote - New Roles!

West London, UK
Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected

Azure DevOps Engineer x 2 - Remote - New Roles!

Central London, UK
Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected

Azure DevOps Engineer x 2 - Remote - New Roles!

East London, London, United Kingdom
Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected

Azure DevOps Engineer x 2 - Remote - New Roles!

City of London, London, United Kingdom
Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected

Azure DevOps Engineer x 2 - Remote - New Roles!

Central London / West End, London, United Kingdom
Hybrid/Remote Options
RedRock Resourcing
in production (AKS): cluster operations, node pools, networking (CNI), RBAC and workload identity. Experience with GitOps and container build pipelines (ACR, OPA policies, image scanning). Working knowledge of observability tooling (Azure Monitor, Log Analytics, Application Insights, Datadog/Grafana) and alerting/response workflows. Good understanding of the Microsoft Cloud Adoption Framework, Azure Landing Zones and the Well-Architected

Site Reliability Engineer

London Area, United Kingdom
Searchability NS&D
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform, Ansible

Site Reliability Engineer

City of London, London, United Kingdom
Searchability NS&D
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform, Ansible

Site Reliability Engineer SRE - eDV Cleared

London, South East, England, United Kingdom
Searchability NS&D
pipelines and automation tools. Ensure system reliability, availability, and performance through proactive monitoring. Collaborate with developers, platform engineers, and security teams to optimise service delivery. Drive operational excellence through observability and robust incident management practices. Required Skills & Experience: Active eDV Clearance (essential). Strong background in Linux or Cloud-native environments. Hands-on experience with tools such as Terraform, Ansible
Employment Type: Full-Time
Salary: £50,000 - £80,000 per annum

Senior Software Engineer | Python | Fully Remote

West London, UK
Hybrid/Remote Options
Wilson Brown
Salary: Up to £100,000 (DOE) + Equity. Location: Fully remote (UK only). Stack: Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling. You’ll join a remote-first, high-trust engineering team working with a modern, cloud-native stack, with real influence over technical decisions from day one. You

Senior Software Engineer | Python | Fully Remote

Central London, UK
Hybrid/Remote Options
Wilson Brown
Salary: Up to £100,000 (DOE) + Equity. Location: Fully remote (UK only). Stack: Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling. You’ll join a remote-first, high-trust engineering team working with a modern, cloud-native stack, with real influence over technical decisions from day one. You

Senior Software Engineer | Python | Fully Remote

East London, London, United Kingdom
Hybrid/Remote Options
Wilson Brown
Salary: Up to £100,000 (DOE) + Equity. Location: Fully remote (UK only). Stack: Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling. You’ll join a remote-first, high-trust engineering team working with a modern, cloud-native stack, with real influence over technical decisions from day one. You

Senior Software Engineer | Python | Fully Remote

City of London, London, United Kingdom
Hybrid/Remote Options
Wilson Brown
Salary: Up to £100,000 (DOE) + Equity. Location: Fully remote (UK only). Stack: Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling. You’ll join a remote-first, high-trust engineering team working with a modern, cloud-native stack, with real influence over technical decisions from day one. You

Senior Software Engineer | Python | Fully Remote

Central London / West End, London, United Kingdom
Hybrid/Remote Options
Wilson Brown
Salary: Up to £100,000 (DOE) + Equity. Location: Fully remote (UK only). Stack: Python, TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling. You’ll join a remote-first, high-trust engineering team working with a modern, cloud-native stack, with real influence over technical decisions from day one. You

Platform Engineer

City of London, London, United Kingdom
Burns Sheehan
security best practices. Building and maintaining CI/CD templates to enable rapid, reliable deployments. Developing a developer portal (e.g. Backstage) and internal tooling to support engineering productivity. Supporting observability initiatives and contributing to reliability metrics. Working on internal open-source projects that enhance the developer experience. What They’re Looking For: Strong experience with AWS (ECS, EC2, Lambda, VPC

Platform Engineer

London Area, United Kingdom
Burns Sheehan
security best practices. Building and maintaining CI/CD templates to enable rapid, reliable deployments. Developing a developer portal (e.g. Backstage) and internal tooling to support engineering productivity. Supporting observability initiatives and contributing to reliability metrics. Working on internal open-source projects that enhance the developer experience. What They’re Looking For: Strong experience with AWS (ECS, EC2, Lambda, VPC

Platform Tech Lead

City of London, London, United Kingdom
Prism Digital
Kubernetes: Workload orchestration and container management. CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation. Event-Driven Architecture: Kafka or similar messaging systems. Monitoring & Observability: Azure Monitor, OpenTelemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls. Nice to Haves: Experience in regulated environments (banking, fintech, healthcare). Background in Site

Technical Lead

City of London, London, United Kingdom
Prism Digital
Kubernetes: Workload orchestration and container management. CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation. Event-Driven Architecture: Kafka or similar messaging systems. Monitoring & Observability: Azure Monitor, OpenTelemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls. Nice to Haves: Experience in regulated environments (banking, fintech, healthcare). Background in Site

Technical Lead

London Area, United Kingdom
Prism Digital
Kubernetes: Workload orchestration and container management. CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation. Event-Driven Architecture: Kafka or similar messaging systems. Monitoring & Observability: Azure Monitor, OpenTelemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls. Nice to Haves: Experience in regulated environments (banking, fintech, healthcare). Background in Site

Observability, London
10th Percentile: £62,500
25th Percentile: £73,750
Median: £90,000
75th Percentile: £120,000
90th Percentile: £157,500