76 to 100 of 131 Observability Jobs in London

AI Platform Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £130,000 per annum
environment Work closely with product and design to ensure AI features are user focused and production ready Help establish best practice around observability, governance and responsible AI development Your Skills and Experience You will be a product minded engineer who enjoys building platforms from the ground up. Strong commercial experience ...

Backend Software Engineer - Python

Hiring Organisation
Talent Locker
Location
London, United Kingdom
Employment Type
Permanent
system design Collaborate with data scientists and ML engineers to deploy models into production Mentor engineers and contribute to engineering best practices Improve observability, monitoring, and incident response processes Write maintainable, well-tested code and contribute to code reviews Requirements 5+ years of experience building and operating backend systems ...

Software Engineer AI

Hiring Organisation
Aperia Search
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£110,000 - £130,000 per annum
platform design and engineering best practices Evaluate emerging AI technologies and determine where they can create competitive advantage Help establish standards around security, testing, observability and deployment of AI systems Your Experience Strong software engineering foundations Experience building and deploying AI products into production environments Python and modern backend engineering ...

Cribl Data Analytics Engineer

Hiring Organisation
International Military Ministries
Location
City of London, London, United Kingdom
Employment Type
Contract
supporting a leading financial services organisation in London seeking an experienced Cribl Data Analytics Engineer to join a large-scale Cyber Security and Observability programme. The successful candidate will be responsible for designing, implementing, and optimising data pipelines using Cribl technologies, ensuring the efficient collection, transformation, routing, and analysis … pipelines across enterprise environments. Configure and support Cribl Stream , including data collection, transformation, filtering, enrichment, masking, and routing. Optimise telemetry ingestion into SIEM and observability platforms. Implement data reduction strategies to improve platform efficiency and reduce licensing costs. Develop and maintain data parsing, normalisation, and enrichment processes. Support integration with ...

Integration Developer FTC

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
Build connectors, event-processing services, and data pipelines Design scalable integration patterns, schemas, and event flows Develop CDC pipelines and resilient messaging solutions Improve observability through logging, metrics, and tracing Deploy containerised services using Docker and Kubernetes Contribute to architecture, code reviews, and engineering standards Collaborate with developers, data engineers … design Agile development experience Strong communication and collaboration skills Desirable Skills Go and/or Python CDC pipeline development Azure cloud experience Observability tooling (Prometheus, Grafana, OpenTelemetry) Experience within regulated environments What's on Offer Hybrid working - 2 days per week in London Salary up to £60,900 Generous pension ...

Cloud Native DevOps Engineer

Hiring Organisation
Anson Mccade
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
role will suit engineers with strong AWS infrastructure and cloud-native engineering experience who are comfortable working across platform automation, Kubernetes, CI/CD, observability, and operational support. The Role You will be responsible for designing, building, and supporting secure AWS cloud environments while helping improve automation, reliability, and deployment … Infrastructure as Code using Terraform or CloudFormation Developing and supporting CI/CD pipelines Supporting Kubernetes and container-based platforms Implementing monitoring, logging, and observability solutions Troubleshooting complex infrastructure and deployment issues Supporting secure cloud operations and Zero Trust principles Working closely with engineering and delivery teams in Agile environments ...

Senior HPC Engineer

Hiring Organisation
Spencer Rose Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 120,000 Annual
Managing storage, backup, recovery and disaster recovery solutions Monitoring platform performance, throughput and resource utilisation Implementing infrastructure automation and configuration management Building and improving observability through logging, monitoring and alerting Supporting SQL Server environments underpinning critical analytics workloads Collaborating with engineering teams to improve platform reliability, scalability and operational maturity … Experience managing backup, resilience and disaster recovery solutions Exposure to infrastructure automation tools such as Ansible, Puppet or similar Experience implementing monitoring, logging and observability solutions Strong troubleshooting skills across infrastructure, networking and application layers Desirable experience: HPE infrastructure and management tooling Terraform, Bicep or other Infrastructure as Code tooling ...

Cribl Data Analytics Engineer

Hiring Organisation
17918
Location
London, United Kingdom
supporting a leading financial services organisation in London seeking an experienced Cribl Data Analytics Engineer to join a large-scale Cyber Security and Observability programme. The successful candidate will be responsible f... CRWG1_UKTJ ...

Business Analyst London

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £500/day Inside IR35
environment comfortable coordinating across regions and product areas Exposure to the full SDLC and appreciation of front office architecture and design tradeoffs performance resiliency observability controls Technical fluency to engage engineers effectively eg Java awareness and SQL knowhow to support technical walkthroughs data analysis and troubleshooting Skills Mandatory Skills ...

Software Architect

Hiring Organisation
Spectrum It Recruitment Limited
Location
Uxbridge, London, United Kingdom
Employment Type
Permanent
Salary
£90,000
reducing unnecessary duplication Reviewing significant technical changes and guiding solution design Helping define clear service boundaries, ownership models and integration patterns Ensuring security, resilience, observability and scalability are considered from the outset Identifying architectural risks, technical debt and platform constraints Working closely with engineering and product leadership on major technical ...

Software Architect

Hiring Organisation
17918
Location
London, United Kingdom
reducing unnecessary duplication Reviewing significant technical changes and guiding solution design Helping define clear service boundaries, ownership models and integration patterns Ensuring security, resilience, observability and scalability are considered from the outset Identifying architectural risks, technical debt and platform constraints Working closely with engineering and product leadership on major technical ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
large-scale enterprise environment. An exciting opportunity working on a greenfield Kubernetes platform built using modern engineering practices across Azure, GitOps, service mesh, observability and event-driven architecture. The Role You will be responsible for building, operating and improving a shared Kubernetes platform used by application, AI and integration engineering … teams. Hands-on role covering infrastructure as code, Kubernetes operations, CI/CD, networking, observability and platform reliability. Working closely with architects and engineering teams shaping the future of the platform while helping maintain high standards across automation, security, scalability and operational excellence. Key Responsibilities Build and operate Azure Kubernetes ...

AI Platform/ DevOps Engineer

Hiring Organisation
The Portfolio Group
Location
City of London, London, Castle Baynard, United Kingdom
Employment Type
Permanent
Salary
£70000 - £80000/annum + Benefits
Bedrock Knowledge Bases) and embedding pipelines Build and maintain CI/CD pipelines for inference services, retrievers, ingestion workflows, and RAG components Implement observability across AI workloads using CloudWatch, MLflow, and OpenTelemetry - covering latency, throughput, cost, and system health Apply secure-by-design principles including IAM, encryption, network controls … Terraform experience for infrastructure-as-code, provisioning and managing cloud infrastructure at scale Experience operating containerised services, managing CI/CD pipelines, and owning observability and reliability Familiarity with vector databases or search infrastructure (OpenSearch, Algolia) is a strong advantage Python proficiency for scripting, automation, and deploying production services Solid ...

Senior Azure DevOps Engineer

Hiring Organisation
ReVybe IT Recruitment Limited
Location
London, United Kingdom
Employment Type
Permanent
Salary
£85000 - £95000/annum
using PowerShell and scripting best practices Working closely with development teams to improve deployment efficiency, platform reliability, and developer experience Implementing monitoring, logging, and observability solutions to improve platform performance and availability Driving cloud governance, security, and operational best practices across the Azure estate What We're Looking For Proven … with Terraform and Infrastructure as Code Strong Azure DevOps experience, including CI/CD pipeline automation Experience scripting with PowerShell Knowledge of monitoring and observability tools Strong understanding of cloud security, networking, and automation principles Excellent communication skills and a collaborative mindset Why Join? Join a fast-growing fintech with ...

Go Full Stack Developer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
event-driven services Contribute to CI/CD pipelines and cloud-native deployments Review code and champion engineering best practices Improve application performance, observability and reliability Collaborate within Agile delivery teams across multiple projects Support technical decision-making and continuous improvement Skills & Experience We are looking for candidates with strong … reviews, testing and engineering governance Experience with any of the following would be highly advantageous: Microsoft Azure Python GitOps tooling (Argo CD/Flux) Observability tooling (Prometheus, Grafana, OpenTelemetry) AI/LLM-enabled applications Event-driven architectures and messaging platforms What's on Offer Opportunity to work on cutting-edge ...

AI Engineer

Hiring Organisation
McCabe & Barton
Location
City, London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 800 Daily
ROLE Design and build core AI platform components for a leading buy-side investor. You'll own the LLM gateway, MCP connector layer, observability tooling, and privacy proxy translating business use cases into governed, production-ready AI systems click apply for full job details ...

Site Reliability Engineer

Hiring Organisation
Flint UK Technology Services
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Site Reliability Engineer Provide administration, support, and operational management of the Zabbix monitoring platform, ensuring reliable monitoring, alerting, and observability across enterprise infrastructure and services. Provide Tier 1 support including user access management, alert triage, and incident response. Configure and maintain Zabbix Servers, proxies, templates, hosts, triggers, dashboards, discovery rules ...

Zabbix Administrator

Hiring Organisation
Flint UK Technology Services
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Zabbix Administrator & Site Reliability Engineer Provide administration, support, and operational management of the Zabbix monitoring platform, ensuring reliable monitoring, alerting, and observability across enterprise infrastructure and services. Provide Tier 1 support including user access management, alert triage, and incident response. Configure and maintain Zabbix Servers, proxies, templates, hosts, triggers, dashboards ...

Site Reliability Engineer

Hiring Organisation
Huxley Associates
Location
Bromley, London, United Kingdom
Employment Type
Contract
Contract Rate
£1000/day
Lead role within a banking/payments environment that I thought might be of interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8+ years' experience in SRE, strong resilience engineering background ...

UK Account Manager (DV Cleared)

Hiring Organisation
ENB Recruitment and Training Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£100,000 per annum
complex national security environments and build long term strategic relationships. Experience selling enterprise technology solutions Strong background in either Cybersecurity, Networking, Infrastructure, Service Assurance, Observability or related technologies DV cleared A track record of exceeding targets and growing territories Experience managing complex, high value accounts Strong pipeline generation ...

Data Observability Engineer

Hiring Organisation
Ashdown Group
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£95,000
successful multinational technology business is looking for a Data Observability Engineer to join its growing data team in Central London. This role is hybrid youll be able to work from home 2 days per week. This is a high-impact role focused on improving data quality, reducing incidents, and building … scalable observability across a modern enterprise data platform. Youll help ensure data across the organisation is accurate, reliable, and trusted for critical business decision-making. Youll take ownership of data reliability end-to-end, designing and implementing frameworks that monitor data health, detect anomalies, and enforce standards across complex data ...

Data Observability Engineer

Hiring Organisation
Ashdown Group
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £95,000 per annum
successful multinational technology business is looking for a Data Observability Engineer to join its growing data team in Central London. This role is hybrid – you’ll be able to work from home 2 days per week.This is a high-impact role focused on improving data quality, reducing incidents, and building … scalable observability across a modern enterprise data platform. You’ll help ensure data across the organisation is accurate, reliable, and trusted for critical business decision-making. You’ll take ownership of data reliability end-to-end, designing and implementing frameworks that monitor data health, detect anomalies, and enforce standards across ...

GenAI Python Developer

Hiring Organisation
Sandhata Technologies Limited
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 80,000 Annual
access to LLM capabilities. Develop and optimise Python-based GenAI components including prompt orchestration, output validation, and evaluation tooling. Integrate LLMs with enterprise systems, observability, and security frameworks. Design and maintain CI/CD pipelines aligned to engineering standards (Azure DevOps primarily). Collaborate closely with platform leads, architects … GenAI technologies and Large Language Models. Experience evaluating LLM performance and prompt handling complexities. Solid DevOps mindset with CI/CD expertise and observability best practices (Azure DevOps preferred). Comfortable working in regulated enterprise environments with strict security controls. Experience integrating AI services into real-world apps or workflows. ...

Java Backend Developer

Hiring Organisation
Pro Contract Jobs Ltd
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
improve release reliability. Participate in code reviews, refactoring and documentation to improve maintainability, consistency and engineering standards. Contribute to continuous improvement initiatives across observability, reliability, security, and developer experience. Technical Skills & Experience Strong backend development experience using Java, ideally Java 17+ in production systems. Solid experience with the Spring ecosystem … Agile delivery teams with iterative releases and continuous improvement. Nice to Have Experience with event-driven architectures (e.g., Kafka or similar). Knowledge of observability practices (metrics, logging, tracing) and production monitoring. Experience improving performance and scalability in high-throughput systems. Understanding of secure software development and common security patterns ...

Senior Gen AI Developer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£500 - £700 per day
delivery, including development, CI/CD, deployment, monitoring and incident resolution* Build API-led, cloud-native solutions with strong engineering standards* Implement observability, logging, metrics and alerting for live production workloads* Work closely with product, platform, architecture and business teams to deliver high-impact AI applications* Ensure applications meet …/event-driven patterns* Docker and Kubernetes/EKS experience* Experience operating applications in cloud environments* Strong understanding of security, resiliency, scalability, performance and observability* Experience with monitoring tools such as CloudWatch and/or Datadog* Experience with testing strategy and LLM evaluation Nice to have: * React, Next.js and TypeScript ...