126 to 150 of 263 Observability Jobs in London

Principal Payments Architect

Hiring Organisation
Endava
Location
London, England, United Kingdom
resilient payment architectures capable of supporting high transaction volumes and demanding availability requirements Define and govern non-functional requirements including latency, throughput, availability, recoverability, observability, and security Shape API strategies, integration models, and event-driven or distributed system designs that support extensibility and regulatory compliance Contribute to PCI DSS scoping ...

Director of Software Engineering (Fleet Management)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
with independent trust boundaries. Integration and ensuring consistency with data‐centre inventory management tooling (DCIM), bare‐metal provisioning systems, credential stores and monitoring infrastructure. Observability: structured logging, metrics, distributed tracing and tooling that lets operators troubleshoot effectively. What you’ll lead A team of highly talented software engineers, from ...

Principal Architect - Gen AI

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
frameworks (e.g., TensorFlow, PyTorch) and cloud platforms (AWS, Azure, GCP). Experience with AI infrastructure components including model gateways, orchestration layers, and observability tools. Familiarity with AI governance, data privacy, international regulation and ethical AI frameworks. Excellent leadership, communication, and interpersonal skills. Strong analytical and problem‐solving abilities. Ability ...

Senior Observability Platform Engineer

Hiring Organisation
Finalto
Location
London Area, United Kingdom
support the systems that ensure trades are processed, reconciled, and reported accurately. As the business continues to grow, we are looking for a Senior Observability Platform Engineer based in our London Office to help ClearVision build the next stage of its observability capability. The Team ClearVision … Australia, Dubai and the US. The Role This role is not only about maintaining monitoring tools. It is about designing and implementing a coherent observability platform that makes our systems easier to understand, easier to operate, and faster to diagnose. You will work closely with Platform Engineering and service teams ...

Site Reliability Engineer

Hiring Organisation
Sphere Digital Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£550 - £551 per day
reduce risk and accelerate delivery Implement safe deployment strategies such as canary releases and blue/green deployments Ensure strong rollback and recovery mechanisms Observability & Performance You will be expected to: Build and enhance observability solutions including metrics, logging, and tracing Work with teams to reduce alert fatigue and improve … high-severity incidents and leading remediation efforts Preferred Qualifications Ideally, candidates will also have: Experience with multi-region or multi-cloud architectures Familiarity with observability tools such as Prometheus, Grafana, or Datadog Previous mentoring or technical leadership experience Experience with Infrastructure as Code tools such as Terraform or CloudFormation Exposure ...

Senior SRE Engineer

Hiring Organisation
Prism Digital
Location
City of London, London, United Kingdom
Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services Location: London (Hybrid, typically 3 days onsite) Permanent, Full-time Salary: £80k–£90k + bonus + benefits Visa sponsorship: Not available The Role You’ll join as the first dedicated SRE hire, with responsibility for establishing SRE practices … across a live Azure-based platform and a new strategic platform being brought into service. The role is focused on reliability, observability, incident management, resilience, and automation. You’ll help define how services are measured and operated, introducing practical improvements around SLIs, SLOs, error budgets, monitoring, and service ownership. ...

Observability/Monitoring Service Owner - Cloud

Hiring Organisation
Hays Specialist Recruitment Limited
Location
Uxbridge, Middlesex, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
have an excellent contract job opportunity for Observability/Monitoring Service Owner - Cloud for our leading client. Role overview Own the technical execution of the Observability solutions, integration of monitoring tools, leveraging the AI capabilities in the NOW platform to manage events of client's Transform products and technical platforms. … Waterside (UB7 0GB) (2-3 days per week onsite) Pay - attractive daily rate (inside IR35) Skills Minimum Requirements: Extensive experience (typically 15+ years) in observability and automation technology, tools, service, process with a strong focus on management, effectiveness and architecture. Significant experience in observability and automation architecture and enterprise systems. ...

Senior Product Manager, AI Telemetry & Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A leading financial markets infrastructure provider is seeking a skilled individual to define and enhance telemetry systems for customer experience optimization. This role requires strong product management skills and familiarity with metrics and event-based ...

Site Reliability Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
City of London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £70,000 per annum
shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services - EC2, RDS, S3, IAM, etc.) Monitoring and Observability Grafana, Prometheus, Datadog Kubernetes (building and managing production clusters) Terraform (IaC provisioning) Python, Bash or Go (scripting, automation) GitHub Actions (CI/CD pipelines) What … They’re Looking For Experience in AWS cloud infrastructure (ideally in a regulated or high-traffic environment) Previous experience working with Monitoring and Observability Tools Hands-on Kubernetes know-how, specifically with EKS. Solid IaC experience with Terraform. Experience with containerisation (Docker, Helm) and CI/CD (GitHub Actions ...

Senior Technical Architect

Hiring Organisation
Cognizant
Location
London, South East England, United Kingdom
Hybrid working model – remote and on-site as required Overview Ministry of Justice is seeking an experienced Senior Technical Architect to support Legal Aid Agency on a temporary consultancy engagement. The role provides architectural leadership ...

Lead Fin Ops Engineer

Hiring Organisation
Experis
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £90000/annum bonus + bens
Microsoft Azure while maintaining agility and fostering innovation. This position is perfect for engineers who are passionate about optimising cloud usage, enhancing cost observability, and championing a Fin Ops culture. Experience in some of the following would be ideal Partner with engineering, finance and product teams to drive cost-efficiency ...

Machine Learning Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£500 - £560 per day
reproducible training workflows, CI/CD for model deployment, batch and real-time model serving, feature consistency, and monitoring * Uphold strong standards around testing, observability, and operational excellence * Contribute to a scaling engineering culture where experimentation and measurable outcomes are central Your Skills and Experience * Strong commercial experience building ...

Product Manager

Hiring Organisation
World Wide Technology
Location
London Area, United Kingdom
successful candidate will require strong stakeholder engagement, vendor coordination with Red Hat, manage the full OpenShift platform lifecycle, including feature prioritization, upgrades, patching, observability, SLAs/SLOs, and incident leadership, please see requirements below. This is a contract role & Inside IR35 OpenShift Product Manager Contract Duration: 6 months Contract ...

OpenShift Product Manager

Hiring Organisation
WWT EMEA UK LIMITED
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
Up to £620 per day
successful candidate will require strong stakeholder engagement, vendor coordination with Red Hat, manage the full OpenShift platform lifecycle, including feature prioritization, upgrades, patching, observability, SLAs/SLOs, and incident leadership, please see requirements below. This is a contract role & Inside IR35 OpenShift Product Manager Contract Duration: 6 months Contract ...

Scala Developer (Remote)

Hiring Organisation
Stealth iT Consulting
Location
City of London, London, United Kingdom
Agile environment (Scrum/Kanban). Participate in code reviews, architecture discussions and pair programming. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Help define CI/CD pipelines and deployment processes (e.g., Jenkins/GitHub Actions/Concourse). Produce concise technical documentation ...

Scala Developer (Remote)

Hiring Organisation
Stealth iT Consulting
Location
East London, London, United Kingdom
Agile environment (Scrum/Kanban). Participate in code reviews, architecture discussions and pair programming. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Help define CI/CD pipelines and deployment processes (e.g., Jenkins/GitHub Actions/Concourse). Produce concise technical documentation ...

Scala Developer - Remote Contract - Outside IR35

Hiring Organisation
Stealth iT Consulting
Location
East London, London, United Kingdom
environment (Scrum/Kanban). Participate in code reviews, architecture discussions, and pair programming sessions. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Assist in defining CI/CD pipelines and deployment processes (e.g., Jenkins, GitHub Actions, Concourse). Produce concise technical documentation ...

Scala Developer - Remote Contract - Outside IR35

Hiring Organisation
Stealth iT Consulting
Location
City of London, London, United Kingdom
environment (Scrum/Kanban). Participate in code reviews, architecture discussions, and pair programming sessions. Troubleshoot and resolve production issues; contribute to reliability and observability (logging, metrics, alerts). Assist in defining CI/CD pipelines and deployment processes (e.g., Jenkins, GitHub Actions, Concourse). Produce concise technical documentation ...

Sr Product Manager

Hiring Organisation
Infoplus Technologies UK Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £400 to £500 per day
community. Desirable skills/knowledge/experience: Knowledge of data platforms (e.g. Snowflake, Azure Data Lake). Understanding of monitoring, model performance tracking, and observability best practices ...

Head of SRE Production Support

Hiring Organisation
Huxley Associates
Location
London, United Kingdom
Employment Type
Permanent
Salary
£130000 - £180000/annum
teams and external counterparties DR and BCP design and testing: runbooks, failover playbooks, and RTOs that are tested under realistic conditions, not just documented Observability strategy: monitoring, alerting, and log pipeline design - you define what good looks like and hold teams to it Capacity planning and infrastructure cost management balancing ...

Ruby on Rails Backend Engineer

Hiring Organisation
Rise Technical Recruitment
Location
Fulham, London, United Kingdom
Employment Type
Permanent
Salary
£60000 - £80000/annum Private Healthcare + Holiday + Pensi
safety, and inclusion for millions of users. In this varied role, you will move beyond feature development to focus on the stability, performance, and observability of the core Rails ecosystem. You will act as a trusted backend partner for Product and Platform teams, stabilising third-party integrations, improving background ...

Senior C++ Engineer

Hiring Organisation
Infoplus Technologies UK Ltd
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£90,000
knowledge to successfully translate the requirements into actual software implementation Continuously improve the stability, reliability, and performance of the trading engine Enhance monitoring and observability in collaboration with the Trading Operations team Investigate and resolve production issues such as crashes, unexpected business logic behavior, and performance bottlenecks Prepare for releases ...

Automation Developer

Hiring Organisation
GlobalLogic
Location
City of London, London, United Kingdom
Support teams in authoring and troubleshooting Terraform manifests for new and existing applications/hostnames. · Ensure WAF automation follows best practices for security, reliability, observability, and auditability. · Help document standards, patterns, and runbooks for WAF as code, enabling self-service for product teams. Department/Project Description Our Client ...

Principal Solutions Consultant

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
security protocols (TLS/SSL) and experience with data compliance, governance, and auditability. Proficiency in SQL and comfort with data‐visualisation or observability tools (e.g., Datadog, Kibana). Relevant certifications (CISSP, CCSP, AWS Security) are a plus, though proven practical impact matters most. Understanding of payments, compliance, AML, and cryptocurrency ...

Site Reliability Engineer (Contractor)

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£540 - £550 per day
automation and safety guardrails * Implement safe deployment patterns (canary, blue/green, progressive delivery) * Ensure robust rollback and recovery mechanisms Observability & Performance * Build and evolve observability tooling across metrics, logs, and traces * Reduce alert fatigue and improve signal quality * Diagnose performance bottlenecks across infrastructure and applications Infrastructure & Automation * Design and operate cloud … operating CI/CD systems with deployment safety guardrails Preferred: * Multi-cloud or multi-region resilience experience * Experience with Prometheus, Grafana, Datadog, or similar observability stacks * Prior mentorship or technical leadership experience * Experience with Terraform, CloudFormation, or other IaC tools * Exposure to AI-assisted tooling for incident analysis or operational insights BENEFITS ...