351 to 375 of 418 Observability Jobs in London

Linux Engineer

Hiring Organisation
ISL Talent
Location
London Area, United Kingdom
workflows Learning about trading infrastructure and financial markets Over time, you'll develop into a specialist Trading Infrastructure Engineer, gaining exposure to performance monitoring, observability, networking, databases, and capital markets technology. What We're Looking For Our client is more interested in curiosity, attitude, and technical foundations than years … your primary operating system Raspberry Pi projects Home labs or home servers Docker projects Bash or Python scripting PostgreSQL Networking fundamentals Monitoring and observability tools What This Role Isn't This isn't a software development role. It isn't an AI, machine learning, or product engineering position. Instead ...

Quant Dev - Pricing & Risk Stack

Hiring Organisation
Lithe Transformation
Location
City of London, London, United Kingdom
intraday becomes realistic, not aspirational. Own it on AWS. Design and run cloud-native services — compute, storage, containers, infra-as-code — with the reliability, observability and cost-awareness of someone who owns production, not someone who throws code over a wall. Engineer for correctness. Strong testing, sensible CI/… compute, storage, containers, IaC) and you understand the trade-offs, not just the buttons. Real software-engineering maturity — system design, testing, CI/CD, observability, performance profiling. You care about correctness and maintainability under pressure. Functional knowledge of commodities (in particular, oil and gas/LNG) — enough understanding of curves ...

Senior Python Backend Engineer Fully Remote, UK

Hiring Organisation
Interact Consulting Limited
Location
South West London, London, United Kingdom
Employment Type
Permanent, Work From Home
APNs/FCM), user notification preferences, audience segmentation, and delivery tracking. Integrate with third-party data providers and external services, ensuring robust failure handling, observability, and system resilience. Design and support secure internal tooling APIs, including role-based access controls, audit trails, change history, and safe administrative workflows. Build … shape technical direction and the expectation to take real ownership of what you build. Scope: backend services, infra, event-driven systems, CI/CD, observability, all built for live-event traffic. Python-first, Postgres, Redis. You'll own your services fully: building them and keeping them running in production. Heavily ...

Platform Principal Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
self-service capabilities. Upskill and Mentor: Transition the in-house engineering team into a high-performing internal platform team throughout the platform build process. Observability: Design and implement enterprise-grade logging, metrics, and tracing for Kubernetes at scale. IaC Leadership: Implement and manage Infrastructure as Code to a senior standard … Terraform/Open Tofu module design. (MUST) Kubernetes Engineering: GitOps (Argo CD/Flux), secrets management, ingress/mesh, and OPA/Gatekeeper. (MUST) Observability: OpenTelemetry (MUST) Tooling: Spacelift, Atlantis, or Terraform Cloud (Desired) Governance: EPAC (Enterprise Policy as Code) (Desired) What You'll Bring To Us Recent, hands ...

Staff Software Engineer

Hiring Organisation
Marks and Spencer
Location
London Area, United Kingdom
Staff Software Engineer, your expertise will help us on this journey, creating solutions for the business that are robust and scalable, with good observability and metrics, following best-in-class engineering practice. What You'll Do Your key accountabilities will include Software Development: Develop, test, and debug software solutions, taking … modernization drive, will be introducing new ones. The sorts of technologies include: Java, Spring, SpringBOOT, Micronaut React, Next.js, Typescript, Angular Azure Cloud, Kubernetes, Dynatrace (observability) SQL Server, MongoDB Ignite, Redis Everyone’s Welcome We are ambitious about the future of retail. We’re disrupting, innovating and leading the industry into ...

Quality Test Engineer III

Hiring Organisation
Elsevier
Location
Greater London, United Kingdom
Employment Type
Full Time
REST and evolving GraphQL APIs. Integrate and maintain automated tests within CI/CD pipelines, supporting testing at multiple levels of the stack. Use observability tooling such as New Relic and logging to support diagnosis, monitoring, and quality analysis. Technically analyse problems and proposed solutions, recognising the right level … Karate, Cucumber with RestAssured. Good exposure to CI/CD pipelines, supporting the integration of testing at various levels of the stack. Exposure to observability tooling such as New Relic or logging. The ability to technically analyse a problem and a solution, recognising the right level of detail and abstraction ...

Principal Machine Learning Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
data engineering teams to implement scalable data lakehouse oriented feature architectures and enterprise‐grade ML governance. Champion engineering standards for model quality, documentation, observability, and platform resilience. Feature Engineering & Data Architecture Architect highly scalable, production‐ready feature pipelines within Lakehouse environments. Set the technical direction for fallback and resilience strategies … including scoring metrics, latency, error analytics, and SLOs. Partner with platform teams to optimise cost, scale, and reliability of inference endpoints. Monitoring, Drift Detection & Observability Define observability standards for feature drift, concept drift, performance degradation, and data integrity. Lead the creation of dashboards, benchmarks, and automated alerting across ...

Splunk Lead Engineer

Hiring Organisation
VIQU IT Recruitment
Location
London, UK
client a leading finance house are looking for a Lead Splunk Engineer to take the lead in the design and implementation of monitoring and observability patterns and standards within the Observability Team. This role will act as a technical authority, ensuring best practices are followed, automation first approach is taken … mentoring the team to build sustainable capability, advocate monitoring and observability best practice to the wider technology domain. Minimum Criteria For this opportunity you will have proven skills in: · Building effective working relationships with others and provide challenge where appropriate · Attention to detail with the ability to craft concise, informational ...

SRE - Contract

Hiring Organisation
17918
Location
London, United Kingdom
contract basis(INSIDE IR35) with strong expertise in Dynatrace implementation . The ideal candidate should have hands-on experience designing and deploying observability solutions across complex enterprise environments, with deep expertise in Dynatrace architecture, integrations, alerting, dashboarding, and troubleshooting distributed systems. Key Requirements 8+ years of IT experience with strong … expertise in cloud and observability solutions. Expert-level experience designing, deploying, and configuring Dynatrace in complex environments. Hands-on experience with Dynatrace integrations, alerting, dashboard creation, synthetic monitoring, and distributed tracing . Strong experience implementing enterprise-scale monitoring and observability solutions. Deep expertise in AWS services including ...

Data Engineer

Hiring Organisation
LMA Recruitment
Location
South West London, London, England, United Kingdom
Employment Type
Contractor
Contract Rate
£300 - £350 per day
candidate will be responsible for building and maintaining scalable data pipelines within Google Cloud Platform (GCP), ensuring high levels of data quality, reliability, and observability across critical business data platforms. Key Responsibilities Build, maintain, and optimise scalable data pipelines within GCP Develop and manage workflows using Cloud Composer and Apache … Airflow Design and support data solutions using BigQuery Implement data quality checks and monitoring frameworks Improve observability and operational performance of data platforms Troubleshoot and resolve pipeline failures and performance issues Work closely with engineering, analytics, and product teams Follow best practices around testing, deployment, and documentation Required Skills & Experience ...

Senior Real-Time Observability & Performance Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Sivara GmbH is seeking an experienced Observability and Performance Engineer to join their global investment banking team in London. This six-month rolling contract offers a day rate of £1000 and allows for hybrid working arrangements. The ideal candidate will have extensive experience with cloud-native and Kubernetes environments … well as a strong background in observability, metrics aggregation, and performance engineering in low-latency, high-frequency settings. #J-18808-Ljbffr ...

Data Engineer

Hiring Organisation
Ashdown Group
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £95,000 per annum
able to work from home 2 days per week.This is a high-impact role focused on improving data quality, reducing incidents, and building scalable observability across a modern enterprise data platform. You’ll help ensure data across the organisation is accurate, reliable, and trusted for critical business decision-making. … platforms.You’ll have excellent SQL and Python skills coupled with experience working in modern cloud-based data environments. Hands-on experience with data observability tools such as Grafana, Monte Carlo, or Acceldata, and data governance/quality platforms like Informatica, Collibra or Microsoft Purview is highly desirable. Experience within ...

Director, Principal Java Engineer (Investment Banking)

Hiring Organisation
Robert Walters
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£140,000 - £170,000 per annum
volumes of financial and transactional data Contribute directly to architecture, system design, and hands-on software development Drive engineering best practices across automation, testing, observability, and performance Build resilient, production-grade systems with a strong focus on reliability and scalability Work across the full software development lifecycle from design through … scalability, and high-availability systems Experience building automated, production-grade platforms with minimal manual intervention Familiarity with cloud-native technologies, CI/CD, and observability tooling Strong engineering mindset with a hands-on approach to development Interest in modern engineering tooling, including AI-assisted development workflows Robert Walters Operations Limited ...

Senior Engineering Manager, Developer Experience

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
About the team The Developer Experience team owns the internal platform that every the company engineer touches daily: CI/CD pipelines, observability tooling, our developer portal, and an emerging AI platform. It's a high-visibility role: the work you lead directly shapes the productivity of hundreds of engineers … looks like at the company as we scale. What you'll do Lead and develop a growing team of 5+ highly motivated engineers across observability, CI/CD, developer portal (Backstage), and FinOps tooling — setting clear priorities and establishing strong ways of working. Own and evolve the technical roadmap across ...

Platform Operations Director

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
business continuity across the group. Internal IT & Systems Manages internal IT and business systems administration (M365, NetSuite, SuccessFactors, SharePoint) —infrastructure, integrations, and IAM. Ensures observability and SRE capability is fit for purpose across cloud, hosted, and end-user environments. Vendor & Cost Management Drives cloud and vendor cost discipline — manages …/CD infrastructure requirements. Head of Infrastructure & Cloud — Direct report. Hosting strategy, cloud platform, and FinOps execution. Head of SRE — Direct report. Observability, on-call, and DR/BCP processes. Head of Internal Services — Direct report. Internal IT, business systems, and end-user support. Finance — Direct report. Cloud cost visibility ...

Platform Modernisation Lead

Hiring Organisation
Adecco
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£800 - £900/day
role demands strong leadership and a strategic mindset as you define and embed our cloud operating model, aligning it with change and release management, observability, monitoring, alerting, and support processes. Key Responsibilities: Lead the design and implementation of a hybrid multi-cloud container platform across Azure, AWS, and GCP. Ensure … corporate governance processes, standards, and tooling. Define and embed a robust cloud operating model that aligns with organisational change and release management. Develop observability, monitoring, and alerting strategies to ensure operational excellence. Maintain end-to-end accountability for platform production readiness, ensuring it meets enterprise standards. Support and enable ...

Enterprise Head of AI Engineering (Founding)Sales Development Representative (SDR)

Hiring Organisation
Pyxos
Location
City of London, London, United Kingdom
position You will own the technical direction of our agent surface, the proprietary build environment behind it (our Agentic Studio), and the evaluation, observability, and safety layers that make the system trustworthy enough for regulated enterprise deployment. We build with AI: agentic development tooling is core to how Pyxos ships … then-execute patterns, output validation, tool-use restrictions, policy enforcement. • Production engineering rigor. Strong Python; cloud fluency (AWS, GCP, or Azure); CI/CD, observability, cost attribution. • Engineering leadership at startup pace. You have hired, managed, and grown teams — not just been an individual contributor. Nice to have: regulated-industry ...

Splunk Lead Engineer

Hiring Organisation
VIQU IT
Location
London, Bishopsgate, United Kingdom
Employment Type
Contract
Contract Rate
£550 - £700/day Inside IR35
client a leading finance house are looking for a Lead Splunk Engineer to take the lead in the design and implementation of monitoring and observability patterns and standards within the Observability Team. This role will act as a technical authority, ensuring best practices are followed, automation first approach is taken … mentoring the team to build sustainable capability, advocate monitoring and observability best practice to the wider technology domain. For this opportunity you will have proven skills in: · Attention to detail with the ability to craft concise, informational user documentation · Experience of researching and developing solutions that expand, modernise or improve ...

AI Engineer

Hiring Organisation
Hyre AI Limited
Location
Paddington, Warrington, United Kingdom
Employment Type
Permanent
Salary
GBP 60,000 - 80,000 Annual
tool-calling patterns Extend the MCP server with new tools and capabilities Enforce structured outputs and validation across LLM boundaries 2. LLM Quality, Evals & Observability Build the layer that lets the team ship LLM features with confidence. You will: Design and grow the eval platform - golden datasets, regression suites … judge Integrate observability and tracing across providers and prompt versions Track cost, latency, and quality per prompt, model, and client Build guardrails for prompt injection, PII, and output safety Drive prompt engineering practice - versioning, A/B testing, platform overlays 3. Cloud & Data Infrastructure Own the cloud substrate that runs ...

AI Engineer

Hiring Organisation
Hyre AI Limited
Location
City of Westminster, Greater London, Paddington, United Kingdom
Employment Type
Permanent
Salary
£60000 - £80000/annum Plus Equity
tool-calling patterns Extend the MCP server with new tools and capabilities Enforce structured outputs and validation across LLM boundaries 2. LLM Quality, Evals & Observability Build the layer that lets the team ship LLM features with confidence. You will: Design and grow the eval platform - golden datasets, regression suites … judge Integrate observability and tracing across providers and prompt versions Track cost, latency, and quality per prompt, model, and client Build guardrails for prompt injection, PII, and output safety Drive prompt engineering practice - versioning, A/B testing, platform overlays 3. Cloud & Data Infrastructure Own the cloud substrate that runs ...

DevOps Engineer

Hiring Organisation
Lorien
Location
West Drayton, England, United Kingdom
DevOps and platform delivery across the squad • Building and improving CI/CD pipelines, automation and infrastructure standards • Supporting operational stability through monitoring, observability and proactive maintenance • Working closely with architecture, cyber and platform teams to deliver pragmatic outcomes • Leading infrastructure improvements, environment optimisation and governance alignment • Creating reusable templates … platform engineering, DevOps and infrastructure automation • Experience with CI/CD tooling such as GitHub Actions, Jenkins or similar • Knowledge of monitoring and observability tools (Datadog, CloudWatch etc.) • Experience operating in large-scale, complex enterprise environments • Ability to balance technical delivery with stakeholder management Bonus experience: • Previous experience within consulting ...

Site Reliability Engineer

Hiring Organisation
Autonomai Recruitment
Location
London Area, United Kingdom
intervention. Working extensively in Linux-based environments supporting production infrastructure. Monitoring, troubleshooting, and resolving issues across distributed systems and services. Improving incident response, alerting, observability, and system resilience. Partnering with engineering and infrastructure teams to deliver robust operational support. Contributing to performance tuning and support for low-latency environments. What … communication skills and the ability to work closely with technical teams. Nice to have Experience in latency-sensitive environments. Familiarity with monitoring, logging, and observability tooling. Exposure to cloud, containers, or infrastructure-as-code. Experience working in environments with strong automation and change control. Why the role stands ...

Data Platform Engineer

Hiring Organisation
Cognify Search
Location
London Area, United Kingdom
ideally AWS CDK A solid understanding of data platform design, scalability and performance optimisation Experience building reliable systems with CI/CD, testing and observability practices Knowledge of modern data storage technologies and large-scale data processing Why This Role Stands Out Most Data Engineering roles focus on moving data … critical commercial workflows. The challenges are genuinely engineering-focused: High-volume event processing Real-time decision support Low-latency data systems Platform reliability and observability Revenue-critical infrastructure You'll join a business with a genuine tech-for-good mission, solving complex problems with data while helping shape the future ...

Software Engineer III

Hiring Organisation
Expedia Group
Location
Greater London, England, United Kingdom
lines of business, ensuring scalability and preventing regression to existing services Test, debug, and resolve production issues within established SLAs, maintaining system reliability and observability Proactively collaborate with peers across the organization to identify cross-dependencies and engage in shared problem-solving Identify areas of inefficiency in code or system … with on-call responsibilities for tier 1 or business-critical services, including incident response, troubleshooting, and rollback procedures Experience setting up alerts, monitors, and observability tooling for critical production systems Demonstrated ability to architect services end-to-end and build scalable, resilient distributed systems Experience designing and building APIs ...

Senior Data Analyst - Product Reliability

Hiring Organisation
Wise
Location
Greater London, United Kingdom
Employment Type
Full Time
Salary
60000 to 85000 GBP Annually
Grafana, Lightdash, or Superset Ability to build and manage data pipelines that are modularised and scalable, using tools like DBT and Airflow Familiarity with observability/reliability concepts (SLIs, SLOs, incidents) Some experience with Python/data transformation (DBT, etc.) is helpful This is not a data engineering role, depth … KPIs Scaling our infrastructure at Wise: how we make it work Wise's Tech Stack 2025 Measuring meaningful availability/uptime of Wise Why Observability is a must for product engineering teams Grafana Mimir Compaction: From Bottleneck to Savings For everyone, everywhere. We're people building money without borders — without ...