301 to 325 of 599 Observability Jobs in the UK

Front End Developer

Hiring Organisation
hireful
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £85000/annum £80K - £85K Basic + 10% Bonus + Exte
message processing and SES for transactional email delivery. Experience running workloads on Kubernetes and Amazon EKS, including debugging pod, container and networking issues. Solid observability and diagnostics skills, including log aggregation, metrics analysis and root cause investigation in distributed systems. Familiarity with production support workflows: incident triage, escalation, handover documentation ...

Front End Developer

Hiring Organisation
hireful
Location
Manchester / Work from home UK, Greater Manchester, United Kingdom
Employment Type
Permanent
Salary
£80000 - £85000/annum £80 - £85K Basic + 10% Bonus + Exten
message processing and SES for transactional email delivery. Experience running workloads on Kubernetes and Amazon EKS, including debugging pod, container and networking issues. Solid observability and diagnostics skills, including log aggregation, metrics analysis and root cause investigation in distributed systems. Familiarity with production support workflows: incident triage, escalation, handover documentation ...

Head of AI Systems Engineering

Hiring Organisation
London Export Corporation
Location
London, England, United Kingdom
reliability and operational performance * Build evaluation and feedback loops for continuous system improvement Ship Production Infrastructure * Write and operate production-grade backend systems * Improve observability, monitoring, and debugging workflows * Optimise latency, reliability, and infrastructure efficiency * Help scale systems across multiple products and business environments * Build reusable infrastructure rather than isolated ...

Senior Software Engineer

Hiring Organisation
Keith
Location
London, England, United Kingdom
with a strong backend emphasis Strong command of Python, TypeScript and React/Next.js Experience building and operating scalable, secure systems - you understand infrastructure, observability and deployment pipelines Experience with AI agent orchestration frameworks (LangGraph, OpenAI Agent SDK, or similar) Strong architectural judgement - you'll be making foundational decisions about ...

Data Science Manager

Hiring Organisation
Aristocrat
Location
Greater London, United Kingdom
Employment Type
Full Time
best software engineering practices for internal tools and ML/RL model development, define software architecture standards, implement code review practices, auto-tests, improve observability, reproducibility and monitoring of ML/RL solutions. Infrastructure Ownership: Own the development of analytical frameworks, including A/B testing (using Bayesian Inference ...

DevOps Delivery Manager

Hiring Organisation
Mastek
Location
Leeds, England, United Kingdom
tooling (e.g., ServiceNow or Jira Service Management). Working knowledge of modern engineering practices (CI/CD, IaC, blue/green or canary deploys, observability). ...

Enterprise Hybrid Platform Architect (Advisory) – Manager – National Security

Hiring Organisation
KPMG UK
Location
England, United Kingdom
landing zone specific” solutions. You must understand the importance of both enterprise MI, for long term decision making, and (near) real time observability for operations. You should have an appreciation of different operating models associated with both legacy and cloud environments, and be able to contribute to op model design ...

Head of Integration, Data & GenAI Engineering

Hiring Organisation
AJ Bell
Location
Salford, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
business value, with appropriate guardrails around security, risk and responsible use. Ensuring engineering solutions are secure, scalable, resilient and supportable, with appropriate governance, standards, observability and controls built in from the outset. Working closely with product-aligned engineering teams, infrastructure, information security, architecture and business stakeholders to deliver joined ...

System Performance Engineer

Hiring Organisation
Visa
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
/JVM tuning or diagnostic experience. Basic Kubernetes setup or cluster experimentation experience. Understanding of networking fundamentals. Experience with open-source tools used for diagnostics, observability, or system performance. Basic programming ability (any language) to support automation or small tooling improvements. Exposure to distributed systems concepts or cloud environments ...

Lead SRE

Hiring Organisation
Pulse Recruit
Location
London Area, United Kingdom
define how reliability is embedded across the wider platform as it continues to scale. Key Responsibilities Leading the design and evolution of monitoring and observability systems Defining and driving SLOs, SLIs and error budgets across teams Owning incident management processes, post-mortems and continuous improvement Partnering with engineering teams … best practices across the organisation Tech Environment GCP and AWS Kubernetes and containerised workloads Terraform and Infrastructure as Code Prometheus, Grafana, Datadog and modern observability tooling CI/CD pipelines and automation tooling Python, Go or similar scripting languages Distributed systems at scale About You Strong background in SRE, DevOps ...

AWS DevOps Engineer - Cloud Native

Hiring Organisation
83zero Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £90000/annum 5% Bonus, Pension 6%, PH
pipelines using tools such as GitLab CI, Jenkins, or ArgoCD Deploying and managing containerised workloads with Docker and Kubernetes Implementing monitoring, logging, and observability solutions using Prometheus, Grafana, ELK, and CloudWatch Improving platform scalability, automation, resilience, and security Working within Agile delivery teams across complex transformation programmes Supporting DevSecOps … pipeline engineering Docker and Kubernetes Scripting and automation using Python, Bash, or PowerShell AWS networking including VPCs, subnets, and security groups Monitoring and observability tooling Troubleshooting and optimising cloud infrastructure Working within secure or regulated environments What's on Offer Exposure to enterprise-scale cloud transformation programmes Access to industry ...

Generative AI Engineer

Hiring Organisation
Immersum
Location
City of London, London, United Kingdom
event-driven architectures using Kafka, enabling real-time data processing, system decoupling, and auditability Ensure high performance, reliability, and scalability across distributed systems, including observability, monitoring, and production readiness Partner with product, operations, and investment stakeholders to refine requirements and deliver iterative solutions in Agile environments Core requirements 8+ years … Working with OpenAI or Anthropic APIs Using vector databases and embedding-based search systems Applying prompt engineering techniques effectively Building AI evaluation, monitoring, and observability frameworks Understanding ML fundamentals where relevant (embeddings, fine-tuning, context engineering) Additional note The business is also interested in speaking with UK-based engineers ...

Senior Solutions Architect

Hiring Organisation
Code Wizards Group
Location
Theale, Berkshire, UK
scaling strategies Integrate GameLift with AWS services (Lambda, DynamoDB, API Gateway, etc.) Optimise cost, performance, and multi-region deployments Implement monitoring, logging, and observability solutions Qualifications AWS Certified Solutions Architect – Professional (required) Additional AWS certifications (e.g., DevOps, Security) desirable SKILLS AND EXPERIENCE Proven experience in solutions architecture or senior technical … Code: Terraform, CloudFormation, AWS CDK Game Development: Unreal/Unity basics, backend integration patterns Multiplayer Systems: Backend architecture, session management, real-time systems Observability: CloudWatch, Prometheus, Grafana, or similar Soft Skills Strong communication and presentation skills Ability to engage both technical and non-technical stakeholders Strategic thinking and problem-solving ...

Senior Solutions Architect

Hiring Organisation
Code Wizards Group
Location
Theale, England, United Kingdom
scaling strategies Integrate GameLift with AWS services (Lambda, DynamoDB, API Gateway, etc.) Optimise cost, performance, and multi-region deployments Implement monitoring, logging, and observability solutions Qualifications AWS Certified Solutions Architect – Professional (required) Additional AWS certifications (e.g., DevOps, Security) desirable SKILLS AND EXPERIENCE Proven experience in solutions architecture or senior technical … Code: Terraform, CloudFormation, AWS CDK Game Development: Unreal/Unity basics, backend integration patterns Multiplayer Systems: Backend architecture, session management, real-time systems Observability: CloudWatch, Prometheus, Grafana, or similar Soft Skills Strong communication and presentation skills Ability to engage both technical and non-technical stakeholders Strategic thinking and problem-solving ...

PHP Engineer

Hiring Organisation
Moorepay
Location
Manchester, Lancashire, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
documentation for services, features, and reusable components. Cloud-Native Engineering & DevOps Practices: Deploy and maintain services using CI/CD pipelines. Instrument code for observability, logging, and performance insights. Participate in incident resolution and root-cause analysis for issues within the squad’s domain. Follow best practices for cloud development … Agents. Experience with cloud services such as AWS, Azure, or serverless platforms. Interest in distributed systems, event-driven architectures, or DDD concepts. Familiarity with observability tooling and debugging complex systems. Skills & Experience Proven experience as a software engineer in a modern development environment, including agentic approaches. Strong experience with ...

MLOps Architect - AWS

Hiring Organisation
Quantiphi
Location
United Kingdom
based systems. Serve as a technical authority across multiple internal and customer projects, contributing architectural patterns, best practices, and reusable frameworks. Enable observability, monitoring, drift detection, lineage tracking, and auditability across ML/LLM systems. Define and implement standards for model deployment, monitoring, governance, and automation to ensure production-grade … code (Terraform, Helm, CDK). Hands-on understanding of model drift detection, A/B testing, canary rollouts, and blue-green deployments. Familiarity with Observability stacks (Prometheus, Grafana, CloudWatch, OpenTelemetry). SQL and data transformation experience using Snowflake, Databricks, Spark. Ability to translate business goals into scalable AI/ ...

Platform Engineer (Cloud)

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
City of London, London, United Kingdom
this role, you would be responsible for designing, developing and managing platform APIs to automate cloud workflows, as well as contributing to platform observability including monitoring, logging and tracing. The role involves collaboration with teams across the firm, primarily including Cloud Engineering and Security. Stack: Python/Go, AWS, Kubernetes ...

SRE Lead (Banking/Financial)

Hiring Organisation
Ascendion
Location
City of London, London, United Kingdom
production systems. Essential Skills & Experience: Proven experience as an SRE Lead or Senior SRE in large-scale, high-availability production environments. Strong experience with observability and monitoring tools such as Datadog, Grafana, Prometheus, PagerDuty, or similar. Experience managing incident response, on-call processes, and post-incident reviews. Strong understanding ...

Software Engineer

Hiring Organisation
Acceler8 Talent
Location
London Area, United Kingdom
Scale and optimise multi-cloud GPU clusters Build tooling for scheduling, remediation, and node health Debug GPU/NCCL performance at cluster scale Improve observability, storage, and infrastructure reliability 🔧 What They’re Looking For Strong systems engineering background Deep Kubernetes + GPU infrastructure experience Strong coding ability Experience with NCCL ...

GCP Data Engineer - Dataflow

Hiring Organisation
Norton Blake
Location
United Kingdom
frameworks using batch and streaming approaches • Working with structured and semi-structured data • Collaborating with Data Modelling & Analytics teams • Driving data reliability, monitoring, and observability • Automating deployments and workflows • Contributing to tooling and framework decisions What we’re looking for: • Strong ETL/ELT pipeline experience • Proven GCP data services ...

Senior Quant C++ Engineer

Hiring Organisation
Harrington Starr
Location
United Kingdom
Translating research models into production systems Profiling and performance optimisation across critical paths Multithreading, concurrency and lock-free programming Monitoring live systems and improving observability Working closely with core trading and infrastructure teams on performance improvements They are looking for someone with: Strong modern C++ experience (C++17+) Background in performance ...

Technical Lead

Hiring Organisation
Solvex Solutions
Location
United Kingdom, UK
ensuring reliable event delivery to platform consumers. • Ensuring the integration layer includes appropriate resilience and operational capabilities, including retry strategies, dead-letter handling, observability and replay support. • Contributing to the design and implementation of the legacy integration gateway used for platform-to-legacy communication during the coexistence phase. • Working closely ...

Junior Trading Infrastructure Engineer: £60k + Market leading Bonus Structure

Hiring Organisation
Hunter Bond
Location
City of London, London, United Kingdom
technology infrastructure, including: computational hardware and storage, operating systems and virtualization technology, real-time data streaming platforms, CI/CD infrastructure, load balancers, observability tools, container orchestration platform Requirements: Strong experience with Linux Ability to program or script in one or more of: Python, Bash, PowerShell Experience with ...

Delivery Lead

Hiring Organisation
Pyramid Consulting, Inc
Location
London, UK
delivery metrics, quality KPIs, and continuous improvement practices. Own delivery risk management, escalations, and resolution across vendors and internal teams. Drive automation, monitoring, and observability across data pipelines. Ensure adherence to Agile best practices and engineering standards. Mentor senior engineers and foster a high‐performance delivery culture. Bring 15+ years ...

Data Engineer

Hiring Organisation
Intellect Group
Location
City of London, London, United Kingdom
experience with Polars, Parquet and/or Arrow/PyArrow Experience owning pipelines in production end-to-end A strong engineering mindset around reliability, observability and delivery Ability to work independently and help drive things forward Nice to have: Time-series, high-volume or event-driven data experience Orchestration exposure ...