351 to 375 of 567 Observability Jobs in the UK

Azure DevOps Engineer (Kafka)

Hiring Organisation
Digital Waffle
Location
United Kingdom
data movement Support and evolve data platforms (Databricks ideal) Build and maintain data pipelines (batch + streaming/ETL/ELT) Improve platform reliability, observability, and performance Collaborate with engineering teams to improve developer experience Requirements Strong Azure cloud experience Background in Platform Engineering, DevOps, or SRE Strong experience with … Strong understanding of data pipelines and distributed systems Focus on automation, scalability, and reliability Nice to Have Lakehouse or large-scale data platform experience Observability tooling (Datadog, Grafana, Prometheus) SaaS/high-growth product experience Strong developer experience mindset ...

Platform Engineer

Hiring Organisation
Digital Waffle
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £75,000 per annum
data movement Support and evolve data platforms (Databricks ideal) Build and maintain data pipelines (batch + streaming/ETL/ELT) Improve platform reliability, observability, and performance Collaborate with engineering teams to improve developer experience Requirements Strong Azure cloud experience Background in Platform Engineering, DevOps, or SRE Strong experience with … Strong understanding of data pipelines and distributed systems Focus on automation, scalability, and reliability Nice to Have Lakehouse or large-scale data platform experience Observability tooling (Datadog, Grafana, Prometheus) SaaS/high-growth product experience Strong developer experience mindset ...

DevOps Engineer

Hiring Organisation
Coltech
Location
Manchester Area, United Kingdom
pipeline engineering experience using Jenkins, GitLab, GitHub Actions or Azure DevOps Strong Linux administration and Bash/Python scripting skills Experience with monitoring, observability and logging tools such as Prometheus, Grafana, Dynatrace or ELK Experience troubleshooting live production issues across cloud-native environments Strong understanding of ConfigMaps, Secrets, HPA, PDBs …/CD and automation workflows Working closely with engineering teams to improve reliability and scalability Supporting platform upgrades, deployments and infrastructure changes Driving observability, monitoring and operational improvements Helping improve security, resilience and platform performance across cloud-native systems If this sounds relevant, feel free to apply or message directly ...

AI Native Software Engineer

Hiring Organisation
TekWissen UK
Location
London Area, United Kingdom
invocation, and policy‐based routing Build cloud‐native backend services and APIs to support AI‐driven applications and enterprise integrations Implement evaluation, monitoring, and observability frameworks to ensure accuracy, latency, reliability, and system health across AI agent lifecycles Optimize AI and system performance across cost, scalability, and latency dimensions … Frameworks: LangGraph, AutoGen, CrewAI (or similar) Cloud & DevOps Tooling: Docker, Kubernetes, Terraform, Helm, CI/CD pipelines Enterprise Integration: APIs, enterprise platforms, monitoring and observability tools Why You’ll Love This Role Build real, enterprise‐grade AI systems that move beyond experimentation into production Remain deeply technical ...

Kubernetes Linux AIOps Engineer – Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
City of London, London, United Kingdom
Infrastructure DevOps Engineer/SRE with expertise in Kubernetes, Linux, Observability, IaC and AIOps sought by a market-leading Quantitative Hedge Fund to further aide further business growth. Our client is one of the World's Elite Quant Hedge Fund Managers with large-scale, massively Distributed Systems, and ample opportunity … Terraform, C...) Must be able to write high quality Automation/scripts from scratch. Configuration Management Tools (Ansible/Puppet/Kapitan/Terraform....) Observability: Experience within the modern open-source ecosystem (ELK, OpenTelemetry, LGTM stack, Prometheus, Grafana, Loki...) CI/CD and GitLab/GitOps : working with Development teams. ...

Software Engineer (React Native)

Hiring Organisation
Centrica - CHP
Location
Windsor, Berkshire, South East, United Kingdom
Employment Type
Permanent
source control, CI/CD pipelines, and modern engineering practices Utilise Azure DevOps and GitHub for work management, repositories, pipelines, and artifacts Implement comprehensive observability solutions using Datadog for monitoring and performance optimisation Ensure security and code quality through automated scanning and testing processes Leverage Launch Darkly for feature flag … EventBridge, and Step Functions Solid experience with Infrastructure as Code using Terraform Proficiency with Azure DevOps and GitHub for development workflows Experience with observability tools, particularly Datadog Knowledge of security scanning and code quality tools Familiarity with feature flag management tools like Launch Darkly Core Competencies & Technical Skills ...

Senior Java Software Engineer

Hiring Organisation
Synechron
Location
Sheffield, England, United Kingdom
effective usage of copilot Package and deploy services using Docker and Kubernetes Operate and monitor production services using Grafana , Loki , Prometheus, and related observability tooling Manage and query PostgreSQL databases; contribute to schema design and migration scripts Participate in code reviews, incident response Required Skills Core & Advanced Java Strong Java … secrets, liveness/readiness probes, Helm (desirable) Database – PostgreSQL Schema design, indexing, query optimisation JDBC/Spring Data JPA Liquibase or Flyway migrations Monitoring & Observability Grafana : dashboard design, alerting rules Loki : log aggregation, LogQL queries, label strategies Prometheus : metrics scraping, PromQL, alert manager Distributed tracing: Zipkin/Sleuth/OpenTelemetry ...

Principal Java Architect

Hiring Organisation
Jobleads-UK
Location
Nottingham, England, United Kingdom
LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are dedicated, open-access partners with a dedication to excellence in delivering the services our customers expect ...

Software Engineer

Hiring Organisation
Acceler8 Talent
Location
London Area, United Kingdom
training efficiency across 1,000+ GPU clusters Improve utilisation, throughput, and reliability across distributed training infrastructure Build tooling for orchestration, monitoring, scheduling, and observability Work closely with research teams to accelerate large-scale model training 🔧 What They’re Looking For Deep GPU infrastructure/distributed systems experience Strong knowledge ...

Senior Software Engineer (Python)

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £100,000 per annum
cross-service features Make pragmatic architecture and design decisions Own services end-to-end, including performance, reliability, and incident response Set standards for testing, observability, and security Mentor engineers and contribute to strong team practices Collaborate closely with Product, Design, and Data teams What we're looking for: Strong experience ...

AI Engineer

Hiring Organisation
Uneek Global
Location
London Area, United Kingdom
secure LLM integrations at scale. Tech Stack: • Python/FastAPI • LLMs & Agentic AI • MCP Integrations • GraphRAG • Async Architectures • Multi-Agent Systems • Knowledge Graphs • Streaming & Observability Key Experience: • Multi-agent orchestration in production • AI memory systems & write-back design • GraphRAG & retrieval architectures • MCP server integrations • Secure enterprise AI deployments • Agent guardrails ...

AI Engineer

Hiring Organisation
Data Idols
Location
London, United Kingdom
Employment Type
Permanent
Salary
£95000 - £105000/annum
architectures Experience working with APIs, distributed systems, or event-driven architectures Understanding of authentication and security protocols Experience building secure, scalable platforms Knowledge of observability, monitoring, and system reliability Proven experience owning systems end-to-end If you are looking for a new challenge and want to work on cutting ...

Senior Machine Learning Engineer - GenAI, LLM, RAG

Hiring Organisation
Puritas Group
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
cost, and safety signals Integrating LLMs into broader application architectures (APIs, services, orchestration) Working across the full life cycle: data prep - modelling - evaluation - deployment - observability We're looking for people who have: Delivered real GenAI applications into production, not just PoCs Strong Python engineering skills Experience with LangChain, LangSmith, LlamaIndex ...

Agentic Quality Platform Lead

Hiring Organisation
Oliver Bernard
Location
City of London, London, United Kingdom
scaling pipelines Knowledge of AI and how to implement AI tools and processes to reduce Manual workloads Prior hands-on experience across Monitoring and Observability to establish feedback loops and improve platform reliability (eg. Datadog, Prometheus etc) Previous Performance Testing experience, and knowledge on how to integrate this into engineering ...

Machine Learning Engineer

Hiring Organisation
Puritas Group
Location
City of London, London, United Kingdom
hallucinations, cost, and safety signals Integrating LLMs into broader application architectures (APIs, services, orchestration) Working across the full lifecycle: data prep → modelling → evaluation → deployment → observability We’re looking for people who have: Delivered real GenAI applications into production , not just PoCs Strong Python engineering skills Experience with LangChain, LangSmith, LlamaIndex ...

Java Consultant

Hiring Organisation
Stanford Black Limited
Location
London Area, United Kingdom
with portfolio managers and traders to deliver real-time, business-critical technology 🔷 Architect event-driven, distributed systems with strong focus on performance, resilience, and observability 🔷 Drive technical direction across microservices, data streaming, and system design in a fast-moving environment The Role: Join a high-calibre Investment Engineering team embedded ...

Backend Software Engineer - Python

Hiring Organisation
Talent Locker
Location
London, United Kingdom
Employment Type
Permanent
system design Collaborate with data scientists and ML engineers to deploy models into production Mentor engineers and contribute to engineering best practices Improve observability, monitoring, and incident response processes Write maintainable, well-tested code and contribute to code reviews Requirements 5+ years of experience building and operating backend systems ...

AVP Investment Banking Java Full Stack Engineer (Angular/React) - Spring Boot, REST, ELK - PERMANENT

Hiring Organisation
Scope AT Limited
Location
City, Liverpool, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
reliable software. Some other highly valued skills may include: Hands-on full-stack - having UI knowledge and experience - React, Angular. Hands-on logging, observability, reliability - ELK dashboards for monitoring releases and identifying problems proactively Good knowledge of systems architectures and infrastructure with best practices and design patterns Problem-solving skills ...

Event Management Consultant - DV cleared

Hiring Organisation
CBSbutler Holdings Limited trading as CBSbutler
Location
Corsham, Wiltshire, United Kingdom
Employment Type
Contract
Contract Rate
£700 - £750/day
tooling platforms BMC TrueSight, BMC Discovery, and Splunk administration and engineering Agent deployment, configuration, and lifecycle management Product installation, configuration, and customisation Monitoring & Observability Tools SNMP MIB management Experience with tools such as: Zabbix Nagios HP OpenView SolarWinds IBM Tivoli Monitoring IBM Tivoli Netcool Operating Systems Experience across: Windows Server ...

Head of Data & AI Platforms & Engineer with P&C Insurance

Hiring Organisation
SANS Consulting Services, Inc
Location
London Area, United Kingdom
management, and access controls. • Ensure platforms are designed for performance, scalability, reliability, and security. Data Governance, Quality & Compliance • Implement frameworks for data quality, lineage, observability, and metadata management. • Ensure compliance with security, privacy, and regulatory requirements across all data platforms. • Oversee remediation of data quality issues within BAU operations. Change ...

Lead Fullstack Engineer

Hiring Organisation
Zelt
Location
Greater London, England, United Kingdom
motivated by outcomes rather than tasks. Nice to have: Startup or scale-up experience in small, fast-moving teams Familiarity with CI/CD, observability and scaling infrastructure A history of improving performance, code quality or developer experience. Why join Zelt: Work on technically ambitious challenges that will stretch your ...

Operations Team Lead (Production & Reliability)

Hiring Organisation
Complexio
Location
United Kingdom
Looking For Strong experience in SRE, DevOps, Infrastructure, or Production Engineering Prior experience leading technical teams Deep hands-on incident management experience Strong observability and reliability mindset Calm under pressure, clear in communication Systems thinker, fixes root causes, not symptoms How We Think Production is sacred. Clear ownership beats ambiguity. ...

Artificial Intelligence Engineer

Hiring Organisation
Prism Digital
Location
City of London, London, United Kingdom
Next.js Nice to Haves Voice and telephony systems (Twilio, SIP, WebRTC) STT/TTS pipelines (Deepgram, ElevenLabs or similar) Messaging integrations (WhatsApp Business API) Observability tooling (Datadog, Sentry, LangSmith) Previous founding engineer experience Being a founding engineer or an ex-founder from any company backed by Antler or Entrepreneurs First ...

Founding Engineer

Hiring Organisation
Pharosyn
Location
London, England, United Kingdom
system: •⁠ ⁠The development environment, tooling, and workflows that let a small team move extremely fast without breaking production •⁠ ⁠Tight feedback loops between customer usage, observability, evaluation, and execution (tests, CI, code review) •⁠ ⁠Early architectural choices that scale to a $100m+ product with a lean team •⁠ ⁠Treating developer productivity, reliability ...

Software Architect

Hiring Organisation
BBC
Location
Greater London, United Kingdom
Employment Type
Full Time
Salary
72000 to 80000 GBP Annually
offs (e.g. coupling, cohesion, consistency, scalability) Familiarity with evolutionary architecture practices (e.g. fitness functions, incremental change) Experience with modern engineering practices (CI/CD, observability, cloud-native systems) Proven ability to influence without authority across multiple teams Strong facilitation skills - able to guide discussions and surface trade-offs Comfortable operating ...