Observability Jobs in Central London

251 to 275 of 301 Observability Jobs in Central London

Production Engineer - Hedge Fund

City of London, London, United Kingdom
Attribution Search
senior partners who have been there since its launch. The Role: You'll join the fund's global technology team, where you will focus on the resilience, automation and observability of production systems that power a mission-critical quantitative trading platform. The role forms part of a follow-the-sun global support model. Primary Duties: Build and maintain automated tools … core requirement), with additional experience using T-SQL and Bash. Infrastructure & Systems: Exposure to Linux and Windows environments, with working knowledge of Docker containers and AWS cloud services. Monitoring & Observability: Familiarity with DataDog, Grafana, and other internal or custom monitoring solutions. Automation & CI/CD: Experience using Git, TeamCity, and configuration management tools such as Ansible or Terraform. Databases: Hands More ❯

Senior Backend Engineer (Python | AI | 3D Environments | £130,000)

City of London, London, United Kingdom
Hybrid / WFH Options
Paradigm Talent
and desktop applications. Design and implement distributed systems for async processing, ML workflows, and asset pipelines. Own authentication, billing, and subscription systems, ensuring reliability and seamless user experience. Drive observability, performance tuning, and deployment automation across the stack. Collaborate closely with product, frontend, and ML teams to deliver features that delight and scale. You should have 5+ years of experience … with cloud-native infrastructure (AWS, GCP, or Azure). Familiarity with auth, billing, or subscription systems. Background in 3D graphics, creative tooling, or ML pipelines. Knowledge of observability tools like Grafana, Prometheus, or OpenTelemetry. This is a rare opportunity to join an early-stage team backed by leading deep-tech investors, building the foundation of a platform that …

Production Engineer - Hedge Fund

london (city of london), south east england, united kingdom
Attribution Search
senior partners who have been there since its launch. The Role: You'll join the fund's global technology team, where you will focus on the resilience, automation and observability of production systems that power a mission-critical quantitative trading platform. The role forms part of a follow-the-sun global support model. Primary Duties: Build and maintain automated tools … core requirement), with additional experience using T-SQL and Bash. Infrastructure & Systems: Exposure to Linux and Windows environments, with working knowledge of Docker containers and AWS cloud services. Monitoring & Observability: Familiarity with DataDog, Grafana, and other internal or custom monitoring solutions. Automation & CI/CD: Experience using Git, TeamCity, and configuration management tools such as Ansible or Terraform. Databases: Hands More ❯

Senior Backend Engineer (Python | AI | 3D Environments | £130,000)

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Paradigm Talent
and desktop applications. Design and implement distributed systems for async processing, ML workflows, and asset pipelines. Own authentication, billing, and subscription systems, ensuring reliability and seamless user experience. Drive observability, performance tuning, and deployment automation across the stack. Collaborate closely with product, frontend, and ML teams to deliver features that delight and scale. You should have 5+ years of experience … with cloud-native infrastructure (AWS, GCP, or Azure). Familiarity with auth, billing, or subscription systems. Background in 3D graphics, creative tooling, or ML pipelines. Knowledge of observability tools like Grafana, Prometheus, or OpenTelemetry. This is a rare opportunity to join an early-stage team backed by leading deep-tech investors, building the foundation of a platform that …

Data Engineer

City of London, London, United Kingdom
Xcede
and expose data across multiple internal platforms. Partner with quant stakeholders to translate real-world requirements into high-performance data solutions. Expand data platform functionality, improving latency, scalability, and observability as usage grows. Own processes around quality assurance, validation, and error monitoring of datasets. Explore and introduce new technologies and tooling to keep systems efficient and future-proof. Play a …

Data Engineer

london (city of london), south east england, united kingdom
Xcede
and expose data across multiple internal platforms. Partner with quant stakeholders to translate real-world requirements into high-performance data solutions. Expand data platform functionality, improving latency, scalability, and observability as usage grows. Own processes around quality assurance, validation, and error monitoring of datasets. Explore and introduce new technologies and tooling to keep systems efficient and future-proof. Play a …

Senior Software Engineer

City of London, London, United Kingdom
Nando's UK & IRE
of features end-to-end – from discovery to delivery and continuous improvement Leading with technical excellence: shaping architectural decisions, championing TypeScript best practices, and nurturing a strong testing and observability culture 📈 Supporting your teammates through pairing, code reviews, and mentoring – helping everyone grow together 💬 Not sure if you tick every box? 🤔 If this role excites you but you don’t … teamwork, mentoring, and creating a safe, supportive engineering culture 💕 Nice to have: Experience with event-driven architectures (Pub/Sub, queues), form builders, or domain-driven design Familiarity with observability practices like logging, tracing, and monitoring 🔍 Everyone is welcome At Nando’s, everyone is welcome . Inspired by our Southern African heritage, we know and celebrate the richness that diversity More ❯

Senior Software Engineer

City of London, London, United Kingdom
Burns Sheehan
Senior Java Engineer – Product Engineering | B2B SaaS | Insurtech Up to £120,000 per annum plus bonus and excellent pension London – 2 days a week Java | Spring Boot | AWS | Kubernetes | Event-Driven Architecture Senior Java Engineer – We have been exclusively engaged More ❯

Lead Software Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options
Morson Edge (Technology)
Join our client at the forefront of AI innovation. They’re shaping the digital foundations that will define how humans and AI learn from each other — pioneering technology that is set to transform the way intelligent systems evolve. The Role More ❯

Senior TypeScript Back-End Engineer

City of London, London, United Kingdom
Wave Talent
design, build, and support extensible, low-maintenance back-end services. Partner with Product, Design, Operations, and Growth to prioritise customer-facing and internal problems that drive value. Champion security, observability, and reliability best practices. Mentor teammates and help cultivate a healthy, innovative engineering culture. Tech Stack: Core: TypeScript/JavaScript (server-side) Cloud & Infra: AWS (cloud-based architectures), Docker, CI …/CD; IaC such as Terraform or CloudFormation Quality & Ops: Security and observability tooling/best practices Bonus exposure: React on the front end (nice to have) What we’re looking for We value skill and impact over strict year counts. You should have: Strong TypeScript/JavaScript fundamentals and experience building high-traffic server-side web applications. Solid understanding … of cloud-based application architecture (preferably AWS). Hands-on experience with Docker and CI/CD tooling. Practical grasp of security and observability best practices. Clear, collaborative communication and leadership skills. Why join? Work on meaningful problems in a fun, healthy, productive environment. Competitive package: £80k–£100k + Bonus, up to 10% employer pension, 28 days holiday (plus bank More ❯

Senior TypeScript Back-End Engineer

london (city of london), south east england, united kingdom
Wave Talent
design, build, and support extensible, low-maintenance back-end services. Partner with Product, Design, Operations, and Growth to prioritise customer-facing and internal problems that drive value. Champion security, observability, and reliability best practices. Mentor teammates and help cultivate a healthy, innovative engineering culture. Tech Stack: Core: TypeScript/JavaScript (server-side) Cloud & Infra: AWS (cloud-based architectures), Docker, CI …/CD; IaC such as Terraform or CloudFormation Quality & Ops: Security and observability tooling/best practices Bonus exposure: React on the front end (nice to have) What we’re looking for We value skill and impact over strict year counts. You should have: Strong TypeScript/JavaScript fundamentals and experience building high-traffic server-side web applications. Solid understanding … of cloud-based application architecture (preferably AWS). Hands-on experience with Docker and CI/CD tooling. Practical grasp of security and observability best practices. Clear, collaborative communication and leadership skills. Why join? Work on meaningful problems in a fun, healthy, productive environment. Competitive package: £80k–£100k + Bonus, up to 10% employer pension, 28 days holiday (plus bank More ❯

AWS Cloud Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Advanced Resource Managers
manage and support a customer’s AWS and Data platform. To be technical hands on. Provide Incident and problem management on the AWS IaaS and PaaS Platform. Monitoring and observability of system and platform performance. Collaboration with development and build teams on application and platform deployments and changes. Involvement in the resolution of Incidents and problems in an efficient and … timely manner. Actively monitor an AWS platform and components for technical issues. Implement and improve on existing monitoring and observability solution. To be involved in the resolution of technical incidents tickets. Assist in the root cause analysis of incidents. Assist with improving efficiency and processes within the team. Examining traces and logs. Escalate incidents and problems to the appropriate teams …

AWS Cloud Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Advanced Resource Managers
manage and support a customer’s AWS and Data platform. To be technical hands on. Provide Incident and problem management on the AWS IaaS and PaaS Platform. Monitoring and observability of system and platform performance. Collaboration with development and build teams on application and platform deployments and changes. Involvement in the resolution of Incidents and problems in an efficient and … timely manner. Actively monitor an AWS platform and components for technical issues. Implement and improve on existing monitoring and observability solution. To be involved in the resolution of technical incidents tickets. Assist in the root cause analysis of incidents. Assist with improving efficiency and processes within the team. Examining traces and logs. Escalate incidents and problems to the appropriate teams …

GenAI Engineer

City of London, London, United Kingdom
Clarity (formerly Anecdote)
up and harden RAG pipelines (indexing, retrieval policies, grounding, guardrails) and agent frameworks. Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐cause analysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors … evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain results clearly, align stakeholders, and write crisp docs. Bonus points DevOps wizardry; GPU/accelerator experience. Multimodal pipelines (text + voice + screenshots). More ❯

GenAI Engineer

london (city of london), south east england, united kingdom
Clarity (formerly Anecdote)
up and harden RAG pipelines (indexing, retrieval policies, grounding, guardrails) and agent frameworks. Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐cause analysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors … evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain results clearly, align stakeholders, and write crisp docs. Bonus points DevOps wizardry; GPU/accelerator experience. Multimodal pipelines (text + voice + screenshots). More ❯

AppSec Lead

Central London, London, United Kingdom
Hybrid / WFH Options
Halian Technology Limited
A leading fintech company is seeking a Lead AppSec Engineer to join their established team. You'll be instrumental in embedding security into every stage of the software development lifecycle, guiding engineers, shaping best practices, and driving secure, scalable solutions across our …
Employment Type: Permanent, Work From Home

Staff Site Reliability Engineer - Observability

City of London, London, United Kingdom
Hybrid / WFH Options
Motive Group
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry systems, and … distributed training environments used by leading research and enterprise teams. They’re looking for a Senior or Staff SRE with deep experience in observability at massive scale - someone who’s tuned Prometheus/Mimir, Loki, or Tempo clusters beyond 100M+ series or 10TB/day logs, and who thrives in highly technical, fast-moving environments. You’ll be working on … Designing and scaling observability for globally distributed GPU infrastructure Building automation that cuts operational toil and improves reliability Partnering with platform and infrastructure teams to deliver true visibility across complex AI systems If you’ve built or operated telemetry stacks for large-scale, GPU-heavy, or multi-tenant environments - and want to work on cutting-edge problems in a business More ❯

Staff Site Reliability Engineer - Observability

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Motive Group
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry systems, and … distributed training environments used by leading research and enterprise teams. They’re looking for a Senior or Staff SRE with deep experience in observability at massive scale - someone who’s tuned Prometheus/Mimir, Loki, or Tempo clusters beyond 100M+ series or 10TB/day logs, and who thrives in highly technical, fast-moving environments. You’ll be working on … Designing and scaling observability for globally distributed GPU infrastructure Building automation that cuts operational toil and improves reliability Partnering with platform and infrastructure teams to deliver true visibility across complex AI systems If you’ve built or operated telemetry stacks for large-scale, GPU-heavy, or multi-tenant environments - and want to work on cutting-edge problems in a business More ❯

Site Reliability Engineer - AWS - Grafana - CloudWatch - ELK - UK Remote

City of London, London, United Kingdom
Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | CloudFormation | ECS | ELK | Elasticsearch | Logstash | Kibana | CloudWatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered with a WebApp that provides end-to-end event management for some of the … planet's biggest artists, and they're now looking for an SRE: someone who knows their way around classic Observability with Grafana, the ELK stack, and cost optimisation for the product as they continue scaling. Working across the globe, their multi-tenanted AWS environments require someone who is able to reverse engineer product faults, or post incident audits to ensure future … like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | CloudFormation | ECS | ELK | Elasticsearch | Logstash | Kibana | CloudWatch | Grafana | Windows | Observability

Site Reliability Engineer - AWS - Grafana - CloudWatch - ELK - UK Remote

Central London / West End, London, United Kingdom
Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | CloudFormation | ECS | ELK | Elasticsearch | Logstash | Kibana | CloudWatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered with a WebApp that provides end-to-end event management for some of the … planet's biggest artists, and they're now looking for an SRE: someone who knows their way around classic Observability with Grafana, the ELK stack, and cost optimisation for the product as they continue scaling. Working across the globe, their multi-tenanted AWS environments require someone who is able to reverse engineer product faults, or post incident audits to ensure future … like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | CloudFormation | ECS | ELK | Elasticsearch | Logstash | Kibana | CloudWatch | Grafana | Windows | Observability

Senior Data Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Identify Solutions
the past year and aggressive expansion across the UK, US, and EU, the company is scaling at pace. Data is the backbone: from APIs and pipelines to governance and observability, their data platform directly powers customer-facing products and AI-driven insights. They’re now hiring a Senior Data Engineer to own and shape this platform, building scalable, production-grade … systems that become the foundation for global brands. Why join? ✨ Greenfield impact – inherit a live but early platform, define best practice across structure, testing, observability, and governance. ✨ Direct product impact – your APIs, pipelines, and orchestration power the platform that 1,000+ brands rely on every day. ✨ AI at the core – work on infrastructure that enables machine learning and intelligent decision … doing: API strategy & development – own and scale FastAPI endpoints that deliver real-time access to platform data. Data pipeline development – build ingestion and replication pipelines with best-in-class observability, latency, and resilience. Platform technical vision – influence architecture and orchestration, shaping how the business handles data at scale. Data quality & governance – embed testing, freshness, lineage, and monitoring to ensure reliability More ❯

Senior Data Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Identify Solutions
the past year and aggressive expansion across the UK, US, and EU, the company is scaling at pace. Data is the backbone: from APIs and pipelines to governance and observability, their data platform directly powers customer-facing products and AI-driven insights. They’re now hiring a Senior Data Engineer to own and shape this platform, building scalable, production-grade … systems that become the foundation for global brands. Why join? ✨ Greenfield impact – inherit a live but early platform, define best practice across structure, testing, observability, and governance. ✨ Direct product impact – your APIs, pipelines, and orchestration power the platform that 1,000+ brands rely on every day. ✨ AI at the core – work on infrastructure that enables machine learning and intelligent decision … doing: API strategy & development – own and scale FastAPI endpoints that deliver real-time access to platform data. Data pipeline development – build ingestion and replication pipelines with best-in-class observability, latency, and resilience. Platform technical vision – influence architecture and orchestration, shaping how the business handles data at scale. Data quality & governance – embed testing, freshness, lineage, and monitoring to ensure reliability More ❯

DevOps Engineer

City of London, London, United Kingdom
Tribus
frameworks that support thousands of real-time processes across global markets. This isn’t a maintenance role - it’s an opportunity to modernise the firm’s CI/CD, observability, and runtime environments from the ground up. What you’ll be doing: Engineering and optimising CI/CD pipelines and container orchestration at scale Modernising Linux-based deployment and runtime … low-latency environment What we’re looking for: 5+ years’ experience in DevOps, Systems, or Platform Engineering Deep knowledge of Linux, Python, and shell scripting Proven experience with Kubernetes, observability tooling, and CI/CD frameworks Strong grasp of distributed systems and performance tuning Trading, Crypto or Hedge Fund Experience Experience working in low-latency, high-frequency trading environments Why More ❯

DevOps Engineer

london (city of london), south east england, united kingdom
Tribus
frameworks that support thousands of real-time processes across global markets. This isn’t a maintenance role - it’s an opportunity to modernise the firm’s CI/CD, observability, and runtime environments from the ground up. What you’ll be doing: Engineering and optimising CI/CD pipelines and container orchestration at scale Modernising Linux-based deployment and runtime … low-latency environment What we’re looking for: 5+ years’ experience in DevOps, Systems, or Platform Engineering Deep knowledge of Linux, Python, and shell scripting Proven experience with Kubernetes, observability tooling, and CI/CD frameworks Strong grasp of distributed systems and performance tuning Trading, Crypto or Hedge Fund Experience Experience working in low-latency, high-frequency trading environments Why More ❯

Staff Software Engineer

City of London, London, United Kingdom
Burns Sheehan
to solve complex challenges. Drive innovation around cloud-native technologies and platform automation. Balance strategic vision with ~30% hands-on coding and design work. Promote best practice in reliability, observability, and scalability. The Ideal Staff Software Engineer Proven experience operating at Staff+ level within a fast-paced engineering organisation. Strong background in cloud platforms (AWS or GCP) and deep knowledge … ability to build operators. Strong coding skills in Golang, Java, or C#, with experience in distributed systems. Demonstrated leadership across multiple squads and technical roadmaps. Expertise in operational excellence: observability, reliability, automation. This is an outstanding opportunity for a Staff Software Engineer join a rapidly scaling company where you’ll play a pivotal role in shaping the technical foundations of More ❯

Observability, Central London
10th Percentile: £73,250
25th Percentile: £73,750
Median: £85,000
75th Percentile: £105,000
90th Percentile: £111,000