Observability Jobs in South London

16 of 16 Observability Jobs in South London

VP of Platform Engineering (London)

Wandsworth, Greater London, UK
YouLend
infrastructure provisioning and tooling to enhance development efficiency. You will manage Platform Reliability and Infrastructure ensuring a reliable and stable platform. You will oversee YouLend's the Security and Observability frameworks , focusing on platform security, maintaining observability, and providing dashboards for developers to monitor service health. The ideal candidate is someone who has successfully built and scaled platform architectures, led … the ability to work across technical and non-technical teams. Excellent communication skills, with the ability to translate complex technical concepts to business stakeholders. Operational Focus: Expertise in platform observability, monitoring, incident management, and creating highly reliable systems. Experience implementing SLAs, SLOs, and SLIs is a plus. Security & Compliance: In-depth understanding of platform security, data privacy, and regulatory compliance More ❯
Employment Type: Full-time
Posted:

Solutions Architect [UAE Based] (London)

Surbiton, Greater London, UK
ZipRecruiter
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . #J-18808-Ljbffr More ❯
Employment Type: Full-time
Posted:

Sr. Machine Learning Engineer (London)

Wandsworth, Greater London, UK
Menlo Ventures
. Build evaluation pipelines to benchmark LLM performance and continuously monitor production accuracy and relevance. Work across the ML stack—from data preparation and model training to serving and observability—either independently or in collaboration with other specialists. Optimize model pipelines for latency, scalability, and cost-efficiency , and support real-time and batch inference needs. Collaborate with MLOps, DevOps, and More ❯
Employment Type: Full-time
Posted:

Head of Infrastructure Engineering (London)

Wandsworth, Greater London, UK
Spendesk
AWS as our cloud compute platform Kubernetes (EKS) for container runtime and orchestration RDS (PostgreSQL, MySQL), Kafka, Redis Terraform for infrastructure as code Lambda and Step Functions Datadog for Observability Github actions for CICD Frontend is React Backend services are developed in NodeJS (TypeScript) As we are an international team, please submit your application and CV in English. About Spendesk More ❯
Employment Type: Full-time
Posted:

Director of Rates and Credit Reliability Engineering | London, UK (London)

Surbiton, Greater London, UK
Hybrid / WFH Options
Deutsche Bank
strategy across FIC Technology, aligning reliability goals with business priorities and regulatory expectations Lead the transformation of production support into a proactive, data-driven engineering discipline focused on automation, observability, and continuous improvement Stay close to the technology-reviewing architecture, contributing to tooling, and leading by example in incident response and root cause analysis Act as a trusted advisor to … proficiency in Linux/Unix systems, SQL, and programming languages such as C++, Java or Python. Strong understanding of distributed systems and low-latency architectures Hands-on experience with observability stacks (e.g., Prometheus, Grafana, Splunk, Geneos, OpenTelemetry) and infrastructure automation (e.g., Ansible, Terraform, CI/CD pipelines) Strong understanding of the trade lifecycle, market data, and fixed income products, FX More ❯
Employment Type: Full-time
Posted:

Head of Platform Engineering (London)

Wandsworth, Greater London, UK
Octopus Energy Group
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
Employment Type: Full-time
Posted:

Principal Software Architect (London)

Wandsworth, Greater London, UK
Hybrid / WFH Options
PeopleCheck
able to present past case studies and guide stakeholders Preferred Qualifications Background in compliance or background-screening services Experience with microservices design and orchestration (Kubernetes, ECS) Knowledge of advanced observability tools (Datadog, New Relic, ELK) Why Join Us? Impact : Help define the technical roadmap together with our tech lead of a mission-critical compliance platform. Ownership : Lead key initiatives end More ❯
Employment Type: Full-time
Posted:

Senior Software Engineer (London)

Wandsworth, Greater London, UK
Omnea
with React & Material UI, Postgres, Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and Github Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From product, design and architectural decisions … ideally AWS). You focus on having a high impact . You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaaC, observability and quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like . You understand what it … takes to build highly reliable & well architected products. You build with quality, observability & redundancy at the forefront. You’re ready to get a lot done. You enjoy all aspects of building a product and are comfortable moving across the stack when necessary. You enjoy problem solving and thinking from first principals.. You’re ready to pick up new skills and More ❯
Employment Type: Full-time
Posted:

Senior Software Engineer (Core Data Services) (London)

Wandsworth, Greater London, UK
Hybrid / WFH Options
Our Future Health
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme – We invest in your future More ❯
Employment Type: Full-time
Posted:

Head of Infrastructure (London)

Wandsworth, Greater London, UK
Hybrid / WFH Options
Intec Select
with organisational goals. Ensure all services are secure by design, working closely with the information security team to proactively manage risks. Drive service improvement and operational resilience through automation, observability, and DevOps best practices. Experience Required: Proven experience in leading platform/infrastructure and DevOps teams in a hands-on capacity. Strong technical foundation in both traditional infrastructure and modern … CI/CD, GitOps, IaC (e.g., Terraform, ARM), and automation scripting (e.g., PowerShell, Bash, Python). Cloud experience (ideally Azure) and hybrid infrastructure environments. Familiarity with monitoring, alerting, and observability platforms. Package: Up to 25% Bonus Remote Working Head of Platform & Infrastructure Engineering – Financial Services- London (Hybrid/Remote) - £100,000 - £120,000 + 25% Bonus + 15% Pension + More ❯
Employment Type: Full-time
Posted:

Senior Software Architect (London)

Wandsworth, Greater London, UK
Rocket Lab
technology. • Experience designing RESTful APIs. • Experience with streaming and messaging systems such as gRPC, Kafka and RabbitMQ. • Experience designing and interfacing with user portals. • Experience with monitoring, telemetry and observability technology and patterns. • Understanding of BSS/OSS systems and their integration with network infrastructure. • Experience with agile development methodologies and ways of working. • Awareness of software and network security More ❯
Employment Type: Full-time
Posted:

Machine Learning Engineer (London)

Wandsworth, Greater London, UK
Bazaarvoice
with containerized services and CI/CD deployment via GitHub Actions. Implement streaming data processing using Kafka for real-time content moderation decisions. Monitor model performance and drift using observability tools (e.g. Arize AI). Collaborate with teams using Scala-based services and maintain API integrations for model serving. Conduct architectural reviews for ML pipeline design and Infrastructure as Code More ❯
Employment Type: Full-time
Posted:

Senior Software Engineer - Croydon, England, United Kingdom; Manchester, England, United Kingdom

Croydon, London, United Kingdom
Jane's Group
coaching skills Strong problem solving and communication skills Strong understanding of SDLC Expertise with cloud technologies especially AWS Good experience delivering solutions and impact in agile environments Good with Observability, Monitoring and Serverless technology Experience providing data for consumption via API Experience and strong understanding of API First principles Our Mission: Creating trusted open-source intelligence has always been our More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (London)

Wandsworth, Greater London, UK
HOLLAND AND BARRETT
part of our Supply Chain team , implementing scalable, stable, secure, and resilient apps built on cloud-based services that enhance the store space shopping experience through internal tooling, automation, observability, and full-stack engineering practices. Our highly skilled Backend Engineers sit at the heart of our business and are responsible for designing, building, and maintaining our platform solutions. As a More ❯
Employment Type: Full-time
Posted:

Senior Software Engineer II, Endpoint - Cisco ThousandEyes (London)

Wandsworth, Greater London, UK
Cisco Systems, Inc
end-user experiences. ThousandEyes is integrated across the Cisco portfolio and beyond, helping customers deploy at scale while delivering AI-powered insights within Cisco’s Networking, Security, Collaboration, and Observability portfolios. What You'll Do We seek a skilled C++ Software Engineer to join our team. This role involves working on integration and test automation projects, with opportunities to work More ❯
Employment Type: Full-time
Posted:

Solutions Architect (London)

Wandsworth, Greater London, UK
hyperexponential
skills and experience (ideally Python, and/or Rust, Go, Kotlin, Java, etc) Sound technical knowledge, ideally across multiple technical competencies and levels (e.g APIs, networking, databases, security, compliance, observability, architecture) Excellent communication skills (written, graphical, remote, in-person, presentation, one:one, one:many) with the ability to engage, influence, and inspire stakeholders and colleagues to drive collaboration and alignment More ❯
Employment Type: Full-time
Posted: