4 of 4 Permanent Service-Level Objective Jobs in Central London

Observability Platform Engineer – Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
City of London, London, United Kingdom
ability to solve problems on their own initiative. Requirements: Debugging distributed systems: operating, improving and scaling complex systems in high-availability environments. SRE fundamentals: SLO/SLI thinking, observability, incident leadership, and a bias for systemic platform fixes. Strong software engineering skills: high proficiency in at least one modern programming ...

AWS Site Reliability Engineer (Data Platform)

Hiring Organisation
FBI &TMT
Location
City, London, United Kingdom
Employment Type
Permanent
Salary
£450 - £455 per day
data platform built on AWS, Snowflake, and Databricks. This role focuses on enhancing reliability through automation, disaster recovery testing, resiliency engineering, observability, and proactive SLO/SLI/SLA management ...

Principal Software Engineer

Hiring Organisation
Fruition Group
Location
City of London, London, United Kingdom
Employment Type
Permanent
across services (failover, circuit breaking, back-pressure, etc.). Lead large-scale refactoring or reliability improvement initiatives. Establish best practices in incident response, observability, and SLO management. Drive adoption of modern cloud-native and GitOps practices. Mentor senior engineers and influence engineering culture at scale. Skills Required: Strong backend engineering ability ...

AWS Site Reliability Engineer (Data Platform)

Hiring Organisation
FBI &TMT
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£450 - £455 per day
data platform built on AWS, Snowflake, and Databricks. This role focuses on enhancing reliability through automation, disaster recovery testing, resiliency engineering, observability, and proactive SLO/SLI/SLA management. Key Responsibilities: Design, build, and maintain automation for infrastructure provisioning, platform operations, and incident response using IaC and CI/… raised by consumer teams, providing operational support and automating fixes to improve reliability and user experience. Job Requirements: Practical knowledge of SRE principles, including SLO/SLI/SLA design and error budgets. Strong experience with AWS (e.g., EC2, S3, IAM, VPC, CloudWatch) in production environments. Experience with observability tools ...