Senior SRE Engineer

Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services

Location: London (Hybrid, typically 3 days onsite)
Permanent, Full-time
Salary: £80k–£90k + bonus + benefits
Visa sponsorship: Not available

The Role

You’ll join as the first dedicated SRE hire, with responsibility for establishing SRE practices across a live Azure-based platform and a new strategic platform being brought into service.

The role is focused on reliability, observability, incident management, resilience, and automation. You’ll help define how services are measured and operated, introducing practical improvements around SLIs, SLOs, error budgets, monitoring, and service ownership.

This is a hands-on role for someone who has done this before and can bring structure, prioritise well, and build an SRE capability in a pragmatic way.

Non-Negotiables

Site Reliability Engineering in production environments
Azure cloud environments in enterprise-scale businesses
SLO / SLI / error budget design and implementation
Observability tooling (Prometheus, Grafana, OpenTelemetry or similar)
Incident leadership across Sev1 / Sev2 environments
Disaster recovery, resilience testing, RTO / RPO
Terraform infrastructure as code
CI/CD pipelines and engineering enablement
Strong scripting with PowerShell, Bash or Python
Experience improving reliability in hybrid estates (cloud + IaaS)
Ability to introduce new ways of working and build an SRE practice from scratch

They are looking for someone with a strong Azure background, but the priority is proven SRE capability and the ability to apply it effectively.

What You’ll Work With

Azure platform engineering
Azure Container Apps / cloud-native services
Terraform infrastructure as code
Prometheus monitoring
Grafana dashboards
OpenTelemetry tracing
Azure DevOps pipelines
GitHub Actions CI/CD
Windows Server and Linux estates
Service Bus, Event Hubs and Kafka
Incident management, runbooks, failover and resilience testing

Nice to Haves

Financial services or regulated environment experience
FCA / PRA operational resilience exposure
Payments or FX platform experience
Chaos engineering
FinOps or cloud cost awareness
Kubernetes exposure

Kubernetes knowledge is useful, but not essential.

Why Join / Projects

Establish the SRE capability from the ground up
Define and implement SLIs, SLOs and error budgets
Improve observability across platforms and services
Lead incident response and post-incident improvements
Drive resilience, failover and automation initiatives
Support the move toward a modern, reliability-first platform

You’ll play a key role in shaping how reliability is engineered across both the current platform and a new strategic platform being brought into production.

Employee Benefits

Pension
Private healthcare
Training and certification support

Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services

Apply Now

Senior SRE Engineer

Job Details