Site Reliability Engineer

Central London (Hybrid – 2 to 8 days per month in office)

£50,000 per annum

Clear progression to Mid-Level SRE within 18 months

Build. Automate. Own the platform.

We’re looking for a hands-on Site Reliability Engineer (SRE) to become the technical backbone of our platform automation and CI/CD systems.

This is not a ticket-only support role. You’ll be building, improving, and owning the systems that keep our engineering teams moving quickly and reliably.

If you enjoy writing production code, automating repetitive work, and solving complex infrastructure problems, this is a role where you’ll have real ownership and impact from day one.

What you’ll be doing

You’ll split your time between supporting engineers and building automation that improves the entire platform.

Around 40% – Supporting Engineers & Fixing Issues

Debug and resolve failing builds, deployments, and CI/CD pipelines
Support engineers via Slack, tickets, and pairing sessions
Take ownership of production incidents and help restore service quickly
Be a key escalation point when things break

Around 60% – Automation & Platform Engineering

Design, build and optimise CI/CD pipelines (GitHub Actions, Jenkins, Rundeck)
Develop infrastructure-as-code using Terraform
Build automation tools, self-service platforms, and internal tooling
Improve observability using tools such as New Relic, Datadog, Grafana or Prometheus
Continuously identify opportunities to remove manual work and improve reliability

What we’re looking for

You’ll likely have:

2–3 years’ experience in DevOps, Platform Engineering or SRE
Strong Linux and infrastructure troubleshooting skills
Hands-on experience building CI/CD pipelines
Confidence writing production-quality code in Python or Bash
Experience with Terraform or similar infrastructure-as-code tools
Understanding of networking fundamentals (DNS, load balancers, CDNs, request flow)
Exposure to observability tools (New Relic, Datadog, Grafana, Prometheus etc.)
Experience supporting engineers in live environments (Slack, tickets, incidents)
Familiarity with AI tools (ChatGPT, Copilot, Cursor etc.) and using them to improve productivity
Willingness to participate in on-call rotations

Bonus points if you have:

Go programming experience
Experience with Ansible or other configuration management tools
CDN experience (Cloudflare, CloudFront, Fastly)

Who you are

You enjoy solving complex technical problems and automating repetitive work
You stay calm and focused when systems go down
You communicate clearly with engineers and stakeholders
You care about writing clean, maintainable, production-ready code
You’re proactive and always looking for better ways to do things
You use modern AI tools to work smarter and faster

Why this role stands out

This is a genuine ownership role, not a reactive support position.

You’ll:

Own CI/CD systems and platform automation end-to-end
Work closely with senior engineering leadership
Have real influence over tooling and infrastructure decisions
Be part of a team that values automation, quality, and continuous improvement
Have a clear progression path to Mid-Level SRE within 18 months

If you’ve proven you can write production code and want to step into a role where you can shape platform engineering, this is a strong next step.
