Site Reliability Engineer
Site Reliability Engineer
Central London (Hybrid – 2 to 8 days per month in office)
£50,000 per annum
Clear progression to Mid-Level SRE within 18 months
Build. Automate. Own the platform.
We’re looking for a hands-on Site Reliability Engineer (SRE) to become the technical backbone of our platform automation and CI/CD systems.
This is not a ticket-only support role. You’ll be building, improving, and owning the systems that keep our engineering teams moving quickly and reliably.
If you enjoy writing production code, automating repetitive work, and solving complex infrastructure problems, this is a role where you’ll have real ownership and impact from day one.
What you’ll be doing
You’ll split your time between supporting engineers and building automation that improves the entire platform.
Around 40% – Supporting Engineers & Fixing Issues
- Debug and resolve failing builds, deployments, and CI/CD pipelines
- Support engineers via Slack, tickets, and pairing sessions
- Take ownership of production incidents and help restore service quickly
- Be a key escalation point when things break
Around 60% – Automation & Platform Engineering
- Design, build and optimise CI/CD pipelines (GitHub Actions, Jenkins, Rundeck)
- Develop infrastructure-as-code using Terraform
- Build automation tools, self-service platforms, and internal tooling
- Improve observability using tools like NewRelic, Datadog, Grafana or Prometheus
- Continuously identify opportunities to remove manual work and improve reliability
What we’re looking for
You’ll likely have:
- 2–3 years’ experience in DevOps, Platform Engineering or SRE
- Strong Linux and infrastructure troubleshooting skills
- Hands-on experience building CI/CD pipelines
- Confidence writing production-quality code in Python or Bash
- Experience with Terraform or similar infrastructure-as-code tools
- Understanding of networking fundamentals (DNS, load balancers, CDNs, request flow)
- Exposure to observability tools (NewRelic, Datadog, Grafana, Prometheus etc.)
- Experience supporting engineers in live environments (Slack, tickets, incidents)
- Familiarity using AI tools (ChatGPT, Copilot, Cursor etc.) to improve productivity
- Willingness to participate in on-call rotations
Bonus points if you have:
- Go programming experience
- Ansible or configuration management tools
- CDN experience (Cloudflare, CloudFront, Fastly)
Who you are
- You enjoy solving complex technical problems and automating repetitive work
- You stay calm and focused when systems go down
- You communicate clearly with engineers and stakeholders
- You care about writing clean, maintainable, production-ready code
- You’re proactive and always looking for better ways to do things
- You use modern AI tools to work smarter and faster
Why this role stands out
This is a genuine ownership role, not a reactive support position.
You’ll:
- Own CI/CD systems and platform automation end-to-end
- Work closely with senior engineering leadership
- Have real influence over tooling and infrastructure decisions
- Be part of a team that values automation, quality, and continuous improvement
- Have a clear progression path to Mid-Level SRE within 18 months
If you’ve proven you can write production code and want to step into a role where you can shape platform engineering, this is a strong next step.