Site Reliability Engineer

Site Reliability Engineer

Central London (Hybrid – 2 to 8 days per month in office)

£50,000 per annum

Clear progression to Mid-Level SRE within 18 months

Build. Automate. Own the platform.

We’re looking for a hands-on Site Reliability Engineer (SRE) to become the technical backbone of our platform automation and CI/CD systems.

This is not a ticket-only support role. You’ll be building, improving, and owning the systems that keep our engineering teams moving quickly and reliably.

If you enjoy writing production code, automating repetitive work, and solving complex infrastructure problems, this is a role where you’ll have real ownership and impact from day one.

What you’ll be doing

You’ll split your time between supporting engineers and building automation that improves the entire platform.

Around 40% – Supporting Engineers & Fixing Issues

  • Debug and resolve failing builds, deployments, and CI/CD pipelines
  • Support engineers via Slack, tickets, and pairing sessions
  • Take ownership of production incidents and help restore service quickly
  • Be a key escalation point when things break

Around 60% – Automation & Platform Engineering

  • Design, build and optimise CI/CD pipelines (GitHub Actions, Jenkins, Rundeck)
  • Develop infrastructure-as-code using Terraform
  • Build automation tools, self-service platforms, and internal tooling
  • Improve observability using tools like NewRelic, Datadog, Grafana or Prometheus
  • Continuously identify opportunities to remove manual work and improve reliability

What we’re looking for

You’ll likely have:

  • 2–3 years’ experience in DevOps, Platform Engineering or SRE
  • Strong Linux and infrastructure troubleshooting skills
  • Hands-on experience building CI/CD pipelines
  • Confidence writing production-quality code in Python or Bash
  • Experience with Terraform or similar infrastructure-as-code tools
  • Understanding of networking fundamentals (DNS, load balancers, CDNs, request flow)
  • Exposure to observability tools (NewRelic, Datadog, Grafana, Prometheus etc.)
  • Experience supporting engineers in live environments (Slack, tickets, incidents)
  • Familiarity using AI tools (ChatGPT, Copilot, Cursor etc.) to improve productivity
  • Willingness to participate in on-call rotations

Bonus points if you have:

  • Go programming experience
  • Ansible or configuration management tools
  • CDN experience (Cloudflare, CloudFront, Fastly)

Who you are

  • You enjoy solving complex technical problems and automating repetitive work
  • You stay calm and focused when systems go down
  • You communicate clearly with engineers and stakeholders
  • You care about writing clean, maintainable, production-ready code
  • You’re proactive and always looking for better ways to do things
  • You use modern AI tools to work smarter and faster

Why this role stands out

This is a genuine ownership role, not a reactive support position.

You’ll:

  • Own CI/CD systems and platform automation end-to-end
  • Work closely with senior engineering leadership
  • Have real influence over tooling and infrastructure decisions
  • Be part of a team that values automation, quality, and continuous improvement
  • Have a clear progression path to Mid-Level SRE within 18 months

If you’ve proven you can write production code and want to step into a role where you can shape platform engineering, this is a strong next step.

Job Details

Company
Pertemps Network Group
Location
City of London, London, United Kingdom
Posted