Senior Site Reliability Engineer

💼 Principal Platform SRE

👔 AI Infrastructure Startup

📍 Gloucester/London - Remote First

💵 £90-120k + Equity

Do you want to work for a pioneering tech company that’s redefining how AI infrastructure is built and scaled?

Do you want to work in a business critical technical role, utilising all the latest technologies within DevOps, HPC and AI?

My client are a rapidly scaling HPC Cloud & Compute firm who are setting new standards in AI infrastructure. They are backed by leading investors and have just received Series B funding which they will be putting towards a huge tech scale-out

They are now looking for a Principal SRE to join their team and play a pivotal role in the ongoing scaling and optimisation of their platform.

Required Skills:

Kubernetes (CNI, Cilium, Bare Metal)
Automation (Ansible, Terraform)
Linux (Kernal level troubleshooting)
Cloud (Public, On Prem)
SRE (SLAs, SLOs, 24/7 Support models)

Benefits:

✓ Remote First Working (onsite couple times p/m in London or Gloucester)

✓ Equity

✓ 30 days annual leave (+ public holidays)

✓ £400 work-from-home allowance

✓ Private Medical insurance

✓ 12 Learning & Development days per year + dedicated budget)

✓ Flexi-Time

If you’re ready to lead large-scale automation initiatives and shape the future of AI infrastructure, apply today!

Apply Now

Senior Site Reliability Engineer

Job Details