Cloud Infrastructure Site Reliability Engineer
🚀 SRE / Cloud Platform Engineer
📍 Sheffield (Hybrid – 3 days onsite)
💼 6-Month Contract (Likely Extension)
The Opportunity
We’re looking for a GCP/Azure-focused Site Reliability Engineer / Cloud Platform Engineer to join a high-impact team working on large-scale, cloud-native infrastructure.
This is a hands-on role where you’ll build, automate, and run highly available platforms, applying modern SRE principles to real-world systems at scale.
If you enjoy solving complex problems, improving reliability, and working across cloud, DevOps, and data platforms, this is for you.
What You’ll Be Doing
- Build, operate, and support scalable GCP/Azure cloud infrastructure
- Apply SRE principles to improve reliability, performance, and automation
- Create and maintain Infrastructure as Code (Terraform / ARM)
- Develop scripts and tooling (PowerShell, Bash, Python)
- Monitor systems using tools such as Azure Monitor, Prometheus, and Grafana
- Troubleshoot complex, cross-platform production issues
- Collaborate across teams to deliver resilient, production-grade services
- Continuously improve systems, processes, and deployment pipelines
What You Bring
- Strong experience with GCP or Azure in DevOps / SRE environments
- Solid scripting skills (PowerShell, Bash, Python)
- Experience with CI/CD, Git, and automation pipelines
- Good understanding of Linux, networking, and cloud architecture
- Hands-on experience with Terraform or similar IaC tools
- Proven ability to troubleshoot and optimise production systems