Senior Site Reliability Engineer
Senior Site Reliability Engineer (AWS & Azure)
Location: London (Hybrid)
Contract: 6 months
Day Rate: £500-£600 (Outside IR35)
A leading London based organisation is seeking a Senior Site Reliability Engineer to join their platform and reliability function on a 6 month contract. This role is outside IR35 and offers the opportunity to shape and optimise a modern multi cloud environment across AWS and Azure.
What you'll be doing
- Driving reliability, scalability, and performance across AWS and Azure platforms
- Designing and implementing infrastructure using Terraform
- Managing and optimising production Kubernetes clusters
- Building automation, tooling, and internal services using Python and Go
- Enhancing observability using Prometheus, Grafana, and related monitoring stacks
- Implementing and running chaos engineering practices using tools such as Gremlin, Litmus, or similar
- Improving incident response, on call processes, and overall platform resilience
- Collaborating with engineering teams to strengthen CI/CD pipelines and cloud native architectures
- Deep experience operating in AWS and Azure environments
- Strong background in Site Reliability Engineering principles
- Proven expertise with Kubernetes in production
- Advanced Terraform skills
- Programming experience with Python and Go
- Hands on experience with Prometheus, Grafana, and modern observability tooling
- Exposure to chaos engineering tools and methodologies
- A proactive mindset focused on reliability, performance, and continuous improvement