Senior Site Reliability Engineer
- Hiring Organisation
- 17918
- Location
- United Kingdom
Site Reliability Engineer to improve the reliability, and performance of business-critical systems. Reporting into our Head of SRE you will focus on AWS cloud infrastructure, DevOps tooling, and core SRE practices within a distributed, production environment. Main Responsibilities: Leadership & Strategy Define and implement SRE best practices across the organization. … Excellent in designing systems that detect and remediate issues without manual intervention Self Healing systems, Runbook automation Exposure to tools like Gremlin, Chaos Monkey, AWS FIS to simulate outages and improve fault tolerance Incident Management Act as the primary point of escalation for critical production issues and lead major incident ...