Senior Specialist Engineer (SRE)
Birmingham, Leeds, Liverpool, London (Canary Wharf), United Kingdom
Hybrid/Remote Options
Hybrid/Remote Options
UK Health Security Agency
Proactively identify and address system bottlenecks using advanced problem-solving and performance tuning techniques. Conduct capacity planning and implement solutions to ensure systems can support current and future workloads Incident Response & Troubleshooting Respond swiftly to production incidents, ensuring minimal downtime and quick restoration of services. Perform root cause analysis and postmortems, implementing lessons learned to prevent recurrence. Monitoring, Alerting … to streamline deployment and operational workflows. Improve cross-functional collaboration and promote a culture of shared responsibility for service reliability. Documentation & Training Maintain accurate technical documentation, runbooks, and post-incident reports. Provide training and mentorship to engineering teams on best practices and tools. Essential criteria: Experience as a Site Reliability Engineer, DevOps Engineer, Operations Engineer or similar role Coding … demands Desirable criteria: Experience with CI/CD pipelines, cloud platforms (e.g., Amazon Web Services, Google Cloud Platform (AWS, GCP), Azure) and container orchestration (e.g., Kubernetes) Experience with post-incident reviews Previous involvement in driving adoption of SRE practices across an organization Experience delivering training or mentoring junior engineers Selection Process Detail This vacancy is using Success Profiles and More ❯
Employment Type: Permanent
Salary: £41983.00 - £52113.00 a year
Posted: