Remote Senior Site Reliability Engineer Manager (Remote)
Cambourne, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Hybrid / WFH Options
Remotestar
production estate from both a technical and process perspective. Provide a consistent smooth operation of live systems and drive all on-call support issues. Design and operate a new incident tracking process to ensure root causes are found and remediated in a timely fashion by the development team. Create and maintain high end monitoring and automation tooling. Drive automation … and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted: