Senior SRE
- Hiring Organisation
- Anson Mccade
- Location
- Glasgow, Lanarkshire, Scotland, United Kingdom
- Employment Type
- Permanent, Work From Home
long-term platform evolution. Incident Management & Resilience Lead and participate in incident response, root cause analysis and blameless post-mortems. Use data and observability to reduce mean time to detect and resolve. Drive improvements through SLOs, error budgets and reliability metrics. Automation & Engineering Excellence Develop automation and tooling using scripting … code and automation (e.g. Terraform, Ansible, CloudFormation). Strong incident response, troubleshooting and fault-analysis skills using a scientific, data-driven approach. Experience with observability: metrics, logging, tracing, alerting and performance analysis. Ability to explain complex systems clearly and influence across technical and non-technical stakeholders. Nice to have Experience ...