Senior SRE
- Hiring Organisation
- Anson Mccade
- Location
- Glasgow, Lanarkshire, Scotland, United Kingdom
- Employment Type
- Permanent, Work From Home
reliability, scalability, performance and operability. Contribute to technical strategy, standards and long-term platform evolution. Incident Management & Resilience Lead and participate in incident response, root cause analysis and blameless post-mortems. Use data and observability to reduce mean time to detect and resolve. Drive improvements through SLOs … networking, distributed systems and cloud platforms. Experience with infrastructure-as-code and automation (e.g. Terraform, Ansible, CloudFormation). Strong incident response, troubleshooting and fault-analysis skills using a scientific, data-driven approach. Experience with observability: metrics, logging, tracing, alerting and performance analysis. Ability to explain complex systems clearly ...