Senior SRE Engineer
- Hiring Organisation: Prism Digital
- Location: City of London, London, United Kingdom
practices across a live Azure-based platform and a new strategic platform being brought into service. The role is focused on reliability, observability, incident management, resilience, and automation. You’ll help define how services are measured and operated, introducing practical improvements around SLIs, SLOs, error budgets, monitoring … environments
- Azure cloud environments in enterprise-scale businesses
- SLO/SLI/error budget design and implementation
- Observability tooling (Prometheus, Grafana, OpenTelemetry or similar)
- Incident leadership across Sev1/Sev2 environments
- Disaster recovery, resilience testing, RTO/RPO
- Terraform infrastructure as code
- CI/CD pipelines and engineering enablement ...