Site Reliability Engineer
- Hiring Organisation: SS&C Technologies
- Location: Shrewsbury, Shropshire, UK
- Employment Type: Full-time
incidents across services and infrastructure; reduce MTTR and prevent recurrences through high-quality post-incident actions.
- Observability as a first-class practice: Use Grafana, Datadog, and Splunk (and related tools like Prometheus/OpenTelemetry) to detect anomalies, find the root cause of issues, and create actionable alerts and dashboards.
- Run Kubernetes at scale … engineering).

What you will bring
- 5+ years operating production systems as an SRE, DevOps engineer, or software engineer.
- Observability: Hands-on experience with Grafana, Datadog, and Splunk for incident investigation, dashboarding, alerting, correlation of traces, logs, and metrics, and performance analysis.
- Kubernetes: Strong experience running and troubleshooting workloads (controllers, pods ...