Senior Site Reliability Engineer
- Hiring Organisation
- Dunelm
- Location
- Cardiff, UK
- Employment Type
- Full-time
driving service health, deep observability, and high availability across our entire technology infrastructure. You will have strong software engineering skills (ideally in TypeScript and Rust) and a deep understanding of modern observability practices. You will be confident working across infrastructure and application layers, and you will lead by example … Proven ability to lead incident response and post-incident review processes Strong problem-solving mindset and attention to detail Desirable skills Some experience in Rust or similar compiled language e.g. Go Experience instrumenting and running OpenTelemetry in production at scale. Knowledge of distributed tracing and trace sampling Experience reducing observability ...