Senior Site Reliability Engineer (SRE)
- Hiring Organisation: Paydock
- Location: Bolton, Greater Manchester, UK
- Employment Type: Full-time
scalability limits. You'll define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to maintain and improve platform health.

Champion Observability: Implement and manage comprehensive monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK Stack) to provide deep insights into system behavior and ensure rapid incident detection.

… Work closely with software engineering teams to foster a culture of reliability. You'll provide guidance on building resilient services, implementing best practices for observability, and improving the developer experience.

Secure the Foundation: Implement and maintain security best practices across our cloud infrastructure, ensuring our platform is robust and compliant. ...