Site Reliability Engineer
- Hiring Organisation
- Jobleads-UK
- Location
- Greater London, England, United Kingdom
Hybrid Mandatory primary skills on Datadog/Dynatrace tools, SLO management skills (AWS cloud skills is secondary). Primary Responsibilities: • Work closely with Product Engineering team and implement strategies for modernizing IT operations enhancing observability and toil reduction. • Architect and deploy observability platforms to monitor system health, performance … scalable, resilient, and maintainable. • Drive incident management and root cause analysis processes through automation, ensuring continuous improvement to enable autonomous operations. • Partner with engineering, architecture, and product teams to enable shift-left engineering practices ensuring reliability. • Mentor and guide teams on adopting SRE principles and tools. • Advocate ...