Site Reliability Program Manager
- Hiring Organisation
- HCLTech
- Location
- City of London, London, United Kingdom
SLAs and support major migrations. Responsible for driving initiatives that improve system reliability, availability, incident response, and overall operational excellence. Coordinate efforts across SRE, engineering, product, support, and other teams to deliver high-quality, stable services and infrastructure. Required Experience & Skills: 15+ years of technical program/project … observability tooling, on-call/incident management tools. Data-driven mindset: comfortable analysing metrics, generating reports, and driving improvements based on data. Familiarity with SRE principles — high availability, reliability, observability, incident management, error budgets, service-level objectives (SLOs)/indicators (SLIs). Strong communication and stakeholder management skills — ability ...