Senior Director - Operations and Reliability Engineering
Canary Wharf, Greater London, UK
Boston Consulting Group
Drive end-to-end automation to eliminate toil, improve efficiency, and enhance operational resilience. * Lead the transition from traditional IT operations to a proactive, AI-driven, self-healing infrastructure. * Establish a global observability, telemetry, and predictive analytics framework for real-time insights. * Align operational strategies with business goals, ensuring IT … and GCP. * Scale Infrastructure as Code (IaC), automated provisioning, and cloud workload optimization. * Drive edge computing, containerized workloads, and high-performance computing strategies. * Implement AI-driven monitoring, self-healing automation, and full-stack observability. IT Service Management & Operational Excellence: * Mandate and assure the adoption of IT Service Management (ITSM) processes … effective service delivery. * Establish SRE-based operational metrics, including SLOs, SLIs, and error budgets. * Oversee incident response, problem resolution, and root cause analysis with AI-driven remediation. * Ensure high availability, performance, and security compliance for all enterprise services. * Develop a follow-the-sun operational support model, ensuring 24x7 resilience and More ❯
Employment Type: Full-time
Posted: