3 of 3 Remote/Hybrid Incident Management Jobs in Cheshire

Mechanical & Electrical Problem Analyst, Data Centre Infrastructure

Hiring Organisation
Colt Data Centre Services
Location
Chester, Cheshire, UK
Employment Type
Full-time
resolutions, and preventive measures. o Prepare detailed RCA reports and graphs to support the technical explanation of a Problem o Keep documentation for problem management accurate and current. • Continuous Improvement: o Prepare presentations on Problem management metrics, trends, and improvements. o Proactively identify opportunities to enhance service reliability … efficiency. o Participate in problem management initiatives and projects to drive continuous improvement. o Deliver internal technical training in areas of expertise. • Collaboration and Communication: o Collaborate with stakeholders to gather information and perform thorough problem diagnostics. o Join and document problem review meetings, providing regular updates. o Actively ...

Cyber Resilience Analyst

Hiring Organisation
Searchability (UK) Ltd
Location
Chester, Cheshire, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£50,000
ANALYST ROLE: As a Cyber Resilience Analyst, you'll be responsible for defining, maintaining, and testing the organisation's resilience plans, covering Business Continuity, Incident Response, and Disaster Recovery. You'll work closely with IT teams and stakeholders across the wider business to ensure resilience strategies are practical, robust … effective. The role plays a key part in analysing the impact of cyber incidents on business systems, supporting incident reviews, and ensuring lessons learned are fed back into improved resilience planning. You'll also work alongside project and change teams to ensure new systems and developments are designed with ...

Site Reliability Engineer Trainer / Coach

Hiring Organisation
CBSbutler Holdings Limited
Location
Cheshire, North West, United Kingdom
Employment Type
Contract, Work From Home
Azure, GCP, Private Cloud). - Conduct capability assessments and design targeted learning pathways. - Mentor SREs and engineering teams on reliability engineering, automation, and incident management. - Guide teams in defining and implementing SLOs, SLIs, and error budgets. - Promote best practices in observability, monitoring and incident response. - Create and curate … Azure, and/or GCP. - Hands-on experience with automation and IaC (Terraform, Ansible, CloudFormation) and CI/CD pipelines. - Deep understanding of incident management, RCA, and reliability engineering. - Demonstrated experience designing and delivering technical training or bootcamps. - Familiarity with Google SRE, DevOps, and ITIL frameworks. Please apply ...