IT Service Management Analyst/Problem Manager ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysis … function which aims to ensure the reliability, performance, and continual improvement of critical business systems (Applications and Infrastructure). Youll work with technical teams and service owners to uncover root causes behind major incidents and implement long-term solutions that prevent recurrence. As the Problem Manager Analyst/IT Service Management Analyst, youll focus on Problem Management, leading structured … investigations, facilitating RootCauseAnalysis sessions, and embedding preventative measures that strengthen the service landscape. Youll also play a key role in supporting Incident and major incident management, ensuring a seamless link between reactive issue resolution and proactive service improvement. As such key responsibilities will include: Ownership and lead the Problem Management process from identification through to More ❯
IT Service Management Analyst/Problem Manager - ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysis … which aims to ensure the reliability, performance, and continual improvement of critical business systems (Applications and Infrastructure). You'll work with technical teams and service owners to uncover root causes behind major incidents and implement long-term solutions that prevent recurrence. As the Problem Manager Analyst/IT Service Management Analyst, you'll focus on Problem Management, leading … structured investigations, facilitating RootCauseAnalysis sessions, and embedding preventative measures that strengthen the service landscape. You'll also play a key role in supporting Incident and major incident management, ensuring a seamless link between reactive issue resolution and proactive service improvement. As such key responsibilities will include: Ownership and lead the Problem Management process from identification More ❯
IT Service Management Analyst/Problem Manager – ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysis … which aims to ensure the reliability, performance, and continual improvement of critical business systems (Applications and Infrastructure). You’ll work with technical teams and service owners to uncover root causes behind major incidents and implement long-term solutions that prevent recurrence. As the Problem Manager Analyst/IT Service Management Analyst, you’ll focus on Problem Management, leading … structured investigations, facilitating RootCauseAnalysis sessions, and embedding preventative measures that strengthen the service landscape. You’ll also play a key role in supporting Incident and major incident management, ensuring a seamless link between reactive issue resolution and proactive service improvement. As such key responsibilities will include: Ownership and lead the Problem Management process from identification More ❯
IT Service Management Analyst/Problem Manager – ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysis … which aims to ensure the reliability, performance, and continual improvement of critical business systems (Applications and Infrastructure). You’ll work with technical teams and service owners to uncover root causes behind major incidents and implement long-term solutions that prevent recurrence. As the Problem Manager Analyst/IT Service Management Analyst, you’ll focus on Problem Management, leading … structured investigations, facilitating RootCauseAnalysis sessions, and embedding preventative measures that strengthen the service landscape. You’ll also play a key role in supporting Incident and major incident management, ensuring a seamless link between reactive issue resolution and proactive service improvement. As such key responsibilities will include: Ownership and lead the Problem Management process from identification More ❯
IT Service Management Analyst/Problem Manager – ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysis … which aims to ensure the reliability, performance, and continual improvement of critical business systems (Applications and Infrastructure). You’ll work with technical teams and service owners to uncover root causes behind major incidents and implement long-term solutions that prevent recurrence. As the Problem Manager Analyst/IT Service Management Analyst, you’ll focus on Problem Management, leading … structured investigations, facilitating RootCauseAnalysis sessions, and embedding preventative measures that strengthen the service landscape. You’ll also play a key role in supporting Incident and major incident management, ensuring a seamless link between reactive issue resolution and proactive service improvement. As such key responsibilities will include: Ownership and lead the Problem Management process from identification More ❯
The Production Support Engineer is responsible for troubleshooting, maintaining, and optimizing business-critical production applications and infrastructure. This role involves handling escalated issues from Level 1 support, performing detailed rootcauseanalysis, supporting monthly maintenance activities, and ensuring SLA compliance. Your Role at Orange Logic: Application and System Support: Administer and resolve application issues, provide timely updates … and perform rootcause analysis. Perform detailed troubleshooting, log analysis, and rootcause investigations for application and infrastructure incidents. Provide software application support, including monitoring, escalation, and incident response. Support application outages by executing recovery plans and participating in post-mortem analysis. Infrastructure Management and Automation: Assist in making infrastructure adjustments and improvements using Infrastructure … best practices in production support, DevOps, SRE, and cloud operations. Participate in ongoing training and knowledge-sharing initiatives within the team. Ideal Qualifications: Technical Expertise: Proficient in application troubleshooting, rootcauseanalysis, and log diagnostics. Experience with SQL queries and database management (primarily SQL Server). Knowledge of programming/scripting languages (Python, PowerShell, Bash). Hands More ❯
improving the Quality Management System (QMS). This isn't just an audit follow-up role, you'll work directly with managers and stakeholders to drive audit close-outs, rootcauseanalysis, and process improvements across all areas of the business. Personality is just as important as technical ability: we're looking for someone energetic, flexible, and … enhancing the Quality Management System (QMS) in line with ISO9001 requirements Leading and supporting the close-out of audit findings and corrective actions, working with managers and stakeholders Driving rootcauseanalysis (8D, Fishbone/Ishikawa, 5 Whys) to resolve non-conformances and prevent recurrence Facilitating risks and opportunities analysis, including mitigation planning Supporting ISO certification … knowledge of QMS and ISO standards (ISO9001 essential) Proven track record in supporting audit close-outs and corrective actions Experienced in documentation, process improvement, and stakeholder engagement Skilled in RootCauseAnalysis tools and CAPA management Assertive communicator with the ability to influence and support stakeholders across all levels Agile, flexible, and proactive approach to problem-solving More ❯
Watford, Hertfordshire, South East, United Kingdom Hybrid/Remote Options
Zellis
stakeholders to ensure that we can deliver meaningful resolutions for our internal and external customers. You'll be a customer champion and advocate and will use insight, data and rootcauseanalysis to drive through the best possible outcomes for our customers. This role is critical in making a difference to our customers by genuinely ensuring that … Senior Leadership team specifically within Service Support to constantly review the customer online and offline journeys to identify improvement opportunities. Driving customer experience benefits and improvement through focused, proactive rootcauseanalysis and insights. Supporting on the production, completion and delivery of accurate and timely insights, MI and KPI packs, as well as complaints/escalations rootcause information as required. Skills & experience A strong communicator at all levels of the business. Attention to detail and able to interpret information, make sound decisions and take ownership of issues through to resolution. Effective stakeholder management, both internal and external. A results-driven individual who is commercially astute. Strong presentation skills and the ability to present information More ❯
Watford, Hertfordshire, England, United Kingdom Hybrid/Remote Options
Zellis
stakeholders to ensure that we can deliver meaningful resolutions for our internal and external customers. You'll be a customer champion and advocate and will use insight, data and rootcauseanalysis to drive through the best possible outcomes for our customers. This role is critical in making a difference to our customers by genuinely ensuring that … Senior Leadership team specifically within Service Support to constantly review the customer online and offline journeys to identify improvement opportunities. Driving customer experience benefits and improvement through focused, proactive rootcauseanalysis and insights. Supporting on the production, completion and delivery of accurate and timely insights, MI and KPI packs, as well as complaints/escalations rootcause information as required. Skills & experience A strong communicator at all levels of the business. Attention to detail and able to interpret information, make sound decisions and take ownership of issues through to resolution. Effective stakeholder management, both internal and external. A results-driven individual who is commercially astute. Strong presentation skills and the ability to present information More ❯
South West London, London, England, United Kingdom
Michael Page Technology
an escalation point for complex issues that junior technicians are unable to resolve. Incident Management: Take the lead on managing critical incidents, ensuring timely resolution and communication with stakeholders. RootCauseAnalysis: Perform rootcauseanalysis for recurring issues and recommend long-term solutions. Process Improvement: Identify areas for process improvement within the service More ❯
data layers capture what matters most. Adopt a consultative approach with CRO, Digital Product, and E-commerce teams to define tracking requirements, measure test results, and provide deep-dive rootcauseanalysis on performance. Manage and guide complex analytical projects, such as customer journey analysis, segmentation, or rootcauseanalysis, to explain what … structures for web and app. Advanced SQL and strong experience with BigQuery and Databricks. Strong experience with data visualisation tools, preferably Tableau or Looker. Proficiency in Python for data analysis is a strong plus. Experience with advanced analytics techniques, such as segmentation, and familiarity with A/B testing tools and CRO analysis. Experience with Git (or similar) for More ❯
IT Service Management Analyst/Problem Manager ITSM, ITIL, Problem Management, Incident Management, Change Management, RootCauseAnalysis, Enterprise Environments, Analysis, Reporting; Permanent, London (3/2 Hybrid), £55k - £62k +Bonus +Bens Global Law firm seeks Problem Manager/IT Service Management Analyst to join the Technology team particular focus on RootCauseAnalysisMore ❯
and response to cyber and data handling incidents, including misdirected emails, unauthorized data access, and policy violations. Support containment, eradication, and recovery efforts for Cyber and data-related incidents. RootCauseAnalysis & Reporting Contribute to rootcauseanalysis to determine the origin and impact of incidents. Document incidents thoroughly and support preparation of detailed More ❯
i.e., "TOIL"). Participate in operations support and on-call rotation shifts, for SRE supported systems and products. Participate in or lead problem management activities , including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a rootcauseanalysis to troubleshoot priority incidents. Implement automation to reduce probability … team leader experience Understanding of software engineering principles (source control, versioning, code reviews, etc.) Working in an environment that complies with ISO27001, NIST, CIS Benchmarks, PCIDSS amongst others Leading rootcauseanalysis and blameless postmortems in complex environments Experience of communicating complex issues to senior stakeholders and technical teams. Implementation of highly available and reliable systems, using More ❯
end-users. Serve as the point of technical escalation for DDaT staff, providing leadership and aid in the investigation and diagnosis of complex IT issues and problems (e.g., via rootcauseanalysis). Aide strategic planning for digital assets to ensure they remain fit for purpose and in support of the Trust's goals. 3. Stakeholder and … ability to communicate highly complex or multi-stranded technical information effectively to a diverse audience, including non-Digital managers, to facilitate understanding and secure cooperation. Desirable Experience in coordinating rootcauseanalysis (RCA) or proactive trend analysis to investigate IT-related problems and review process failures, thereby minimizing future service interruptions. Demonstrated experience in evaluating new More ❯
Greater London, England, United Kingdom Hybrid/Remote Options
Clarksons
and incident troubleshooting. Lead SQL Server upgrades, migrations, and implementation of disaster recovery (DR) and high availability (HA) solutions. Conduct annual disaster recovery testing across platforms. Perform incident investigation, rootcauseanalysis, and long-term remediation. Mentor junior DBAs and act as a subject matter expert on modern database practices. Participate in the on-call rota and … experience supporting SSRS, SSAS, and SSIS in hybrid environments. Ability to design and manage scalable monitoring solutions for both cloud and on-prem workloads. Demonstrated strength in problem-solving, rootcauseanalysis, and cross-system diagnostics. Desirable Skills Microsoft certifications such as DP-300, AZ-104, AZ-305, or equivalent. Experience with Azure Databricks, Data Factory, or More ❯
Deploy, configure, and optimize Wiz for continuous cloud security monitoring and compliance management. Identify vulnerabilities, misconfigurations, and risks across AWS, Azure, and GCP environments, and drive remediation efforts. Lead rootcauseanalysis (RCA) for security incidents and coordinate escalations as needed. Partner with software engineering and infrastructure teams to integrate security best practices into CI/CD … engineering using Wiz , AWS , Azure , and GCP . Strong understanding of cybersecurity principles , risk and controls , and internal control frameworks . Proficiency in incident response , security issue escalation , and rootcauseanalysis . Hands-on experience with security automation , DevSecOps tools , and infrastructure as code (e.g., Terraform, CloudFormation). Excellent problem-solving skills and ability to think More ❯
containerization, orchestration, monitoring, logging and alerting systems) for both our client facing APIs and large training runs Participate occasionally in on call rotations to respond to incidents and perform rootcauseanalysis to prevent future occurrences Development (50%) Drive continuous improvement in infrastructure automation, deployment, and orchestration using tools like Kubernetes, Flux, Terraform Collaborate with AI/… 7+ years of experience in a DevOps/SRE role Strong experience with cloud computing and highly available distributed systems Exposure to site reliability issues in critical environments (issue rootcauseanalysis, in production troubleshooting, on call rotations ) Experience working against reliability KPIs (observability, alerting, SLAs) Hands on experience with CI/CD, containerization and orchestration tools More ❯
career in a SOC environment and now works primarily in incident and threat response. The role Lead investigations into live security incidents including malware, phishing, and endpoint compromise Perform rootcauseanalysis, containment, and recovery actions Tune detection rules and develop new use cases to improve response times Utilise Microsoft Defender, Sentinel, and Azure Security tools to … incident response experience Strong working knowledge of SIEM and EDR tools (Sentinel, Defender, CrowdStrike, etc.) Solid understanding of Windows, Linux, and network security principles Experience with forensic or threat analysis techniques Familiarity with MITRE ATT&CK, NIST, or similar frameworks Desirable Exposure to automation or SOAR tooling PowerShell or Python scripting skills GIAC or Microsoft security certifications This is … career in a SOC environment and now works primarily in incident and threat response. The role * Lead investigations into live security incidents including malware, phishing, and endpoint compromise * Perform rootcauseanalysis, containment, and recovery actions * Tune detection rules and develop new use cases to improve response times * Utilise Microsoft Defender, Sentinel, and Azure Security tools to More ❯
Atherstone, Warwickshire, England, United Kingdom Hybrid/Remote Options
Aldi
trends and making recommendations. Someone who is resilient and experienced in high pressure environments and a confident individual with great communication skills. Your New Role Subject matter expertise Data analysis, data maintenance, data representation Ability to create business reports Ability to identify gaps in the new processes and define new ones as required Create PowerPoint presentations to illustrate processes … Business readiness Data cleansing and readiness of existing systems Problem solving and rootcauseanalysis through data Process creation, testing and implementation utilising automation Creating training materials Planning and delivery of training to business users Project management Ensures deadlines are complied with in area of responsibility Provides updates to Deployment Lead Reporting of risks, issues and business … Coding experience and analytical work in SQL and Python. Strong Excel skills. Strong commercial acumen. Problem solving skills for identifying supply chain issues and creating automated solutions. Evidence of rootcauseanalysis, process improvement and driving decisions. Strong communication and organisational skills. Project Management skills. Ability to proactively work towards challenging deadlines through multitasking and resilience. Experience More ❯
effective Problem Management, Incident and Major Incident Management, and providing cover for our Change Management processes. You will work closely with technical teams, service owners, and stakeholders to drive rootcauseanalysis, manage high-impact incidents, and support the governance of change activities. This is a hands-on role requiring strong analytical skills, excellent communication, and a … end-to-end Problem Management process in alignment with ITIL v4 practices. Proactively identify and log problems based on incident trends, monitoring data, and stakeholder feedback. Facilitate and lead RootCauseAnalysis (RCA) sessions using structured methodologies (e.g., 5 Whys, Fishbone, Kepner-Tregoe). Maintain and update the Known Error Database (KEDB) and ensure visibility of workarounds … prioritised, and assigned in accordance with business impact and urgency. Maintain high-quality incident records, including timelines, actions taken, and resolution details. Drive post-incident reviews (PIRs) to identify root causes, document lessons learned and ensure follow-up actions are tracked and completed. Integration with the Problem Management processes to ensure recurring incidents are investigated and addressed. Monitor incident More ❯
effective Problem Management, Incident and Major Incident Management, and providing cover for our Change Management processes. You will work closely with technical teams, service owners, and stakeholders to drive rootcauseanalysis, manage high-impact incidents, and support the governance of change activities. This is a hands-on role requiring strong analytical skills, excellent communication, and a … end-to-end Problem Management process in alignment with ITIL v4 practices. Proactively identify and log problems based on incident trends, monitoring data, and stakeholder feedback. Facilitate and lead RootCauseAnalysis (RCA) sessions using structured methodologies (e.g., 5 Whys, Fishbone, Kepner-Tregoe). Maintain and update the Known Error Database (KEDB) and ensure visibility of workarounds … prioritised, and assigned in accordance with business impact and urgency. Maintain high-quality incident records, including timelines, actions taken, and resolution details. Drive post-incident reviews (PIRs) to identify root causes, document lessons learned and ensure follow-up actions are tracked and completed. Integration with the Problem Management processes to ensure recurring incidents are investigated and addressed. Monitor incident More ❯
Somerset, England, United Kingdom Hybrid/Remote Options
Reed
new AWS services or DevOps tools to continuously enhance infrastructure capabilities. Produce and maintain platform documentation and runbooks, ensuring knowledge is shared and accessible. Contribute to incident response and rootcauseanalysis for infrastructure-related issues. Track and report platform metrics, including performance, cost efficiency, and security posture. Required Skills & Qualifications: Proven hands-on experience managing AWS … of cloud security best practices. Experience with monitoring, logging, and alerting tools. Proficiency in scripting or automation languages (Python, Bash, or PowerShell). Track record of incident response and rootcauseanalysis in cloud environments. If you are interested in this position please apply online or for more information contact me on More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Reed Technology
new AWS services or DevOps tools to continuously enhance infrastructure capabilities. Produce and maintain platform documentation and runbooks, ensuring knowledge is shared and accessible. Contribute to incident response and rootcauseanalysis for infrastructure-related issues. Track and report platform metrics, including performance, cost efficiency, and security posture. Required Skills & Qualifications: Proven hands-on experience managing AWS … of cloud security best practices. Experience with monitoring, logging, and alerting tools. Proficiency in scripting or automation languages (Python, Bash, or PowerShell). Track record of incident response and rootcauseanalysis in cloud environments. If you are interested in this position please apply online or for more information contact me on More ❯
Glasgow, Scotland, United Kingdom Hybrid/Remote Options
GIOS Technology
and automation best practices across teams. Key Responsibilities: Design, build, and optimize automation frameworks , observability tools , and incident response mechanisms to improve system reliability. Manage and troubleshoot incidents , perform rootcauseanalysis , and implement preventive measures to avoid recurrences. Collaborate with development , infrastructure , and product teams to integrate SRE best practices into the software delivery lifecycle. Develop … and automation technologies to enhance system performance and resilience. Key Skills Python, PowerShell, Go, Automation, SRE, Incident Management, Troubleshooting, Systems Engineering, Cloud Computing, Networking, Performance Tuning, Monitoring, Capacity Planning, RootCauseAnalysis, DevOps, Communication Skills, Problem Solving, Reliability, Scalability More ❯