London, South East England, United Kingdom Hybrid / WFH Options
Robertson Sumner
Monitor application performance and proactively identify potential issues before they impact the business. Lead or support incident response for application outages or degradation, including root cause analysis and reporting. Maintain and update support documentation and knowledge bases. Carry out compliance activities such as user access reviews and
Warwickshire, West Midlands, United Kingdom Hybrid / WFH Options
Bis Henderson
planning concepts, including forecasting, safety stock, parameters, master data and technical setup/configuration. Problem-Solving & Detail Orientation: Skilled in identifying data anomalies, performing root cause analysis, and implementing lasting solutions with high attention to detail. Effective Communication & Continuous Improvement: Strong communicator with the ability to train
Waterlooville, Hampshire, South East, United Kingdom Hybrid / WFH Options
The Workshop
when necessary to resolve complex issues. Ensure proper communication and information transfer when escalating issues, providing all necessary details to facilitate fast resolution. Perform root cause analysis on recurring or complex issues to prevent future occurrences. Provide reports and documentation on findings to management and other teams
system availability and rapid incident response. Document system architecture, processes, and procedures to ensure knowledge sharing and continuity. Continuously improve system reliability through root cause analysis and implementing best practices. Qualifications: Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience. Proven experience
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
monitoring of production environments, design and implementation of automation and processes to improve efficiency and effectiveness, taking a lead in incident response, troubleshooting and root cause analysis activities to mitigate future issues. You'll collaborate with senior business stakeholders to gather requirements, address concerns and provide updates
improve order completion rates • Create performance dashboards and reporting systems for logistics KPIs • Collaborate with warehouse, transportation, and delivery teams to implement improvements • Lead root cause analysis for delivery delays and missed targets BASIC QUALIFICATIONS Requirements: • 5+ years experience in logistics, supply chain, or operations management • Strong
high availability and reliability. Implement and manage automation tools and IaC solutions to streamline processes and improve system efficiency. Lead incident response efforts, including root cause analysis for critical failures. Mentor and guide junior engineers, fostering a culture of learning and improvement. We reserve the right to
Glasgow, Central Scotland, United Kingdom Hybrid / WFH Options
Net Talent
desk teams to ensure project success and client satisfaction. ✅ Troubleshooting & Optimization: Proactively identify and resolve infrastructure performance issues, ensuring maximum uptime and reliability. Conduct root-cause analysis and implement corrective actions to prevent re-occurrence. Required Skills & Experience: 🎯 Core Microsoft Expertise: Strong knowledge of Microsoft 365 design
travel for training or vendor visits. Job Summary: To provide controls expertise to the business, supporting controls related breakdowns, driving continuous improvement initiatives through root cause analysis and assisting in the development of other engineering support departments through coaching, training and mentoring. To assist in the development
are not restricted to: Providing application support services (2LS and 3LS) for existing applications and solutions; including troubleshooting, monitoring, and incident and problem resolution (root cause analysis). Liaising with the Client, their Partners, and third-party suppliers who provide support and development services for the applications
Management - Proficiency in managing and resolving IT service disruptions, detecting, logging and prioritising incidents. Problem Management - Proficiency in identifying and diagnosing IT issues, conducting root cause analysis, implementing long-term solutions, and trend identification. Communication skills - collaborate with cross-functional teams, present technical info and provide training
advance and configure SIEM and EDR systems to optimise threat detection and response in Azure environments. Incident response: investigate and mitigate security incidents, applying root cause analysis and remediation. Security testing: conduct regular application and network security assessments to identify vulnerabilities. Threat intelligence: monitor the cybersecurity landscape
and CI/CD pipelines. Participate in on-call support and take the incident commander role when dealing with critical incidents. Run postmortems and root cause analysis to unlock learnings from incidents. Adhere to agile methodologies and Kanban processes and have a coaching mindset with the ability
with our Learning & Skill development team), create detailed documentation, and maintain up-to-date knowledge bases. Manage incident resolution and problem-solving processes, conduct root cause analysis, and implement preventive measures. Your profile: Apprenticeship as IT specialist or bachelor's degree in information systems, computer science, IT
on key risk indicators (KRIs) and risk events related to AI technologies, providing regular updates to senior management and the board of directors. Conduct root cause analysis of AI risk events and develop action plans to address identified issues and prevent recurrence. Ensure compliance with regulatory requirements
highly integrated team environment and focus on bringing out their strengths. Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and
backend strategy Partner with engineering management, product, and design to define and deliver technical solutions that support business goals Lead complex incident response, drive root cause analysis, and help build systems that are resilient and observable Conduct technical reviews, mentor engineers across levels, and cultivate a strong
regular backup and disaster recovery planning and testing Stay updated on the latest trends and developments in cloud technologies Investigate and document findings from root cause analysis Document procedures and configurations to ensure knowledge sharing within the team Provide technical support and assistance to external stakeholders Work
Responsibilities Independently design, develop, test, document, and deploy reliable backend services and APIs Participate in the on-call rotation and lead incident response and root cause analysis for production issues Collaborate cross-functionally with product, frontend, DevOps, and data teams to deliver backend features end-to-end
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Amazon
as internal stakeholders Work to improve important metrics such as 'mean time to engagement' and 'mean time to communication' for all incident types Facilitate Root Cause Analysis and Post Event Reviews after each event to minimize recurrence Work with key stakeholders across AWS as advocates on behalf
Manchester, North West, United Kingdom Hybrid / WFH Options
Tunstall Healthcare (UK) Ltd
Oversee the monitoring and management of network performance, ensuring optimal uptime and identifying areas for improvement. Lead the resolution of complex network issues, including root cause analysis, and provide expertise in mitigating recurring problems. Conduct regular audits of network configurations to maintain consistency and compliance. Collaborate with
Knowledge of networking concepts and technologies. Experience with cloud platforms like Microsoft Azure or AWS. Diagnosing and resolving technical issues in Windows environments. Performing root cause analysis and implementing solutions. Rewards & Benefits TCS is consistently voted a Top Employer in the UK and globally. Our competitive salary
Greater Manchester, North West England, United Kingdom Hybrid / WFH Options
Outcomes First Group
other sites as needed. KEY RESPONSIBILITIES: Troubleshoot and resolve incidents, major incidents, problems and service requests, providing regular updates to the end user. Conduct root cause analysis of major incidents and problem records, contributing to the implementation of remedies and preventative measures. To be the first point
engineering teams to build security awareness and develop a network of Security Champions. Incident & Response Readiness: Support Smarsh SOC and security incident response, including root cause analysis and post-mortem reviews for your product(s). Security Compliance & Governance: Ensure alignment with regulatory requirements (SOC 2, ISO