1 to 25 of 106 Root Cause Analysis Jobs in London

RCA Analyst

Hiring Organisation
Hays
Location
City of London, London, United Kingdom
Employment Type
Permanent
Hays Recruitment | My contact information is available on my LinkedIn profile For this role, you must be able to demonstrate: Hands-on ownership of Root Cause Analysis for major incidents (P1/P2) Experience applying structured RCA methodologies (5 Whys, Fishbone, Fault Tree, Kepner-Tregoe) Ability … process, and organisational causes of failure Experience working within Incident, Major Incident, and Problem Management functions Confidence to challenge engineering teams and vendors on root causes and corrective actions Experience producing high-quality RCA reports (timeline, impact, contributing factors, actions) This role is not suitable for candidates whose experience ...

Quality Assurance Officer

Hiring Organisation
First Call Contract Services
Location
Erith, Kent, South East, United Kingdom
Employment Type
Permanent
Salary
£28,000
required. Audits & Compliance Conduct daily and weekly internal audits, including: Glass and hard plastic audits Hygiene audits GMP inspections Investigate non-conformances and complete root cause analysis reports. Support external audits and calibration activities. Assist with supplier approval processes and documentation reviews. Liaise with pest control contractors … trials, and technical documentation. Ensure product labelling complies with current legislation and customer requirements. Coordinate the submission of samples for laboratory testing, including: Microbiological analysis Chemical analysis Allergen testing Customer & Supplier Communication Liaise with customers and suppliers on quality-related matters. Manage customer complaints from investigation through ...

Site Reliability Engineer — AWS & Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
issues Leverage AI tooling – Use AI‐assisted development tools (e.g. GitHub Copilot) to accelerate infrastructure work, and explore AI‐driven approaches to incident detection, root cause analysis, and remediation What We're Looking For Essential 3+ years in an SRE, Platform, or DevOps engineering role AWS services … regulated environments or with compliance frameworks Experience with AI‐driven DevOps tooling (e.g. AWS DevOps Agent or similar AI agents for incident resolution, root cause analysis, and operational improvement) Experience with SLIs, SLOs, and error budgets On‐Call We have a 24/7 customer support team ...

Front Office Systems & Devices Manager

Hiring Organisation
FBI & TMT
Location
South East London, London, United Kingdom
Employment Type
Permanent
Salary
£80,000
validators, kiosks, and associated platforms, coordinating rapid resolution with engineering and field teams. Drive operational excellence across: Incident management Ticket prioritisation and queue management Root cause analysis (RCA) Service performance and uptime Work closely with field engineers, maintenance teams, and operational stakeholders to ensure effective resolution … engineering, and operations. Support capacity planning, resource forecasting, and service readiness, ensuring resilience across peak operational periods. Produce regular performance reporting including dashboards, trend analysis, and improvement roadmaps. What We're Looking For Essential Experience Proven experience managing customer-facing technology in operational environments, such as: Ticketing/ ...

Quality Assurance Specialist

Hiring Organisation
Seven Search & Selection
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
high-quality, functional ingredients that help brands meet evolving consumer needs. The role: Lead investigations into customer, supplier, and internal quality non-conformances, conducting root cause analysis and implementing effective corrective and preventative actions. Support operational compliance by preparing for BRC, customer, and third-party audits while … experience in Quality Assurance within food manufacturing, ingredient distribution, or a related FMCG environment. Strong working knowledge of BRCGS, HACCP, GMP, CAPA management, and root cause analysis techniques including FMEA, 5 Whys, and Fishbone. Proven experience conducting supplier audits, managing customer complaints, investigating non-conformances, and implementing ...

Repairs and Test Engineer

Hiring Organisation
Insignis
Location
Greenwich, London, United Kingdom
Employment Type
Temporary
Salary
GBP 40 - 45 Hourly
Role To support manufacturing and customer service operations by: Repairing production failures and customer-returned power transmission equipment Performing component-level diagnostics and root cause analysis Supporting manufacturing test systems and engineering improvements Ensuring product quality and reliability standards are consistently maintained Key Responsibilities Repair and fault … find customer-returned and production-failed power transmission equipment to component level. Diagnose faults using schematics, datasheets, and technical documentation. Conduct root cause analysis on product and field failures. Use test equipment (oscilloscopes, multimeters, etc.) to verify faults and repairs. Perform PCB-level soldering and rework ...

Supply Chain Development Data Analyst UK&I

Hiring Organisation
Ferrero
Location
Greenford, London, United Kingdom
Employment Type
Permanent
analysed to provide meaningful insight into business performance. A key aspect of the role will be helping teams use data more effectively through trend analysis, structured problem-solving and root cause analysis, identifying risks, opportunities and performance improvement initiatives that support business objectives. You will develop … Working closely with Supply Chain leaders and functional teams, you will support performance review meetings, strategic projects and business improvement initiatives by providing robust analysis, scenario modelling and decision support. You will help measure the effectiveness of improvement programmes, track benefits realisation and ensure performance outcomes are clearly understood ...

Site Reliability Engineer

Hiring Organisation
CGI
Location
Greater London, United Kingdom
Employment Type
Full Time
across multiple environments. - Implement and enhance monitoring, alerting, logging, and observability solutions to improve platform reliability and operational visibility. - Investigate incidents, analyse logs, identify root causes, and drive timely resolution of production issues. - Participate in incident response, post-incident reviews, and continuous operational improvement initiatives. - Automate operational tasks … support functions. - Strong hands-on experience with the ELK stack (Elasticsearch, Logstash, Kibana) for logging, monitoring, troubleshooting, and operational analysis. - Demonstrated capability in log analysis, incident investigation, troubleshooting, and root cause analysis. - Strong understanding and practical experience with core SRE practices including: Monitoring and alerting Incident management ...

Infrastructure Server Engineer

Hiring Organisation
Oscar Technology
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£575 - £600 per day
with expertise across servers, networking, automation, and cloud technologies. We're looking for someone who is analytical, methodical, and naturally focused on identifying the root cause of issues rather than simply fixing symptoms. Key Responsibilities Design, implement, and support infrastructure solutions across Windows, Linux, networking, and IoT environments … Deliver Proof of Concepts (PoCs), technology evaluations, and infrastructure enhancements Troubleshoot complex issues and perform detailed root cause analysis Work closely with Network, Security, Risk, and Engineering teams on deployments and integrations Support cloud-connected services, APIs, SaaS platforms, and authentication technologies Manage infrastructure upgrades, migrations ...

Vice President, Risk and Control - Digital Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
mitigation efforts, ensuring understanding of strategic goals. Ensure the department's adherence to internal policies and external regulatory requirements. Manage complex risk-related loss events, conducting root cause analysis, working with Product and Platform owners to develop response plans. Perform applicable operational control checks across Infrastructure and engage with … Essential: Extensive experience leading and managing risk and control teams across multiple regions within a regulated environment. Extensive proficiency in scenario analysis and developing mitigation strategies. Experience representing risk and control on behalf of a large Technology department to an Executive level audience. A strong track ...

Software and Systems Engineer

Hiring Organisation
17918
Location
London, United Kingdom
when combined deliver the features and systems of the vehicle under development. This particular role within CSI will focus on issues resolution, specifically understanding root cause and assisting with resolution of systems and software issues within the vehicle. They will engage directly with complex issues to aid with … issues are understood and prioritised correctly. Essential Skills, Knowledge and Experience Required: Strong knowledge of issue/defect management processes and tools. Proficiency in root cause analysis techniques and problem-solving methodologies. Ability to develop, interpret, and present metrics, KPIs and performance dashboards Solid understanding of software ...

Operational Risk - Director

Hiring Organisation
Mizuho
Location
Greater London, United Kingdom
Employment Type
Full Time
critical business services, impact tolerances, scenario testing and business continuity arrangements. Risk Framework Oversight: Lead oversight of RCSA, KRIs, risk events and scenario analysis; challenge control effectiveness, root cause analysis and remediation actions. Risk Systems: Oversee the integrity, governance and effective use of the firm … regional and Head Office systems (including dual keying where applicable) Oversight of system design, enhancements and user adoption Ensuring risk data supports meaningful reporting, analysis and decision-making Data Risk: Provide independent second line oversight and challenge of data governance frameworks, ensuring alignment with regulatory expectations (e.g. BCBS ...

SC Cleared DevOps Engineer

Hiring Organisation
ed Resourcing Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
Manage identity and access services, including Active Directory, Kerberos authentication, service accounts, and NAS security configurations. Implement and enhance monitoring and alerting solutions, perform root cause analysis, and contribute to improving platform resilience. Work within Agile delivery environments using Jira for backlog management, sprint planning, and defect ...

Data Platform Engineer

Hiring Organisation
17918
Location
London, United Kingdom
architectures. Technical guidance and collaboration with engineering teams to improve database design, data access patterns and application performance. Continuous improvement activities, including incident response, root cause analysis and platform automation. About You and What You'll Bring You're an engineer who enjoys ownership, values continuous improvement ...

Data Engineer (DV Clearable) - Up to £140,000 + Benefits

Hiring Organisation
Sanderson Government and Defence
Location
London, United Kingdom
Employment Type
Permanent
modelling to support analytics and operational use cases Enhance and evolve data platform capabilities to meet new business and analytical needs Conduct detailed data analysis to identify improvements and solve processing challenges Ensure consistency and integrity of data models and architecture Carry out root cause analysis ...

Senior Vice President, Product Manager — Technical & AI-Enabled - Investor Services

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
operate at both strategic and tactical levels. Excellent verbal, written, and interpersonal communication skills. Strong analytical and problem-solving abilities, with a focus on root cause analysis and proactive issue resolution. Bachelor's/University degree required; Master's degree preferred. Nice-to-Haves: Hands-on proficiency ...

PowerShell Automation Engineer (Contractor)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
efficiency. Implement PowerShell best practices, including modular design, parameter validation, error handling, logging, and secure credential management. Perform code reviews and leverage static code analysis tools to ensure high-quality, maintainable solutions. Manage source control repositories and support CI/CD practices where applicable. Solution Design & Documentation Gather … components. Troubleshoot and resolve complex technical issues impacting automation solutions and operational processes. Ensure solutions remain reliable, scalable, secure, and performant. Support incident investigation, root-cause analysis, and continuous improvement activities. Reporting & Analytics Develop and maintain operational and security reporting solutions. Work with SQL data sources ...

Application Support Engineer

Hiring Organisation
Euro Car Parks
Location
Central London, London, United Kingdom
Employment Type
Permanent
types and automations and contributing to its continual improvement Creating, amending and removing user access and permissions across the applications we support Investigating and root-causing complex, cross-system issues spanning our Azure services, databases, messaging layer, reporting platforms and third-party integrations Monitoring production health and acting proactively … remove waste Assisting the Application Support Manager with day-to-day support activities and suggesting refinements to policies and procedures as well as preparing root cause analysis reports on major issues for relevant stakeholders within the business. Analysing trends across incidents and alerts to reduce avoidable downtime ...

Site Reliability Engineer

Hiring Organisation
Huxley Associates
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£90,000/annum + Bonus & Benefits Package
Experience with Infrastructure as Code and GitOps methodologies Hands-on knowledge of observability/APM tools (e.g. Grafana, Datadog, Dynatrace) Proven experience managing incidents, root cause analysis, and on-call support Understanding of SLA/SLO/SLI frameworks and reliability engineering principles Desirable Background in software ...

Senior AWS DevOps Engineer

Hiring Organisation
Data Careers
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
application deployment and operational processes Support containerised workloads using Kubernetes and related technologies Improve platform monitoring, observability, alerting and operational dashboards Support incident management, root cause analysis and problem management Work with security teams to address vulnerabilities, compliance requirements and secure-by-design principles Contribute to cloud ...

Senior/Lead Market Data Support Specialist

Hiring Organisation
IT Search & Select
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £105,000 per annum
infrastructure teams to ensure seamless delivery of real-time and reference data. You’ll act as the escalation point for major incidents, lead root-cause analysis, and drive continuous improvement across client environments. You’ll also play a key role in onboarding new clients, shaping best practices ...

Cloud Operations Owner

Hiring Organisation
DS SMITH PACKAGING LIMITED
Location
City of London, London, United Kingdom
Employment Type
Permanent
with partners and internal teams Leading service reviews and maintaining robust operational reporting Governing change, release, and incident management processes to minimise disruption Conducting root cause analysis and implementing preventative improvements Overseeing AWS infrastructure operations across compute, storage, networking, and security Ensuring compliance with security, audit ...

Global IT Director - Principal Security Engineer

Hiring Organisation
Recruitics
Location
London, UK
cloud platforms, including federation (SAML, OIDC, OAuth), MFA, and Passwordless capabilities. Serve as the primary escalation point for complex IAM engineering issues; perform root cause analysis and drive long‐term remediation and hardening of IAM platforms and related services. Partner with security architecture, infrastructure, application ...

Senior Site Reliability Engineering Manager

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
obligations are met. Incident Response & Escalation Management: Serve as senior escalation point for production incidents across European and GTH market hours; coordinate incident triage, root cause analysis and resolution; provide timely communication to stakeholders; drive post‐incident reviews and remediation tracking. Subject Matter Expertise & Stakeholder Engagement: Provide … expertise on all aspects of platforms; advise teams on moderately complex matters; liaise with vendors and facilities operators supporting critical financial infrastructure. Reporting & Data Analysis: Create and improve reports related to Operations management; analyze technical data sets to troubleshoot or explain perceived issues; use SQL, UNIX shell and other ...

Applications Operations Analyst

Hiring Organisation
Ryder Reid Legal Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
access reviews) Monitor system performance and proactively address issues Respond to incidents and service requests, ensuring SLA adherence Troubleshoot and resolve application issues, conducting root cause analysis Support system configurations, integrations, and deployments Partner with Information Security to remediate vulnerabilities and support audits Contribute to post-incident ...