1 to 25 of 96 Root Cause Analysis Jobs in London

RCA Analyst

Hiring Organisation
Hays
Location
New Malden, England, United Kingdom
Analyst Reporting to : Incident, Problem, MI & RCA Manager Location: UK, Germany Role Objectives: Key Results/Areas : The Root Cause Analysis (RCA) Analyst supports the Incident, Problem, Major Incident & RCA Manager by overseeing the process of root cause investigations, data analysis, and documentation … focuses on identifying underlying causes of incidents and problems, enabling sustainable corrective and preventive actions that improve service stability and operational resilience. Core Responsibilities: Root Cause Analysis Oversee the process of in-depth Root Cause Analyses for incidents, major incidents, and recurring issues. In conjunction ...

EMEA Moderation Quality Assurance Specialist

Hiring Organisation
TikTok Shop
Location
City of London, London, United Kingdom
quality of BPO sites, provide indepth RCA for critical issues as well as the implementation of effective action plans . Roles & Responsibilities Content Review & Analysis • Conduct comprehensive review and analysis of sellers, products, content and Intellectual Property Rights (IPR) to ensure compliance with platform policies • Perform daily quality … assessments of moderation decisions across multiple content categories • Identify patterns and trends in content violations through systematic data analysis • Review and evaluate the accuracy of content moderation decisions made by BPO teams or machine moderation Quality Assurance & Process Improvement • Help build quality assessment processes and conduct quality assessment work ...

Front Office Production Engineer - SRE, Linux, Oracle, Root Cause Analysis, Incident Management

Hiring Organisation
Morson Edge
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
keep the production environment both highly and stable and available for daily trading activity. Primary responsibilities will be split across incident management & root cause analysis, working with development teams to resolve issues, whilst facing off to Front Office users to handle queries, provide progress reports and generally … Observability tooling Python and Shell Scripting skills for automation purposes Capability to work across end-to-end Production Support covering initial incident response, root cause, gap analysis, bug fixing through to full recovery and resolution Understanding of the trade life cycle from pre-to-post trade ...

数据科学家

Hiring Organisation
JD.COM
Location
London Area, United Kingdom
Responsibilities 1. Operational data analysis for international shared services: Understand the operational processes of shared services, comprehensively monitor key performance indicators, promptly detect data fluctuations and conduct in-depth root cause analysis, swiftly identify business anomalies and risk points, and provide solutions; 2. Aggregate international business … product lines across international business units, master current data storage methods and logical frameworks, consolidate international business data, promptly detect fluctuations and conduct root cause analysis; 3. Analyse international cash flow statements: Collaborate with Treasury and Accounting teams to understand JD's capital operations and data sources ...

Problem Management Analyst

Hiring Organisation
Hays Specialist Recruitment Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £58,000 per annum
mobile number and email address are available on my LinkedIn profile. For this role, you must be able to demonstrate: Hands-on ownership of Root Cause Analysis for major incidents (P1/P2) Experience applying structured RCA methodologies (5 Whys, Fishbone, Fault Tree, Kepner-Tregoe) Ability … process, and organisational causes of failure Experience working within Incident, Major Incident, and Problem Management functions Confidence to challenge engineering teams and vendors on root causes and corrective actions Experience producing high-quality RCA reports (timeline, impact, contributing factors, actions)This role is not suitable for candidates whose experience ...

Problem Management Analyst

Hiring Organisation
Hays
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£55,000
mobile number and email address are available on my LinkedIn profile. For this role, you must be able to demonstrate: Hands-on ownership of Root Cause Analysis for major incidents (P1/P2) Experience applying structured RCA methodologies (5 Whys, Fishbone, Fault Tree, Kepner-Tregoe) Ability … process, and organisational causes of failure Experience working within Incident, Major Incident, and Problem Management functions Confidence to challenge engineering teams and vendors on root causes and corrective actions Experience producing high-quality RCA reports (timeline, impact, contributing factors, actions) This role is not suitable for candidates whose experience ...

Data Ops Manager – Azure Data Platform

Hiring Organisation
Hunter Bond
Location
City of London, London, United Kingdom
stability of an Azure-based data platform (Synapse, Databricks, ADF, Power BI). Act as the primary escalation point for incidents, leading resolution, root cause analysis, and clear stakeholder communications. Define, implement, and enforce SLAs for pipelines, datasets, and reporting assets. Lead FinOps initiatives, working with business … operational strategy with business objectives and platform roadmap. Required Skills & Experience Proven experience managing operations for large-scale data platforms. Skilled in incident management, root cause analysis, and SLA enforcement. Hands-on experience with Azure Synapse, Databricks, ADF, and Power BI. Experience with CI/CD, automation ...

Azure Architect Linux

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £550/day
Apache, Varnish for caching, and HAProxy for load balancing. Team Leadership: Manage and technically govern a team of 5+ infrastructure engineers, providing guidance on root-cause analysis and complex troubleshooting. Required Skills & Experience Experience: 12+ years in IT, with at least 5+ years specifically focused on Azure … storage like NFS/SMB). Network/Security: Deep understanding of Azure Networking, SIAM, and data security. Troubleshooting: Expert-level ability to perform root-cause analysis on processor, memory, IO, and cluster-related issues. Desirable Skills Automation: Familiarity with Configuration Management tools like Ansible or Chef ...

SRE Transformation Lead (Global Banking & Payments)

Hiring Organisation
Pontoon
Location
London, United Kingdom
Employment Type
Contract
background with the ability to drive automation and reduce manual toil through code, tooling, and process redesign. Deep knowledge of incident response, problem management, root cause analysis, and operational resilience practices in mission critical environments. Strong stakeholder management skills, able to influence technology and business partners … eliminate operational toil through automation, enhancing engineering practises and operational tooling. Incident & Problem Management : Strengthen incident response frameworks and improve production outcomes through effective root cause analysis and preventive engineering. Observability & Tooling : Establish observability standards to enhance service monitoring, partnering with teams to align SRE needs with ...

Linux Engineer / Architect

Hiring Organisation
Randstad Digital
Location
City of London, London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £550 per day
troubleshooting, security hardening, and automation of our global server estate. This is a role for a technical heavyweight who thrives on solving complex root-cause issues and enjoys the architectural challenge of ensuring 24/7 availability for high-traffic environments. Key Responsibilities Provide expert-level guidance … engineering teams and lead root-cause analysis on complex system-wide issues. Linux Engineering: Manage and optimise a large-scale estate across RedHat, Ubuntu, and CentOS, ensuring peak performance of Apache, Nginx, and Tomcat stacks. Automation & IaC: Drive efficiency through the development of custom Ansible, Puppet ...

Senior Reliability & Support Engineer (Azure)

Hiring Organisation
TrueNorth®
Location
Kingston Upon Thames, England, United Kingdom
problem. This is not a pure DevOps or cloud build role. We’re looking for someone who can investigate production issues, trace root cause, work across application and infrastructure layers, and help reduce recurring incidents through better monitoring, automation and operational insight. What you’ll be doing Investigating … Experience in third-line support, SRE, cloud operations or application support Strong Azure experience in a live SaaS/software environment Strong troubleshooting and root cause analysis skills Experience with Application Insights, Azure Monitor, Log Analytics and KQL SQL skills for investigation and remediation PowerShell and/ ...

Service Desk Manager

Hiring Organisation
Context Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£65000 - £70000/annum
ticket quality * Produce regular service reporting (SLAs, backlog, ticket ageing, trends, first-time fix rates) and provide insights to leadership * Drive continuous improvement through root cause analysis, gap analysis and service optimisation initiatives * Coordinate service desk involvement in projects, rollouts, migrations and onboarding/offboarding activities ...

Graduate Analyst - Transaction Reporting (Managed Services)

Hiring Organisation
Novatus Global
Location
City of London, London, United Kingdom
Regulation). This role is suitable for recent graduates or early-career professionals looking to build experience in financial markets, regulatory reporting, and data analysis within a fast-growing RegTech environment. Working as part of a team, the Analyst will contribute to projects that help clients identify reporting gaps … SFTR. Support the preparation of regulatory documentation, client reports, and remediation plans. Assist in documenting regulatory interpretations, reporting procedures, and operating models. Data Analysis & Quality Assurance Perform data reconciliation between front office systems, middle office systems, and regulatory reporting outputs. Conduct root cause analysis on reporting ...

AWS DevOps Engineer

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£60000 - £80000/annum
pipelines with Jenkins, GitHub Actions, or AWS Code Pipeline Linux Systems &Troubleshooting Perform administrative and troubleshooting tasks on Linux-based systems, including log analysis and performance tuning. Lead technical triage and root cause analysis for infrastructure-related issues Containerization & Orchestration Develop and deploy applications using Docker ...

Application Support Analyst

Hiring Organisation
Apex Hunt Ltd
Location
London Area, United Kingdom
resolving incidents quickly Incident & Problem Management • Monitor trading systems and proactively identify issues • Manage incidents in line with SLA/KPI targets • Perform root cause analysis and implement long-term fixes • Escalate critical issues appropriately and communicate effectively with stakeholders Application & System Support • Support a range … tasks • Improve monitoring, alerting, and documentation • Contribute to system enhancements and projects ⸻ Required Skills & Experience Technical Skills • Strong knowledge of: • SQL (query writing, data analysis) • Linux/Unix commands • Scripting (Python, Shell, or similar) • Understanding of system architecture (APIs, messaging, batch processes) • Experience with monitoring tools (e.g., Geneos, Splunk ...

Data Analyst

Hiring Organisation
Intellect Group
Location
London Area, United Kingdom
data and reporting responsibilities. What You’ll Be Doing: Owning BAU analytics and reporting: recurring KPI packs, portfolio/credit performance reporting, trend analysis, and stakeholder updates Building and maintaining clean, analysis-ready datasets from messy real-world sources (portfolio, transaction, performance, behavioural and macro/market inputs … assumptions and ensuring outputs are reproducible Developing scalable data workflows in SQL + Python (ingestion, cleaning, transformation, QA) Improving data quality: reconciliation, anomaly detection, root-cause analysis, and automated checks Building and maintaining dashboards for multiple stakeholders, ensuring data is accurate, timely, and clearly presented Collaborating with ...

Service Delivery Manager

Hiring Organisation
KPMG UK
Location
London Area, United Kingdom
relationships with customers Track performance of services and prepare reporting on SLAs & KPIs Produce regular management reports for customers and attend review meetings Provide analysis, feedback and actions based on trends, root cause analysis and other reports Manage service improvement plans, inclusive of formalized creation ...

Solace Administrator

Hiring Organisation
BGC Group
Location
London Area, United Kingdom
Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace ...

Senior Engineer

Hiring Organisation
Colt Technology Services UK
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Leadership Serve as the go-to technical expert for Windows Server, Citrix, VDI, Microsoft 365, and cloud platforms (Azure & GCP). Provide advanced troubleshooting, root cause analysis, and resolution for complex incidents. Define and enforce best practices for infrastructure design, security, and governance. Drive automation and Infrastructure ...

Applications Support Engineer

Hiring Organisation
FBI &TMT
Location
Harlow, London, United Kingdom
Employment Type
Permanent
support activities post-go-live. Monitoring, Incident Management & Continuous Improvement: Implement and maintain monitoring solutions to ensure system availability and performance. Manage incident triage, root cause analysis, and problem resolution for platform-related issues. Identify opportunities to enhance stability, resilience, and operational efficiency of the platform. … Strong understanding of Service Management frameworks such as ITIL. Solid background in: Operating system management (Windows/Linux) Application troubleshooting and performance optimisation Log analysis, system monitoring, and incident management Ability to collaborate effectively with technical and non-technical stakeholders. Experience working in structured delivery environments; familiarity with Agile ...

Azure Cloud Platform Engineer

Hiring Organisation
Reed
Location
Central London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£75,000 - £80,000 per annum, Inc benefits
cloud security best practices, ensuring alignment with regulatory and organisational standards. Operations & Incident Management Collaborate with support and operations teams to resolve incidents, perform root cause analysis and deliver long-term fixes. Cross-Functional Collaboration Work closely with developers, QA, product teams and platform engineers to embed ...

Deskside Engineer

Hiring Organisation
Cerco
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£165 - £214 per day
Tier 2/3) Own and resolve complex, high-impact, and escalated support issues (L3) that cannot be resolved by Tier 1 staff. Perform Root Cause Analysis (RCA) for recurring incidents, developing and implementing permanent solutions across the organization. Provide specialized, discrete support for executive ...

Data Engineer

Hiring Organisation
JSS
Location
London Area, United Kingdom
identifying and resolving data issues and performance bottlenecks Act as a technical point of contact during financial close periods (monthly, quarterly, year-end) Perform root cause analysis on data issues and implement effective solutions Collaborate with IT and business teams to resolve data challenges and improve processes … VS2022+) Knowledge of data warehousing concepts and architectures Desirable: Experience supporting financial systems (e.g. accounting or policy systems) Azure DBA experience Knowledge of Azure Analysis Services or SSAS Experience with C#, ASP.NET MVC, or web application development Familiarity with AI tools such as GitHub Copilot Exposure ...

DevOps Engineer (SC + NPPV3 Cleared)

Hiring Organisation
Syntax Consultancy Limited
Location
Croydon, Surrey, South East, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
500/day (Outside IR35)
Java. PostgreSQL admin, performance tuning, HA, replication, backup/DR, Git, Jira, Confluence, ServiceNow. Monitoring tools: Grafana, Prometheus + Alert Manager. Key Tasks: troubleshooting, root cause analysis, 24/7 production, managing technical escalations, leading workshops, supporting Agile teams + removing delivery blockers. Advantageous Skills: Linux ...

DevOps Engineer (SC + NPPV3 Cleared)

Hiring Organisation
Syntax Consultancy Limited
Location
Croydon, Greater London, UK
Java. PostgreSQL admin, performance tuning, HA, replication, backup/DR, Git, Jira, Confluence, ServiceNow. Monitoring tools: Grafana, Prometheus + Alert Manager. Key Tasks: troubleshooting, root cause analysis, 24/7 production, managing technical escalations, leading workshops, supporting Agile teams + removing delivery blockers. xehkeey Advantageous Skills: Linux ...