Root Cause Analysis Jobs in the City of London

1 to 25 of 63 Root Cause Analysis Jobs in the City of London

Infrastructure Manager

City of London, Greater London, UK
Mentmore
including on-prem services). Ensure we have Disaster Recovery environments that are fit for purpose and regularly tested. Take ownership of major incidents related to cloud infrastructure, including root cause analysis and corrective actions. Be available outside of standard working hours if required for high-priority incidents, demonstrating a commitment to seeing critical issues through to More ❯

Senior FX Production Support Engineer

City of London, Greater London, UK
Radley James
role supporting the bank’s electronic FX trading platforms. Key Responsibilities Incident Management: Respond rapidly to production incidents with data driven decision making, minimizing downtime and financial impact. Lead root cause analysis and conduct blameless post-mortems Monitoring & Automation: Enhance application health monitoring and implement automation to reduce manual intervention and improve system resilience System Optimization: Drive More ❯

Solace Messaging Administrator

City of London, London, United Kingdom
BGC Group
and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low More ❯

Cyber Security Engineer

City of London, Greater London, UK
Hybrid / WFH Options
Infinigate UK & Ireland
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process through Root Cause Analysis. Respond to and resolve security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies. Understanding of Incident Response, Cyber Kill Chain, ATT&CK. Knowledge and experience of common programming languages, e.g., Python, C++, PowerShell, JavaScript. Ability to perform Root Cause Analysis. Experience with vulnerability assessments. Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi More ❯
Employment Type: Full-time

Senior Data Analyst - Governance

City of London, Greater London, UK
Hybrid / WFH Options
dnevo Partners
and follow-up actions. Work closely with cross-functional teams on data-related projects and continuous improvement initiatives. Identify and investigate data quality issues, contributing to the development of root cause analyses and solutions. Stay up-to-date with evolving data technologies, tools, and industry trends. Support the definition of data quality methodologies and standards across the business. More ❯

Production Application Support Linux - Fixed Income

City of London, London, England, United Kingdom
Hybrid / WFH Options
Client Server Ltd
systems. You'll manage production systems, acting as an escalation point for operational teams on a 24x7 basis, taking ownership of Major Incident Management, incident response, problem management and root cause analysis. Employing a data-driven approach you will drive production stability via metrics and reporting, partnering closely with Application Development teams and external vendors, ensuring accurate logging More ❯
Employment Type: Full-Time
Salary: £80,000 - £95,000 per annum

Linux Sys Admin Manager

City of London, London, United Kingdom
Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking root cause analysis including prevention orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such More ❯
Employment Type: Permanent, Work From Home

Lead Systems Administrator - Linux

City of London, London, United Kingdom
Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking root cause analysis including prevention orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such More ❯
Employment Type: Permanent, Work From Home

SAP QA Testing & Business Analyst

City of London, London, United Kingdom
HCLTech
integration applications. Perform functional, integration, regression, and user acceptance testing. Validate system changes through ServiceNow Change Requests and ensure updates align with CMDB standards. Log and track defects, perform root cause analysis, and work closely with development teams for resolution. Ensure QA processes align with ITIL framework and banking governance standards. 2. Business Analysis: Gather, document, and … business needs into clear specifications, user stories, and process flows. Collaborate with project managers, developers, and QA teams to ensure delivery aligns with regulatory and operational expectations. Support gap analysis, impact assessments, and end-to-end process mapping for SAP-ServiceNow related changes. Ensure traceability of requirements through testing and implementation. IMPLEMENTATION ARRANGEMENTS The Quality Assurance (QA) Analyst will More ❯

Information Security Analyst

City of London, Greater London, UK
NorthMark Strategies
AWS Responsibilities: Monitor security event logs and alerts generated by various security technologies, including SIEM, IDS/IPS, firewalls, and endpoint protection systems. Conduct host forensics, network forensics, log analysis, and malware triage in support of incident response investigations. Identify, analyze, and assess potential insider threats through behavioral analytics, log review, and threat intelligence. Maintain and improve SOC processes … and refine insider risk policies to ensure they are effective and up to date. Develop and implement automated processes for monitoring and enforcing insider risk policies. Participation in security root cause analysis and forensics as part of NorthMark Strategies’ Cyber Incident Response Plan. Develop comprehensive and accurate reports and presentations for both technical and executive audiences. Stay More ❯

Platform Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Tate Recruitment
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, root cause analysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting More ❯

Senior Network Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Laser Digital
Monitoring & Observability Build and manage comprehensive monitoring and logging systems for network performance, latency, and availability. Implement observability frameworks using modern tools to provide real-time insight and support root-cause analysis. Collaboration & Project Leadership Act as a key stakeholder in cross-functional teams, working with Infrastructure, Security, DevOps, and Application teams to deliver secure and high-performance More ❯

Exec/VIP IT Support Engineer

City of London, London, England, United Kingdom
Human Capital Ventures
and peripheral equipment for executives. Mobile device support and advanced troubleshooting skills (Apple & Android technologies). Proactively identify potential technical issues and implement preventive solutions and advanced troubleshooting and root cause analysis. Liaising with and delegating tasks to relevant teams for escalation. Supporting the Exec Support Specialist and escalating support issues to the Head of IT where necessary. More ❯
Employment Type: Full-Time
Salary: £34,000 - £40,000 per annum, Inc benefits

Executive IT Support Engineer

City of London, London, England, United Kingdom
Human Capital Ventures
and peripheral equipment for executives. Mobile device support and advanced troubleshooting skills (Apple & Android technologies). Proactively identify potential technical issues and implement preventive solutions and advanced troubleshooting and root cause analysis. Liaising with and delegating tasks to relevant teams for escalation. Supporting the Exec Support Specialist and escalating support issues to the Head of IT where necessary. More ❯
Employment Type: Full-Time
Salary: £32,000 - £36,000 per annum, Negotiable

AWS Head of Site Reliability Engineering (Must hold current SC)

City of London, Greater London, UK
Amber Labs
SRE principles, such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets, to drive the team's focus on reliability. Incident Management: Lead incident response efforts, root cause analysis (RCA), and post-incident reviews to improve system reliability. Ensure rapid response to production issues and minimize downtime. Performance Optimization: Drive initiatives for performance tuning … architecture and design. SRE Best Practices: Deep understanding of SRE principles and frameworks, including SLOs, SLIs, and Error Budgets. Incident Management: Proven experience in incident management, including response, recovery, root cause analysis, and post-mortem reporting. Automation Tools: Proficient in automation tools like Terraform, CloudFormation, Jenkins, and other CI/CD tools. Preferred Qualifications: Certifications: AWS Certified More ❯

Senior Site Reliability Engineer

City of London, London, United Kingdom
TRIA
a hands-on leadership role - you won’t just guide others, you’ll be the go-to expert when systems are under pressure. You'll lead incident response, own root cause analysis, and solve performance issues like memory leaks, outages, and flaky services. Your focus will include: Leading incident management, post-mortems, and blameless RCAs. Building scalable More ❯

Site Reliability Engineer

City of London, Greater London, UK
Hybrid / WFH Options
Explore Group
and scale Kubernetes clusters hosting critical microservices Design and enhance observability, alerting, and incident response processes Collaborate closely with engineers to ensure systems are reliable, secure, and performant Lead root cause analysis for production incidents and help prevent recurrence Build tooling to automate repetitive tasks and improve deployment pipelines (CI/CD) Participate in on-call rotation More ❯

Solace Messaging Administrator

City of London, London, United Kingdom
H&P Executive Search
have experience with tools such as Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers. Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low More ❯

Messaging Administrator - Solace

City of London, Greater London, UK
Marlin Selection Recruitment
prem environments. What You’ll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on root cause analysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯

Data Analyst

City of London, Greater London, UK
Hybrid / WFH Options
idpp
for implementing and maintaining PySpark data tables aligned with established data models and best practices. This includes translating business requirements into production-level code, particularly focused on payment cost analysis, and ensuring data quality through systematic validation. You’ll work closely with senior analysts and stakeholders to refine data pipelines, improve infrastructure, and document technical processes to ensure team … and impact. Responsibilities Working independently to collect, prepare, and write production-ready PySpark code Translating business questions into structured data problems and insights Utilizing big data platforms to run root cause analyses and data reconciliations Maintaining reports, metrics, and data workflows within your scope Communicating findings clearly to stakeholders with varying technical expertise Participating actively in Agile team More ❯

Technical Analyst

City of London, Greater London, UK
Hybrid / WFH Options
Halian
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and root cause analysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯

Digital Operations Manager, IT Manager, IT Operations Manager

City of London, Greater London, UK
Experis UK
operational performance, and security compliance. Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct root cause analysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience as a Digital Operations Manager, IT Manager More ❯

Project Manager

City of London, London, United Kingdom
Akkodis
maintain project plans, schedules, and budgets. Manage and control project costs and financial performance, including approval of timesheets. Facilitate stakeholder meetings to align project goals and address concerns proactively. Conduct root cause analysis for issues and propose corrective actions. Oversee project scope, risks, and changes, ensuring alignment with project objectives. Prepare and present status reports to clients and … feedback constructively. Required Skills Strong leadership and problem-solving abilities. Excellent communication and interpersonal skills. Proficiency in project management tools and techniques. Ability to work independently with some oversight. Root cause analysis and continuous improvement mindset. Preferred Skills A recognised project management certification, such as CAPM, Prince2, or APM, is preferred. Demonstrable understanding of both Agile and More ❯

Fraud Data Analyst

City of London, Greater London, UK
Consortia
We are seeking an experienced Fraud Data Analyst to support the detection, analysis, and prevention of fraud across card and wire payment channels. This role plays a vital part in a dynamic compliance and risk function, using advanced data techniques to help protect customers and the integrity of payment systems. You will work … detect patterns associated with fraudulent behaviour. Respond to real-time alerts and proactively identify potential fraud threats. Develop and refine fraud rules and detection models to improve efficiency. Conduct root-cause analysis of false positives to fine-tune detection strategies. Ensure all fraud-related processes remain compliant with current financial regulations. Data Analysis and Reporting Extract … similar role within payments, fintech, or banking. Expertise in fraud typologies including phishing, CNP fraud, and payment fraud. Strong skills in: SQL – complex queries and data extraction; Python – for analysis, automation, and model development; data visualisation tools such as Power BI, Metabase, or Tableau; statistical methods, including regression and hypothesis testing. Basic understanding of machine learning as applied to More ❯

Infrastructure Engineer

City of London, Greater London, UK
Franklin Fitch
support Deploy patches and updates across hardware, software, and network environments Support system architecture, integrations, and high availability infrastructures Maintain compliance with ISO 27001 and Cyber Essentials Plus Conduct root cause analysis (RCA) and document major incidents/problems Skills & Experience: Essential: Proficiency with Windows 11, macOS, Linux, and Office 365 Experience with Microsoft Azure, and Microsoft More ❯

Root Cause Analysis in the City of London
10th Percentile: £62,500
25th Percentile: £65,000
Median: £71,250
75th Percentile: £83,125
90th Percentile: £98,750