1 to 25 of 87 Root Cause Analysis Jobs in London

Lead DevOps Engineer

Hiring Organisation
Data Careers
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
Implement progressive delivery practices Reliability & Observability Define and track SLIs/SLOs Enhance monitoring, alerting and incident response processes Lead post-incident reviews and root cause analysis Drive reduction of operational toil Security & Compliance Embed DevSecOps controls into pipelines Implement least-privilege IAM models Support … tooling experience (GitHub Actions, GitLab CI, Jenkins) Experience operating production SaaS environments Strong observability tooling knowledge (Datadog, Prometheus, ELK etc.) Incident management and root cause analysis experience Experience in regulated or security-conscious environments is highly desirable ...

Azure Architect Linux

Hiring Organisation
Randstad Digital
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £550 per day
Apache, Varnish for caching, and HAProxy for load balancing. Team Leadership: Manage and technically govern a team of 5+ infrastructure engineers, providing guidance on root-cause analysis and complex troubleshooting. Required Skills & Experience Experience: 12+ years in IT, with at least 5+ years specifically focused on Azure … storage like NFS/SMB). Network/Security: Deep understanding of Azure Networking, SIAM, and data security. Troubleshooting: Expert-level ability to perform root-cause analysis on processor, memory, IO, and cluster-related issues. Desirable Skills Automation: Familiarity with Configuration Management tools like Ansible or Chef ...

SRE Transformation Lead (Global Banking & Payments)

Hiring Organisation
Pontoon
Location
London, United Kingdom
Employment Type
Contract
background with the ability to drive automation and reduce manual toil through code, tooling, and process redesign. Deep knowledge of incident response, problem management, root cause analysis, and operational resilience practices in mission-critical environments. Strong stakeholder management skills, able to influence technology and business partners … eliminate operational toil through automation, enhancing engineering practices and operational tooling. Incident & Problem Management: Strengthen incident response frameworks and improve production outcomes through effective root cause analysis and preventive engineering. Observability & Tooling: Establish observability standards to enhance service monitoring, partnering with teams to align SRE needs with ...

Senior Developer (Oracle ERP)

Hiring Organisation
Tec Partners
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£650 - £700/day
timely manner. Monitor progress against delivery plans, conducting regular reviews to ensure alignment with service and project objectives. Investigate incidents and perform root cause analysis to identify underlying issues and implement long-term solutions. Manage and resolve L2 and L3 support incidents, working closely with internal teams … Time & Labour Experience managing or coordinating workloads within a technical support or development team. Strong experience working with incident management, problem management and root cause analysis in live service environments. Ability to work on L2 and L3 support incidents, coordinating with internal teams and third-party vendors ...

Senior Reliability & Support Engineer (Azure)

Hiring Organisation
TrueNorth®
Location
Kingston Upon Thames, England, United Kingdom
problem. This is not a pure DevOps or cloud build role. We’re looking for someone who can investigate production issues, trace root cause, work across application and infrastructure layers, and help reduce recurring incidents through better monitoring, automation and operational insight. What you’ll be doing Investigating … Experience in third-line support, SRE, cloud operations or application support Strong Azure experience in a live SaaS/software environment Strong troubleshooting and root cause analysis skills Experience with Application Insights, Azure Monitor, Log Analytics and KQL SQL skills for investigation and remediation PowerShell and/ ...

3rd Line / IT Infrastructure Engineer

Hiring Organisation
SER (Staffing) Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £55,000 per annum
fast-growing MSP. Key Responsibilities Act as a 3rd line escalation point for complex infrastructure incidents. Troubleshoot and resolve business-critical technical issues, including root cause analysis. Support and contribute to major incident management and post-incident reviews. Deliver proactive infrastructure improvements, including monitoring, patching, and optimisation. Provide technical oversight for changes … Required 2–3+ years in a 2nd/3rd Line or Infrastructure role. Experience working within an MSP or multi-client IT environment. Strong troubleshooting and root cause analysis skills. Comfortable supporting complex infrastructure environments. Experience across Microsoft cloud and on-prem infrastructure. Desirable Skills Microsoft certifications (e.g. Azure Administrator ...

Finance Systems Analyst

Hiring Organisation
Ambition Europe Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
Intapp Time Emburse Expense and Invoice OneStream (budgeting and GL reporting) Paperless Billing (Nth Degree) Edicom e-Invoicing eBillingHub BI and reporting solutions (Analysis Services, SSRS, vendor dashboards) Key Responsibilities Systems Analysis & Support Analyse finance systems to identify gaps, inefficiencies, and improvement opportunities Translate finance and business requirements … into functional system specifications Provide day-to-day system support, troubleshooting, and root-cause analysis Ensure data integrity, accuracy, and consistency across systems System Upgrades & Enhancements Support system upgrades, patches, and new releases Coordinate and support testing activities (unit, integration, UAT) Assess the impact of changes ...

AWS DevOps Engineer

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£60000 - £80000/annum
pipelines with Jenkins, GitHub Actions, or AWS CodePipeline Linux Systems & Troubleshooting Perform administrative and troubleshooting tasks on Linux-based systems, including log analysis and performance tuning. Lead technical triage and root cause analysis for infrastructure-related issues. Containerization & Orchestration Develop and deploy applications using Docker ...

Senior DevOps Engineer (Product)

Hiring Organisation
Hive Science
Location
City of London, London, United Kingdom
system reliability and rapid incident response. • Establish SLOs/SLIs and implement observability best practices to maintain high availability and performance. • Lead incident response, root cause analysis, and implement preventive measures to improve system resilience. Security & Governance: • Implement and maintain security best practices including network security, firewalls ...

Solace Administrator

Hiring Organisation
BGC Group
Location
London Area, United Kingdom
Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace ...

Azure Cloud Platform Engineer

Hiring Organisation
Reed
Location
Central London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£75,000 - £80,000 per annum, Inc benefits
cloud security best practices, ensuring alignment with regulatory and organisational standards. Operations & Incident Management Collaborate with support and operations teams to resolve incidents, perform root cause analysis and deliver long-term fixes. Cross-Functional Collaboration Work closely with developers, QA, product teams and platform engineers to embed ...

Senior ODI ETL / OBIEE BI Developer

Hiring Organisation
Proactive Appointments
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £100,000 per annum
multiple banking and finance systems Ensure data quality, consistency and reconciliation across platforms Optimise ETL performance for large-scale financial data volumes Data Management & Analysis Develop complex SQL and PL/SQL for data transformation and analysis Manage data structures to support efficient storage and retrieval Troubleshoot data … related issues and support root cause analysis Business Intelligence & Reporting Develop and maintain BI solutions using Oracle Analytics Server (OAS/OBIEE) Create dashboards, reports and extracts for business and regulatory use Analyse financial and banking data to deliver actionable insights Banking & Finance Domain Support Apply strong ...

IT Manager, Operations Manager, Digital Support Manager

Hiring Organisation
Experis
Location
London, Latchmere, United Kingdom
Employment Type
Permanent
Salary
£70000 - £75000/annum Benefits
Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct root cause analysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience ...

Core Engineer

Hiring Organisation
First Point Group
Location
City of London, London, United Kingdom
expectations, contractual milestones and service level agreements. Responsibilities You will act as a technical expert in IMS, applying a strong analytical approach to troubleshooting, root cause analysis and end-to-end testing. Provide solution expertise and guidance for IMS Client and IMS Application test strategies and test … engineers. Execute client porting and verification testing, ensuring R&D inputs are captured to support development milestones. Produce test execution reports, performance logs and analysis documentation. Manage the lifecycle of trouble tickets, ensuring detailed information is captured for R&D investigation and resolution. Provide technical support for sandbox ...

Cloud Platform Engineer

Hiring Organisation
Understanding Recruitment
Location
North London, London, England, United Kingdom
Employment Type
Full-Time
Salary
£70,000 - £75,000 per annum
pipelines for infrastructure and applications * Monitoring systems, troubleshooting issues, and improving reliability * Implementing security, compliance, and governance best practices * Supporting incident management and root cause analysis * Collaborating with engineering teams to embed DevOps practices The Cloud Platform Engineer will have: * Strong experience with Azure cloud services * Proven ...

Infrastructure operations engineer

Hiring Organisation
Asset Inventories
Location
London Area, United Kingdom
disaster recovery processes Monitoring and Incident Management Experience with Datadog, BigPanda, xMatters, Grafana, and Cribl Respond to alerts, perform incident troubleshooting, and conduct root cause analysis Recommend operational improvements based on incident trends Tools and Process Use Jira for incident, change, and request tracking Maintain documentation, runbooks ...

Systems Consultant / 3rd Line Infrastructure Engineer - Hybrid

Hiring Organisation
vertex-it-solutions
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £55,000 per annum
Azure, Windows Server, Active Directory/Entra ID, Group Policy. Experience with virtualisation (VMware/Hyper-V), networking, firewalls, storage, and automation. Strong troubleshooting, root cause analysis, and proactive improvement skills. Cybersecurity knowledge, backup, and disaster recovery experience. Desirable: Linux, AWS/Google Cloud, VoIP, MSP tooling ...

HPC Systems Administrator

Hiring Organisation
Accenture
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Python, Bash) to streamline operational tasks, monitoring, and reporting. • Document architecture, configurations, processes, and resolutions for compliance, knowledge transfer, and continuous improvement. • Participate in root cause analysis (RCA) and post-incident reviews for compute or HPC-related incidents, implementing preventive measures as needed. Required Skills: • Expertise ...

Platform Engineer - 12-Month FTC

Hiring Organisation
Robert Walters
Location
London, South East, England, United Kingdom
Employment Type
Temporary
Salary
£85,000 - £95,000 per annum
including installation, configuration, maintenance, troubleshooting, optimisation, and user support. Extensive experience operating across Windows and Linux environments within enterprise settings. Proven proficiency in conducting root cause analysis for complex IT incidents utilising structured problem-solving methodologies. Familiarity with regulatory standards such as DORA or GDPR pertinent ...

Workflow Developer

Hiring Organisation
Shaw Daniels Solutions
Location
London Area, United Kingdom
firm’s change control and release processes. Maintain accurate, audit-ready documentation for all customisations, integrations, and data flows. Participate in problem reviews and root cause analysis where workflows or data issues are involved. Ensure solutions are secure, resilient, and compliant with legal-sector data handling standards ...

PingFederate Engineer

Hiring Organisation
Square One Resources
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
GBP 480 Daily
load-balanced deployments suitable for regulated production environments. Manage SSL/TLS certificates, key rotation, trust stores, and federation metadata. Provide 3rd-line support, root-cause analysis, and incident resolution for authentication and federation issues. Support change, release, and incident processes aligned to ITIL practices. Ensure solutions ...

GCP/AWS Platform Engineer - start-up experience

Hiring Organisation
Onsera Health
Location
Greater London, England, United Kingdom
Challenge Cardiometabolic conditions will impact 80-90% of people throughout their lifetimes, representing the leading cause of death globally and an important risk factor for a variety of neurodegenerative diseases and cancer. Sustainably addressing these diseases requires breakthrough therapeutics, AI/ML innovation, and transformative business models that translate … infrastructure – Deploy and manage generative AI toolkits, model serving endpoints and data governance for AI workloads Drive reliability and operations – Support incident response and root-cause analysis; improve observability through logging, metrics, and tracing; contribute to on-call processes proportionate to company stage Continuously improve contributor experience ...

Infrastructure Engineer (Virtualisation)

Hiring Organisation
KBC Technologies UK LTD
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
capacity planning, and hardware refreshes. Ensure high availability, resilience, and disaster recovery readiness. Automate infrastructure provisioning and standardise virtualisation builds. Troubleshoot complex incidents, perform root cause analysis, and implement preventive actions. Maintain compliance with security standards and regulatory requirements. Collaborate with cross-functional teams across storage, networking ...

IT Applications Analyst

Hiring Organisation
Larbey Evans
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 per annum
health checks and proactively remediate issues Respond to and resolve incidents and service requests in line with SLAs Provide break/fix troubleshooting and root cause analysis across supported systems Collaborate with infrastructure teams to support system scalability and optimization Facilitate alignment between delivery teams and Information ...

DevOps Engineer - Contract - Inside IR35

Hiring Organisation
Morgan McKinley
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
Salary negotiable
/CD pipelines for feature-level deployments Ensure reliable and repeatable deployment processes Support & Incident Management Investigate and triage platform issues and failures Perform root cause analysis and coordinate with relevant teams Unblock infrastructure-related issues impacting delivery Monitoring & Reliability Support monitoring and alerting for platform services ...