1 to 25 of 45 Permanent Root Cause Analysis Jobs in London

Reliability Engineer

Hiring Organisation
City Elite Transaction Services Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£100,000 - £130,000 per annum
upgrading and patching Solace appliances and software brokers in production Ability to build and deploy Solace infrastructure from scratch (appliance setup, redundancy configurations) Log analysis expertise - reviewing Solace logs, identifying anomalies, and performing root cause analysis 24/7 production support experience in enterprise environments …/scripting skills (Bash/Python) Financial services background preferred The Role: Administer Solace appliances/brokers (on-prem & cloud) Production support, incident response, root cause analysis System monitoring, capacity planning, performance tuning WAN optimization for low-latency messaging Automation and documentation ...

Prisma Browser Deployment Specialist, Professional Services

Hiring Organisation
Palo Alto Networks
Location
London, England, United Kingdom
recommend, and resolve potential issues or areas for security posture improvement. Resolve Complex Escalations: Act as the primary technical escalation point, performing deep-dive root cause analysis and coordinating across Support, Engineering, and Product teams for timely resolution. Document and Transfer Knowledge: Create high-quality, customer-specific … customer environments, driving deployment to successful completion. Advanced analytical and troubleshooting skills with a methodical approach to identifying, diagnosing, and resolving technical issues (Root Cause Analysis). Exceptional verbal and written communication skills with a proven ability to convey complex security topics to diverse technical ...

3rd Line Support Engineer

Hiring Organisation
IMT Resourcing Solutions
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £50,000 per annum
escalation for the service desk while working closely with the infrastructure and networking teams. You’ll take ownership of high-impact issues, lead on root cause analysis, and play a key role in designing and improving client environments. This is a client-facing role, so clear communication … across Windows Server, Microsoft 365, Azure, and virtualised environments Performing advanced configuration and troubleshooting of firewalls, WAN/LAN, backups, and DR Carrying out root cause analysis and implementing long-term fixes Creating and maintaining high-quality technical documentation Mentoring 1st and 2nd Line engineers and raising ...

Insurance Application Lead

Hiring Organisation
Pioneer Search Ltd
Location
City, London, United Kingdom
Employment Type
Permanent
Salary
GBP 70,000 - 75,000 Annual
complex incidents, investigating issues at application, database, and integration level before engaging vendors. Use SQL to interrogate data, validate incidents, confirm fixes, and support root cause analysis. Coordinate incident, problem, and change activities across internal teams and third-party suppliers, ensuring accountability through to resolution. Oversee application releases … fixes, working with vendors and delivery teams via Azure DevOps or similar tooling. Validate vendor outputs, challenge root cause analysis where required, and ensure corrective actions are completed. Maintain service readiness for audits, DR testing, and regulatory requirements. Apply ITIL-aligned practices pragmatically to ensure service stability ...

Solace Administrator

Hiring Organisation
BGC Group
Location
City of London, London, United Kingdom
Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace ...

Senior DevOps Engineer

Hiring Organisation
Reed Technology
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£70000 - £80000/annum
accelerate development velocity. Monitor system reliability, performance, and security across environments. Implement robust observability tools including logs, metrics, traces, and alerts. Lead incident response, root-cause analysis, and long-term remediation. Ensure security best practices are embedded across infrastructure and pipelines. Collaborate closely with the wider team ...

Site Reliability Engineer

Hiring Organisation
Profile 29
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
processes to improve reliability and reduce manual toil Support development teams with environment configuration, secrets management, and container orchestration Participate in incident management and root-cause analysis to continuously improve system resilience Contribute to internal platform frameworks, templates, and automation accelerators Essential Experience in SRE, DevOps ...

Cyber Security & Centralised Services Manager

Hiring Organisation
MFK Recruitment
Location
SE1, Southwark, Greater London, United Kingdom
Employment Type
Permanent
Salary
£55000 - £65000/annum
networks, in alignment with best practices and frameworks such as ISO27001, NIST, and Cyber Essentials Plus. Lead and coordinate incident response efforts, including root cause analysis, threat containment and post-incident reporting for clients. Collaborate with the Project and Service Desk teams to embed security into deployments … junior engineers and Service Desk staff, fostering a culture of security awareness and proactive threat management. Perform ongoing threat intelligence monitoring and security trend analysis to anticipate risks and protect client environments. Support clients in security reporting, compliance reviews, and continuous improvement initiatives, helping them meet regulatory and industry ...

Technical Project Engineer

Hiring Organisation
GreatFind Recruitment
Location
Finchley, London, England, United Kingdom
Employment Type
Full-Time
Salary
£50,000 - £70,000 per annum
migrations • Networking fundamentals including routing, VPNs, switching, and hardware deployments • Backup technologies, disaster recovery planning, and bare metal recovery • Strong troubleshooting and root cause analysis Desirable • AWS or Google Cloud experience • VDI, AVD, or Citrix remote desktop technologies • ISCSI, SAN or NAS storage • PowerShell scripting • Network certifications ...

Principal Data Engineer FTC

Hiring Organisation
Chelsea and Westminster Hospital NHS Foundation Trust
Location
London, SW10 0XD, United Kingdom
Salary
£64156.00 to £71148.00
departments, taking responsibility for designing, building, maintaining and optimising data infrastructures, creating pipelines which collate data from multiple sources and making it available for analysis by other stakeholders Main duties of the job o To contribute towards continual improvements to the Trust's Azure based Data engineering environment … timely and accurate manner. o To ensure the continuation of the provision of high quality data to be used for reporting and analysis by the departments involved. o To support with evaluating key business intelligence reports, combining detailed operational awareness with knowledge of data and systems. o To lead ...

IT Service Desk Analyst

Hiring Organisation
Artis Recruitment
Location
Farringdon Without, Greater London, United Kingdom
Employment Type
Permanent
Salary
£30000 - £32000/annum + Bonus and Excellent Benefits
Group Policy, SCCM, Intune. Experience with (or willingness to learn) legal or specialist applications such as iManage, BigHand, Aderant. Strong diagnostic, problem management, and root cause analysis skills. Organised, professional and results-driven with excellent attention to detail. Team-focused, self-motivated, and enthusiastic with a desire ...

L3 Compute & HPC Assoc Manager

Hiring Organisation
Accenture
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Python, Bash) to streamline operational tasks, monitoring, and reporting. •Document architecture, configurations, processes, and resolutions for compliance, knowledge transfer, and continuous improvement. Participate in root cause analysis (RCA) and post-incident reviews for compute or HPC-related incidents, implementing preventive measures as needed. Required Skills: •Expertise ...

IT Manager, Operations Manager, Digital

Hiring Organisation
Experis
Location
London, Filton, Gloucestershire, United Kingdom
Employment Type
Permanent
Salary
£60000 - £75000/annum Benefits
Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct root cause analysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience ...

Senior Infrastructure Engineer

Hiring Organisation
McCabe & Barton
Location
City of London, London, United Kingdom
Administer Active Directory, group policies, MFA, conditional access, and privileged accounts following best-practice security principles. Incident & Problem Management: Lead major incident resolution and root cause analysis, coordinating cross-functional teams and implementing preventative actions. Mobile & Remote Support: Support iOS devices via Intune, configure secure connectivity ...

Performance Tester

Hiring Organisation
scrumconnect ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 35,000 - 43,000 Annual
validate non-functional requirements (NFRs) such as response time, throughput, scalability, and stability. Ensure appropriate performance test coverage across different environments. Test Execution & Analysis Design, build, and execute load, stress, soak, and spike tests using tools such as Apache JMeter, K6, Gatling, Locust , or similar. Conduct API and microservices … metrics, and logs to identify bottlenecks and capacity issues. Defect Management & Collaboration Log, track, and manage performance defects, working closely with development teams on root-cause analysis . Support optimisation of application code, databases, and infrastructure configurations. Collaborate with DevOps teams to integrate performance testing into ...

L3 Storage & Backup Engineer, Associate Manager

Hiring Organisation
Accenture
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Oversee patch management, upgrades, and decommissioning of legacy storage and backup infrastructure. •Document architecture, configurations, procedures, and incident resolutions for compliance and knowledge transfer. •Root cause analysis for storage or backup-related incidents. Required Skills: •Proven technical expertise in enterprise storage technologies (e.g., Dell EMC, NetApp ...

Cloud Engineer x 2 Roles Available

Hiring Organisation
Police Digital Services
Location
Central London, London, United Kingdom
Employment Type
Permanent, Work From Home
stability, functionality, and integrity of Azure platforms Ensure that security, stability, and capacity are embedded in the development and deployment of services Perform root cause analysis to diagnose configuration and deployment issues Maintain regular, accurate updates on all incidents and escalate as necessary, ensuring all resolutions ...

DevOps Engineer

Hiring Organisation
Autonomai Recruitment
Location
City of London, London, United Kingdom
practices for Linux platforms powering low-latency, high-throughput trading workloads. Optimize, and tune Linux for performance, resilience, and minimal latency. Drive incident response, root cause analysis, and continuous reliability improvement across production systems. Oversee system automation and reproducibility—build, deploy, and fleet-manage bare-metal Linux ...

Site Reliability Engineer

Hiring Organisation
Autonomai Recruitment
Location
London Area, United Kingdom
Linux platforms powering low-latency, high-throughput trading workloads. Architect, optimize, and tune Linux for performance, resilience, and minimal latency. Drive incident response, root cause analysis, and continuous reliability improvement across production systems. Oversee system automation and reproducibility—build, deploy, and fleet-manage bare-metal Linux ...

Operational Resilience & Incident Manager

Hiring Organisation
Quix Recruitment Group
Location
City of London, London, United Kingdom
efforts for operational incidents, ensuring swift and effective resolution. Develop and maintain incident response plans aligned with industry best practices. Drive post-incident reviews, root cause analysis, and continuous improvement initiatives. Business Continuity & Governance Keep business continuity plans current, reflecting the evolving operational environment. Ensure compliance with ...

Network Engineer with Zscaler Experience

Hiring Organisation
Sanderson Recruitment
Location
London, United Kingdom
Employment Type
Permanent
managing Zscaler solutions. Collaborate with network, security, identity, and infrastructure teams to ensure seamless integration with enterprise systems and architectures. Troubleshoot complex issues , perform root-cause analysis, and provide advanced support for escalated incidents related to Zscaler services. Contribute to documentation, governance, compliance, and operational standards ...

System Test Engineer

Hiring Organisation
Parkside Office Professional
Location
Uxbridge, Middlesex, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £65,000 per annum, Inc benefits
Work closely with design and development teams to refine test strategies and resolve issues Manage compliance and specialist testing with external test laboratories Support root cause analysis, documentation, and knowledge sharing across teams What they are looking for 5+ years experience in system, integration or validation testing … across hardware and software Strong background in test documentation, debugging, failure analysis and problem solving Solid understanding of EMI/EMC and regulatory compliance testing (ETSI, FCC, EU Directives, etc.) Python experience for test automation, equipment control and data analysis Excellent communication skills and a structured engineering mindset ...

Senior BI Analyst

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
data pipelines that turn raw sources into usable insight Structuring datasets so teams across the company can self-serve effectively Supporting CX leaders with analysis, investigations, and clear recommendations Spotting opportunity areas and validating where changes will have the most impact Measuring the results of new initiatives and surfacing … what's working - and what isn't Running root-cause analysis when performance shifts unexpectedly Managing multiple stakeholders and switching context without losing momentum YOUR SKILLS AND EXPERIENCE: The ideal Senior BI Analyst will have the following skills and experience: Strong SQL and confidence working with ...

Service Manager

Hiring Organisation
Mastek
Location
Greater London, England, United Kingdom
response activities, ensuring rapid assessment and stakeholder updates. Engage with out-of-hours support teams, ensuring clear handovers, readiness and appropriate escalation routes. Support root cause identification and follow-up problem records to ensure issues are fully understood and preventive actions tracked. Change & Release Support Work with … monitoring and accurate service data/CMDB maintenance. Effective incident triage and coordination skills, including managing escalations, engaging out-of-hours teams, and driving root-cause analysis. Supplier and stakeholder coordination, with the ability to manage OOH support providers, ensure readiness, and deliver structured service reporting. Strong analytical ...

DevOps Engineer

Hiring Organisation
Code Wizards Group
Location
London, UK
Employment Type
Full-time
with an outage, working with internal teams and the customer to keep the game infrastructure online. DUTIES AND RESPONSIBILITIES Diagnosing and performing root cause analysis on customer infrastructure, including in incident scenarios Proactively identify and raise methods of improving team efficiency and procedures to improve support ...