Root Cause Analysis Jobs in the UK

26 to 50 of 746 Root Cause Analysis Jobs in the UK

Software Engineer

Sheffield, South Yorkshire, United Kingdom
Hybrid / WFH Options
Experis
and GCP , ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor More ❯
Employment Type: Contract
Rate: £395 - £430/day
Posted:

Information Systems Developer

Colchester, United Kingdom
Provide CIC
partners. Ensure Code Quality: Uphold best practices in version control, documentation, and peer review to maintain high standards. Troubleshoot and Improve Data Quality: Resolve complex data issues by identifying root causes and implementing lasting improvements. Develop Integrations and Applications: Create and maintain back-end APIs, automate data ingestion, and build front-end tools (Astro, React, or low-code) for … reviews, providing constructive feedback to colleagues and championing improvements in style, performance, and security Investigate and Improve Data Quality Proactively identify data anomalies, inconsistencies, and integrity issues by: Conducting root-cause analysis on recurring data errors or mismatches Reviewing source data feeds and transformation logic to pinpoint upstream issues Recommend and implement fixes and improvements to existing … as a Technical Escalation Point Act as tier 3 support when the service desk or level-2 teams cannot resolve incidents due to complexity: Triage incoming support tickets, identify root causes, and propose corrective actions Provide on-call availability for critical outages affecting data pipelines, integrations, or production environments Document incident resolutions to streamline future troubleshooting efforts Plan, Specify More ❯
Employment Type: Permanent
Salary: £46148.00 - £52809.00 a year
Posted:

Cloud Engineer

Watford, Hertfordshire, UK
Akkodis
cloud subject matter expert, providing AWS best practice guidance to internal teams and project stakeholders. Investigate and resolve AWS infrastructure-related incidents, ensuring minimal downtime and impact. Participate in root cause analysis and implement preventative measures. Maintain clear, detailed documentation for AWS environments, architecture diagrams, SOPs, and runbooks. Continuously look for opportunities to improve cloud architecture, security More ❯
Employment Type: Full-time
Posted:

Cloud Engineer

Stevenage, England, United Kingdom
Akkodis
cloud subject matter expert, providing AWS best practice guidance to internal teams and project stakeholders. Investigate and resolve AWS infrastructure-related incidents, ensuring minimal downtime and impact. Participate in root cause analysis and implement preventative measures. Maintain clear, detailed documentation for AWS environments, architecture diagrams, SOPs, and runbooks. Continuously look for opportunities to improve cloud architecture, security More ❯
Posted:

Senior Director - Operations and Reliability Engineering

London, United Kingdom
The Boston Consulting Group GmbH
IT Service Management (ITSM) processes across all teams, ensuring standardized, efficient, and effective service delivery. EstablishSRE-based operational metrics, includingSLOs, SLIs, and error budgets. Overseeincident response, problem resolution, and root cause analysis with AI-driven remediation. Ensurehigh availability, performance, and security compliancefor all enterprise services. Develop afollow-the-sun operational support model, ensuring24x7 resilience and uptime across More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Application Support Analyst

London, United Kingdom
TEKsystems, Inc
Description We are seeking a knowledgeable Application Support Analyst to liaise with vendors, business users and product teams to perform installations, identify route cause and deliver fixnhancements. The candidate would ideally also have knowledge in commodity trading and will be delivery focused. Knowledgeable in working with Agile (SCRUM) development and delivery teams is advantageous. The ideal candidate disposes of … ability to develop innovative solutions to technical problems whilst working within the company's governance framework. Incident management skills leading & owning the issues from start to resolution leading to root cause analysis. Problem management skills in regular checks & follow ups on defects & bug arising from incidents. An adaptable attitude, with the ability to multi-task and respond quickly More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Stott and May
hands-on role supporting high-availability systems, rapid deployments, and production incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes … Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with Splunk - Strong problem-solving and analytical thinking More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer - AWS

City of London, London, United Kingdom
Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
Cleared or Eligible for SC Clearance Your responsibilities: Deploy, configure, and monitor AWS services ensuring high availability, scalability, and security. Respond to and resolve infrastructure and service incidents with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using … configuration and deployment management experience with CI/CD Desirable skills Hands-on experience with Terraform or CloudFormation for infrastructure provisioning and automation. Strong knowledge of Splunk for log analysis and troubleshooting. Strong problem-solving skills and analytical thinking. More ❯
Posted:

DevOps Engineer - AWS

London Area, United Kingdom
Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
Cleared or Eligible for SC Clearance Your responsibilities: Deploy, configure, and monitor AWS services ensuring high availability, scalability, and security. Respond to and resolve infrastructure and service incidents with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using … configuration and deployment management experience with CI/CD Desirable skills Hands-on experience with Terraform or CloudFormation for infrastructure provisioning and automation. Strong knowledge of Splunk for log analysis and troubleshooting. Strong problem-solving skills and analytical thinking. More ❯
Posted:

DevOps Engineer - AWS

South East London, England, United Kingdom
Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
Cleared or Eligible for SC Clearance Your responsibilities: Deploy, configure, and monitor AWS services ensuring high availability, scalability, and security. Respond to and resolve infrastructure and service incidents with root cause analysis and preventive measures. Handle change requests, track recurring issues, and work on long-term fixes to improve system stability. Implement and maintain observability solutions using … configuration and deployment management experience with CI/CD Desirable skills Hands-on experience with Terraform or CloudFormation for infrastructure provisioning and automation. Strong knowledge of Splunk for log analysis and troubleshooting. Strong problem-solving skills and analytical thinking. More ❯
Posted:

Manual Tester (DV Security Clearance)

Basingstoke, Hampshire, South East
CGI
to-end tests on code commits and pull-requests. • Monitor pipeline health and test results; collaborate with DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause … testing activity efficiently. An ISTQB Foundation Certification is a strong asset and shows your commitment to professional testing standards. A key part of this role involves problem investigation and root cause analysis, so strong analytical and communication skills are a must. You'll enjoy working as part of a collaborative team, contributing your insights to improve outcomes More ❯
Employment Type: Permanent
Posted:

Site Reliability Engineer

Glasgow, United Kingdom
Planet DDS, Inc
to define implement and improve business performance SLO's. 2+ years of experience with Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and retrospective analysis. 2+ or more years in hands on technical roles (such as site reliability engineer, software engineer, DevOps engineer, infrastructure engineer). Experience … management. 24x7 Support: Perform deep dives into systemic and latent reliability issues, incident management, problem management. Participate in all aspects of incident management including awareness, communication, remediation, retrospective/root cause analysis. Identify and implement process improvements of MTTA (Mean Time to Acknowledge) and MTTR (Mean Time to Resolve). Support operations & engineering teams on Azure. AWS and … can talk about complex software systems and have ideas on how to build quality, performant, and easily supportable software most effectively You exhibit dogged determination to get to the root of problems You care about best-practices and evangelizing them with the team You like to research and propose new techniques and methodologies to improve quality and efficiency of More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Application Support Manager

Glasgow, Scotland, United Kingdom
Scottish Friendly
to agreed SLAs, maintaining high levels of customer satisfaction. Technical Expertise: Implement selected ITIL best practices to ensure that we develop a customer focussed service oriented. Problem Management: Conduct root cause analysis for recurring issues and implement solutions to prevent future occurrences. Stakeholder Management: Develop strong relationships through effective communication and engagement with internal and external stakeholders … for improvement. Compliance: Ensure all processes and procedures comply with regulatory requirements and company policies. Quality Assurance : Ensure that team deliverables are quality reviewed and work with Testing and Analysis functions to ensure releases are tested and fit for implementation Technical Proficiency : Be able to offer technical input into complex solutions while supported by team technical leads Disaster Recovery More ❯
Posted:

IT Analyst

Fareham, Hampshire, South East, United Kingdom
Matchtech Mobility
ITSM) processes including asset, change, incident, request, problem, and project management to meet service levels. Provide on-site IT support and assist in resolving broader technical issues. Contribute to root cause analysis and long-term problem management. Act as a key point of contact between IT and users, promoting standards, improving user satisfaction, and sharing best practices. More ❯
Employment Type: Contract
Rate: £20.79 - £27.62 per hour + PAYE
Posted:

Cyber Security Engineer

Liverpool, Merseyside, North West, United Kingdom
Hybrid / WFH Options
In Technology Group Limited
with IT and development teams to ensure secure system architecture and application development. Maintain and enhance incident response procedures and disaster recovery plans. Investigate and document security breaches, providing root cause analysis and remediation plans. Conduct security awareness training for staff and ensure compliance with internal policies and regulatory requirements (e.g., FCA, GDPR, ISO 27001). Stay More ❯
Employment Type: Permanent
Salary: £50,000
Posted:

Cyber Security Engineer

Bletchley, Buckinghamshire, United Kingdom
Hybrid / WFH Options
In Technology Group
with IT and development teams to ensure secure system architecture and application development. Maintain and enhance incident response procedures and disaster recovery plans. Investigate and document security breaches, providing root cause analysis and remediation plans. Conduct security awareness training for staff and ensure compliance with internal policies and regulatory requirements (e.g., FCA, GDPR, ISO 27001). Stay More ❯
Employment Type: Permanent
Salary: GBP 40,000 - 50,000 Annual
Posted:

Cyber Security Engineer

Milton Keynes, Buckinghamshire, South East, United Kingdom
Hybrid / WFH Options
In Technology Group Limited
with IT and development teams to ensure secure system architecture and application development. Maintain and enhance incident response procedures and disaster recovery plans. Investigate and document security breaches, providing root cause analysis and remediation plans. Conduct security awareness training for staff and ensure compliance with internal policies and regulatory requirements (e.g., FCA, GDPR, ISO 27001). Stay More ❯
Employment Type: Permanent
Salary: £50,000
Posted:

Senior Engineer- GCP, Long term Solution - Cloud Identity, London

London, United Kingdom
Photon
to-date with the latest advancements in identity management protocols and best practices. Contribute to the development and documentation of technical specifications and design decisions. Troubleshoot technical issues, conduct root cause analysis, and implement timely resolutions to minimize downtime. Qualifications: Bachelor's or Master's degree in Computer Science, Engineering, or related field. Minimum 5+ years of More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Third Line Product Support Technician

New Milton, Hampshire, United Kingdom
Hybrid / WFH Options
Appello
infrastructure and cloud services. Deep understanding of SIP, VoIP, VoLTE, STUN, and firewall bridging. Proficiency in Node.js application support and server diagnostics. Hands-on experience using tools for SIP analysis, such as Wireshark, SIP Traces, or packet analysers. Excellent problem-solving and communication skills. Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience … Azure Solutions Architect, or AWS equivalent. ITIL Foundation certification THE ROLE Key Responsibilities Advanced Technical Support Resolve complex hardware, software, and network issues escalated from lower-tier support. Conduct root cause analysis and implement long-term solutions. Manage high-impact incidents to ensure minimal business disruption. ️ Server & Application Support Troubleshoot server issues across cloud (AWS), on-premise More ❯
Employment Type: Permanent
Salary: GBP 37,000 Annual
Posted:

Global Talent Applications Support Manager

United Kingdom
Dentons
Talent application support services, ensuring timely and accurate issue resolution for SuccessFactors and associated Talent systems. Act as escalation point for complex system issues and ensure appropriate follow-up, root cause analysis, and long-term resolution. Maintain high standards of system reliability and data integrity in the live production environment. Define and monitor service level objectives (SLOs … changes go live. Strategic Planning and Roadmap Execution Contribute to the definition and delivery of the Talent technology roadmap in partnership with the Global Senior Manager, Talent Systems. Support analysis and prioritization of system enhancements, configuration changes, and new module adoption. Stay current with SAP SuccessFactors roadmap updates and industry trends. Data Quality and Reporting Support data governance initiatives More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Operations Support Analyst

London, United Kingdom
Hybrid / WFH Options
aaifire
checks to identify process defects Reporting Support the creation of routine reporting packs and dashboards for internal stakeholders, utilising and defining performance metrics - Service Level Agreements (SLAs) etc Conduct Analysis utilising tools such as Excel or PowerBI, to identify trends and opportunities for both system optimisation and improvement in operational performance Continuous Improvement - Operations process optimisation Proactively identify opportunities … generating and maintaining a knowledgeable Problem Solving Critically assess and collaboratively work alongside the function's operations team, managed service vendors and enterprise IT team to identify/support root cause analysis and remediation of issues, incidents and escalation. Bridge the gap by translating business requirements to the Tech team and vice versa Vendor Management Maintain a More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

United Kingdom
luupli
of infrastructure components. 2. Monitoring and Incident Management: - Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues. - Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents. 3. Reliability and Performance Optimization: - Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Production Engineer - Gaming Experienced Hire

Narberth, Pennsylvania, United States
Susquehanna International Group
AZ-104/AWS SOA-C02 or equivalent experience (1-3 years in cloud-based environments) Good understanding of networking concepts (comprehensive understanding of OSI Layers 2-7) Troubleshooting, root cause analysis and communication skills Understanding of system performance metrics and capacity planning Python or PowerShell scripting skills Previous experience in some of the following concepts & technologies More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Payments Product Owner

London Area, United Kingdom
Pontoon Solutions
and compliance requirements. • Act as the primary point of contact for internal business units (including Operations, Compliance & Transactional Banking), IT and external vendors, regarding service performance and enhancements. • Lead root cause analysis and resolution of major incidents. Drive problem management to reduce recurring issues and improve service stability. • Manage projects involving any future enhancements or regulatory changes More ❯
Posted:

Payments Product Owner

City of London, London, United Kingdom
Pontoon Solutions
and compliance requirements. • Act as the primary point of contact for internal business units (including Operations, Compliance & Transactional Banking), IT and external vendors, regarding service performance and enhancements. • Lead root cause analysis and resolution of major incidents. Drive problem management to reduce recurring issues and improve service stability. • Manage projects involving any future enhancements or regulatory changes More ❯
Posted:
Root Cause Analysis
10th Percentile
£27,375
25th Percentile
£41,250
Median
£52,000
75th Percentile
£67,500
90th Percentile
£83,750