1 to 25 of 165 Remote/Hybrid Root Cause Analysis Jobs in the UK

Telecoms OSS Systems Application Engineer

Hiring Organisation
MECS Communications Ltd
Location
Newbury, Berkshire, South East, United Kingdom
Employment Type
Contract, Work From Home
Operational Support Systems * Cisco Networking * Nokia Transmission * Microwave Networks * Enterprise IP Networking * Network Operations * Incident Management * Change Management * Production Support * Application Support * System Monitoring * Root Cause Analysis Core Activity: * Support and maintain business-critical telecoms OSS platforms and production applications * Administer and support internally developed telecoms operational … data quality activities * Support operational users of network inventory, provisioning and fulfilment systems * Implement approved production changes in accordance with change management processes * Perform root cause analysis and implement preventative solutions * Produce technical documentation, support records and operational procedures Deliverables: * Stable and secure OSS and production environments ...

Site Reliability Engineer — AWS & Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
issues Leverage AI tooling – Use AI‐assisted development tools (e.g. GitHub Copilot) to accelerate infrastructure work, and explore AI‐driven approaches to incident detection, root cause analysis, and remediation What We're Looking For Essential 3+ years in an SRE, Platform, or DevOps engineering role AWS services … regulated environments or with compliance frameworks Experience with AI‐driven DevOps tooling (e.g. AWS DevOps Agent or similar AI agents for incident resolution, root cause analysis, and operational improvement) Experience with SLIs, SLOs, and error budgets On‐Call We have a 24/7 customer support team ...

Senior Site Reliability Engineer

Hiring Organisation
17918
Location
United Kingdom
simulate outages and improve fault tolerance Incident Management Act as the primary point of escalation for critical production issues and lead major incident response, root cause analysis, and postmortems. Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence. Implement … dashboards to visualize system health and reliability metrics. Configure intelligent alerting based on anomaly detection and thresholds. Combine metrics, logs, and traces to enable root cause analysis and reduce Mean Time to Resolution (MTTR). Knowledge of AIOps or ML-based anomaly detection for proactive reliability management. ...

Senior Site Reliability Engineer

Hiring Organisation
Experian Ltd
Location
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Employment Type
Permanent, Work From Home
simulate outages and improve fault tolerance Incident Management Act as the primary point of escalation for critical production issues and lead major incident response, root cause analysis, and postmortems. Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence. Implement … dashboards to visualize system health and reliability metrics. Configure intelligent alerting based on anomaly detection and thresholds. Combine metrics, logs, and traces to enable root cause analysis and reduce Mean Time to Resolution (MTTR). Knowledge of AIOps or ML-based anomaly detection for proactive reliability management. ...

Site Reliability Engineer

Hiring Organisation
Digital Gurus
Location
United Kingdom
Kubernetes. You will collaborate closely with developers, architects and platform teams to improve reliability, scalability, performance and operational resilience. You will support incident response, root cause analysis and blameless post-mortems, helping drive long-term improvements rather than short-term fixes. You will automate repetitive operational tasks … tools such as Dynatrace, Prometheus, Grafana, CloudWatch, OpenTelemetry, ELK or similar. Understanding of SLIs, SLOs, error budgets and golden signals. Experience supporting incident management, root cause analysis and post-incident improvement work. Automation experience using scripting or IaC tooling such as Terraform, Python, Bash, Ansible or similar. ...

Site Reliability Engineer

Hiring Organisation
Digital Gurus
Location
United Kingdom, UK
Kubernetes. You will collaborate closely with developers, architects and platform teams to improve reliability, scalability, performance and operational resilience. You will support incident response, root cause analysis and blameless post-mortems, helping drive long-term improvements rather than short-term fixes. You will automate repetitive operational tasks … tools such as Dynatrace, Prometheus, Grafana, CloudWatch, OpenTelemetry, ELK or similar. Understanding of SLIs, SLOs, error budgets and golden signals. Experience supporting incident management, root cause analysis and post-incident improvement work. Automation experience using scripting or IaC tooling such as Terraform, Python, Bash, Ansible or similar. ...

Senior 3rd Line Support Engineer

Hiring Organisation
ECS Resource Group Ltd
Location
Solihull, West Midlands, West Midlands (County), United Kingdom
Employment Type
Permanent
Salary
£37000 - £37500/annum
final escalation point for complex technical issues. You'll take ownership of incidents end-to-end, focusing on deep technical investigation, root cause analysis, and delivering long-term solutions, not just quick fixes. Working across modern on-prem, virtualised, and cloud environments, you'll support a wide … Line teams Owning and resolving complex incidents and problems through to completion Performing in-depth troubleshooting across infrastructure, cloud, and EUC environments Conducting root cause analysis and implementing permanent fixes Supporting users via tickets, phone, email, and remote sessions Managing and prioritising workload in line with SLAs ...

Customer Service Agent

Hiring Organisation
DNA Payments Group
Location
London, South East, England, United Kingdom
Employment Type
Part-Time
Salary
Salary negotiable
contact resolution wherever possible. Manage customer enquiries relating to billing, payment terminals, connectivity issues, software functionality, product logistics, stock orders, and terminal replacements. Conduct root cause analysis and escalate complex issues to relevant teams when required. Maintain accurate records of customer interactions, troubleshooting steps, and resolutions within … provided. Demonstrated excellent organisational and time-management skills in a fully remote working environment. Skill/Technical Experience with troubleshooting, fault diagnosis, and root cause analysis would be advantageous. Exposure to payment technology, telecommunications, software support, or a similar technical environment would be beneficial. Comfortable working across ...

Quality Assurance Specialist

Hiring Organisation
Seven Search & Selection
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
high-quality, functional ingredients that help brands meet evolving consumer needs. The role: Lead investigations into customer, supplier, and internal quality non-conformances, conducting root cause analysis and implementing effective corrective and preventative actions. Support operational compliance by preparing for BRC, customer, and third-party audits while … experience in Quality Assurance within food manufacturing, ingredient distribution, or a related FMCG environment. Strong working knowledge of BRCGS, HACCP, GMP, CAPA management, and root cause analysis techniques including FMEA, 5 Whys, and Fishbone. Proven experience conducting supplier audits, managing customer complaints, investigating non-conformances, and implementing ...

Human Resources Information System Specialist

Hiring Organisation
People's Partnership
Location
Crawley, England, United Kingdom
cost control, risk reduction, metrics reporting and continuous improvement capability. System Maintenance and Stability such as Day‐to‐day system health monitoring, Issue investigation, root cause analysis, and resolution and Payroll and WFM configuration checks to prevent errors. Data integrity and reporting Release Management and System Updates … across HR, Payroll and Workforce Management, with experience operating in a live payroll environment. Demonstrated experience in day‐to‐day system monitoring, issue investigation, root cause analysis and resolution. Strong background in data integrity, reconciliation and reporting, including investigating and resolving reporting discrepancies. Hands‐on experience building ...

Human Resources Information System Specialist

Hiring Organisation
People's Partnership
Location
Crawley, West Sussex, UK
cost control, risk reduction, metrics reporting and continuous improvement capability. System Maintenance and Stability such as Day‐to‐day system health monitoring, Issue investigation, root cause analysis, and resolution and Payroll and WFM configuration checks to prevent errors. Data integrity and reporting Release Management and System Updates … across HR, Payroll and Workforce Management, with experience operating in a live payroll environment. Demonstrated experience in day‐to‐day system monitoring, issue investigation, root cause analysis and resolution. Strong background in data integrity, reconciliation and reporting, including investigating and resolving reporting discrepancies. Hands‐on experience building ...

Workday Data Architect (HR Master Data Quality) - Birmingham

Hiring Organisation
Harvey Nash IT Recruitment UK
Location
City, London, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
Workday, Legacy HR systems, finance platforms, and local country data sources. A key focus of this role will be leading HR master data reconciliation, root cause analysis, and data quality remediation, ensuring audit findings are translated into clear, trackable actions for country-level data stewards. You will … Reconciliation - Extensive experience analysing HR master data across Workday, Legacy HR systems, finance systems, and country-level sources to identify mismatches, duplicates, and inconsistencies. Root Cause Analysis - Proven ability to investigate recurring data issues and define sustainable corrective actions across global datasets. Data Quality Frameworks - Experience defining ...

Workday Data Architect (HR Master Data Quality) - Birmingham

Hiring Organisation
Harvey Nash
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£0 per annum
Workday, legacy HR systems, finance platforms, and local country data sources. A key focus of this role will be leading HR master data reconciliation, root cause analysis, and data quality remediation, ensuring audit findings are translated into clear, trackable actions for country-level data stewards. You will … Reconciliation - Extensive experience analysing HR master data across Workday, legacy HR systems, finance systems, and country-level sources to identify mismatches, duplicates, and inconsistencies. Root Cause Analysis - Proven ability to investigate recurring data issues and define sustainable corrective actions across global datasets. Data Quality Frameworks - Experience defining ...

Data Quality Lead (Pensions)

Hiring Organisation
Local Pensions Partnership
Location
Preston, Lancashire, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
objectives. - Embed an effective assurance regime, ensuring processes, controls and system design support high-quality pensions administration. - Work with Planning and Insights to understand root causes and the impact of data quality issues. - Ensure work is allocated effectively, and cases are processed and prioritised appropriately. - Oversee data readiness … onboarding and data submissions, including oversight of TUPE transfers, employer cessations and new scheme employers. - Produce and present management information, with a focus on root cause and trend analysis against service levels and key metrics. - Support change projects, testing and audit activity, while reviewing processes to maximise ...

Software Engineer - Test Equipment Software

Hiring Organisation
MBDA UK
Location
Manchester, United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 Annual
hardware, firmware, configuration, interface and integration issues affecting test capability Working with engineers and operators to understand symptoms, reproduce issues, gather evidence and support root-cause analysis Supporting software updates, maintenance activities and controlled changes to Test Equipment software Improving documentation, support notes, runbooks and knowledge capture … hardware, equipment, firmware, operating systems or real-world test environments Strong analytical and problem-solving ability, including the ability to investigate issues where the root cause may not initially be clear A willingness to support existing systems as well as develop new or improved software solutions A methodical ...

Software Engineer — Test Equipment Software

Hiring Organisation
MBDA UK
Location
Bolton, Greater Manchester, United Kingdom
Employment Type
Permanent
Salary
£40000/annum
hardware, firmware, configuration, interface and integration issues affecting test capability Working with engineers and operators to understand symptoms, reproduce issues, gather evidence and support root-cause analysis Supporting software updates, maintenance activities and controlled changes to Test Equipment software Improving documentation, support notes, runbooks and knowledge capture … hardware, equipment, firmware, operating systems or real-world test environments Strong analytical and problem-solving ability, including the ability to investigate issues where the root cause may not initially be clear A willingness to support existing systems as well as develop new or improved software solutions A methodical ...

Senior Cloud Ops Engineer

Hiring Organisation
ARM
Location
Worthing, West Sussex, United Kingdom
Employment Type
Permanent
Salary
£60000 - £70000/annum
cloud resources using DevOps pipelines and scripts (PowerShell, Azure CLI, CI/CD etc.). Monitor system health, respond to incidents, and participate in root cause analysis and continuous improvement. Security, Governance & Compliance Enforce cloud security best practices, including role-based access control (RBAC), encryption, and secure ...

Infrastructure Support Specialist - on site Aldermaston

Hiring Organisation
DXC
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent, Work From Home
apply strong analytical skills to diagnose complex infrastructure issues across interconnected systems. The position also demands a proactive mindset, anticipating potential failures, contributing to root cause analysis, and driving improvements to prevent recurring issues. Your ability to document infrastructure designs, create operational guides, and support rigorous ITSM practices ...

SRE DevOps Engineer

Hiring Organisation
WTW
Location
Surrey, United Kingdom
Employment Type
Full Time
observability platforms such as Datadog Proactively monitor production and other environments to ensure stability, availability, security and integrity Participate in incident response, troubleshooting, and root cause analysis to mitigate and prevent future issues Work closely with engineering, support and operations teams to upskill and promote knowledge transfer ...

Platform Engineer

Hiring Organisation
Ascent Resourcing Limited
Location
Birmingham, West Midlands, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £65,000 per annum
monitoring, resilience improvements, and operational best practices. Implement and uphold infrastructure security controls, compliance requirements, and governance standards. Support platform troubleshooting, incident management, and root cause analysis to maintain service stability Develop and improve infrastructure-as-code practices and automation capabilities. Create, maintain, and enhance platform documentation ...

Senior DBA

Hiring Organisation
Morson Edge
Location
Manchester, North West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£75,000
Collaborate with engineering teams to improve observability, resilience and operational efficiency Provide technical guidance and mentorship to junior team members Participate in incident management, root cause analysis and continuous improvement activities Contribute to database architecture decisions and the evaluation of new technologies Essential Skills & Experiences Strong experience ...

Data Engineer

Hiring Organisation
Reed Technology
Location
Guildford, Surrey, United Kingdom
Employment Type
Permanent
Salary
£35000 - £60000/annum
deliver impactful data products. Serve as a technical leader, mentoring junior data engineers, driving best practices, and enhancing platform performance. Conduct monitoring, troubleshooting, root cause analysis, and performance optimisation of legacy data issues. Required Skills & Qualifications: Strong experience in delivering cloud data solutions, preferably in Microsoft Fabric ...

Software Engineer X3 (Back end)

Hiring Organisation
Pontoon
Location
Warwickshire, United Kingdom
Employment Type
Contract
Contract Rate
GBP Annual
premises platforms. Develop automation for deployment, configuration management, and system provisioning. Monitor, analyse, troubleshoot, and optimise system performance, availability, and reliability. Perform root cause analysis and implement long-term solutions to improve platform resilience. Actively participate in Agile ceremonies, including sprint planning, daily stand-ups, backlog refinement ...

Software Engineer X3 (Back end)

Hiring Organisation
Pontoon
Location
Warwickshire, United Kingdom
Employment Type
Contract
premises platforms. Develop automation for deployment, configuration management, and system provisioning. Monitor, analyse, troubleshoot, and optimise system performance, availability, and reliability. Perform root cause analysis and implement long-term solutions to improve platform resilience. Actively participate in Agile ceremonies, including sprint planning, daily stand-ups, backlog refinement ...

Senior DevOps Engineer

Hiring Organisation
WTW
Location
Surrey, United Kingdom
Employment Type
Full Time
observability, and performance of the platform. Implement SLOs, alerting, dashboards, and auto remediation where possible. Troubleshoot cluster level, networking, and workload deployment issues. Lead root cause analysis (RCA) and drive long-term reliability improvements. Automation & Tooling Develop automation scripts and tooling in PowerShell. Build and optimize Azure ...