1 to 25 of 411 Permanent Root Cause Analysis Jobs

Knowledge Engineer

Hiring Organisation
Thebes Group
Location
London Area, United Kingdom
diagnostic layer between agent outputs and the knowledge layer: when an agent produces an incorrect or incomplete output, you identify whether the root cause sits in the knowledge structure and fix it at source. This is a technically precise, high-accountability role. The accuracy of agent outputs across … layer. Where the problem is structural, fix it at source. Technical Qualifications RAG architecture understanding GraphRAG Semantic retrieval principles Knowledge grounding Agent output evaluation Root cause analysis within knowledge structures Key Deliverables Regular structured review of agent outputs against expected knowledge Documented root cause analysis ...

MOD DV Cleared Senior Software Engineer

Hiring Organisation
Data Careers
Location
Fareham, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£90,000
will work on technically challenging codebases where reliability, maintainability and engineering quality are critical. This will include: Investigating complex defects and incidents Performing root cause analysis Implementing durable fixes Refactoring and improving existing software Supporting architecture changes as requirements evolve Helping modernise codebases, tooling and engineering standards … technical leadership. Key Responsibilities Lead hands-on software maintenance, enhancement and upgrade work across complex codebases Investigate defects, incidents and technical issues, carrying out root cause analysis and implementing robust fixes Refactor and improve existing software to increase maintainability, reliability and performance Support architecture adaptation and evolution ...

Senior Software Engineer

Hiring Organisation
Data Careers
Location
Fareham, Hampshire, South East, United Kingdom
Employment Type
Permanent
Salary
£90,000
will work on technically challenging codebases where reliability, maintainability and engineering quality are critical. This will include: Investigating complex defects and incidents Performing root cause analysis Implementing durable fixes Refactoring and improving existing software Supporting architecture changes as requirements evolve Helping modernise codebases, tooling and engineering standards … technical leadership. Key Responsibilities Lead hands-on software maintenance, enhancement and upgrade work across complex codebases Investigate defects, incidents and technical issues, carrying out root cause analysis and implementing robust fixes Refactor and improve existing software to increase maintainability, reliability and performance Support architecture adaptation and evolution ...

EMEA Moderation Quality Assurance Specialist

Hiring Organisation
TikTok Shop
Location
London Area, United Kingdom
quality of BPO sites, provide indepth RCA for critical issues as well as the implementation of effective action plans . Roles & Responsibilities Content Review & Analysis • Conduct comprehensive review and analysis of sellers, products, content and Intellectual Property Rights (IPR) to ensure compliance with platform policies • Perform daily quality … assessments of moderation decisions across multiple content categories • Identify patterns and trends in content violations through systematic data analysis • Review and evaluate the accuracy of content moderation decisions made by BPO teams or machine moderation Quality Assurance & Process Improvement • Help build quality assessment processes and conduct quality assessment work ...

RCA Analyst

Hiring Organisation
Hays Specialist Recruitment Limited
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
Hays Recruitment | My contact information is available on my LinkedIn profile For this role, you must be able to demonstrate: Hands-on ownership of Root Cause Analysis for major incidents (P1/P2) Experience applying structured RCA methodologies (5 Whys, Fishbone, Fault Tree, Kepner-Tregoe) Ability … process, and organisational causes of failure Experience working within Incident, Major Incident, and Problem Management functions Confidence to challenge engineering teams and vendors on root causes and corrective actions Experience producing high-quality RCA reports (timeline, impact, contributing factors, actions)This role is not suitable for candidates whose experience ...

RCA Analyst

Hiring Organisation
Hays
Location
City of London, London, United Kingdom
Employment Type
Permanent
Hays Recruitment | My contact information is available on my LinkedIn profile For this role, you must be able to demonstrate: Hands-on ownership of Root Cause Analysis for major incidents (P1/P2) Experience applying structured RCA methodologies (5 Whys, Fishbone, Fault Tree, Kepner-Tregoe) Ability … process, and organisational causes of failure Experience working within Incident, Major Incident, and Problem Management functions Confidence to challenge engineering teams and vendors on root causes and corrective actions Experience producing high-quality RCA reports (timeline, impact, contributing factors, actions) This role is not suitable for candidates whose experience ...

Google Workspace Engineer

Hiring Organisation
Vaco LLC
Location
Addison, Texas, United States
Employment Type
Permanent
Salary
USD Annual
/Operational Efficiency Tier 3 Support/Escalations - Providing Advanced Support to Internal IT/Operations Teams for Escalated Google Workspace/Gmail Issues Root Cause Analysis/Remediation - Performing Root Cause Analysis/Implementing Permanent Fixes for Recurring or High-Impact Incidents Security … Authentication/Authorization (SSO/OAuth/SAML)/Executing Bulk Operations/Implementing Security/Compliance Hardening Advanced Remediation/Audit/Log Analysis/Configuration Reviews Supporting Incident Ownership/Permanent Fix Resolution Gmail API/REST API (hands-on expertise) - Leveraging Gmail API to Programmatically Investigate ...

Data Engineer (SC Cleared)

Hiring Organisation
scrumconnect ltd
Location
City, Newcastle Upon Tyne, United Kingdom
Employment Type
Any
Salary
GBP Annual
services for storage, compute, and analytics, you will help deliver reliable, well-governed data assets to downstream users. You will apply strong data analysis skills to identify root causes of data issues, work with dimensional data models and slowly changing dimensions, and implement infrastructure as code using Terraform. … distributed cloud infrastructure. Workflow orchestration Configure and manage Apache Airflow DAGs for task orchestration, ensuring reliable scheduling, monitoring, and execution of data processing workflows. Root cause analysis Perform data analysis to identify and resolve root causes of pipeline failures and data quality issues - including reviewing ...

Senior / Lead .NET Software Engineer / SRE

Hiring Organisation
Vaco LLC
Location
Arlington, Texas, United States
Employment Type
Permanent
Salary
USD 160,000 Annual
GITHub Copilot context files to embed SRE practices (security/performance checks) early in development cycles, debugging/refactoring code in production environments, performing root-cause analysis, and iterating post-deployment. Additionally, there's no traditional on-call production monitoring/support, whereas the focus will … Database Design/Optimization - Oracle/MS SQL Server/NoSQL (CosmosDB) SQL Scripting (hands-on) Designing/Evolving Database Schemas Performing Query Performance Analysis Indexing to Deliver Scalable/Performant Services Problem Solving/Collaboration - Driving Root Cause Analysis/Debugging for .NET Applications ...

Looking for Filenet Consultant Phoenix / Chandler, AZ (Onsite)

Hiring Organisation
TechnoGen Inc
Location
Chandler, Arizona, United States
Employment Type
Permanent
Salary
USD Annual
platform engineering capabilities are implemented, ensuring resilience, high performance, and high availability of the applications. Provide cross-functional and cross-organizational coordination on root cause analysis for any production issues. Maintain and Guide Automation, execution and undertake analysis of results to ensure that software meets … exceeds specified standards and/or client and technical requirements. Manage, coordinate, and communicate solutions, and root cause analysis. Support portfolio of applications across technologies. Provide technical consultation and support in the development of end-to-end automated software solutions capable of interfacing and integrating with in-house ...

Principal Vulnerability Engineer

Hiring Organisation
Unity Systems
Location
United Kingdom
.You will conduct original 0-day and n-day vulnerability research while building scalable, AI-powered tooling that automates vulnerability discovery, exploit validation, patch analysis, and detection engineering. Working at the intersection of offensive security, reverse engineering, software engineering, and applied AI, you will help organizations identify and eliminate … Researc hConduct original 0-day and n-day vulnerability research across enterprise technologies, cloud services, applications, appliances, firmware, and operating systems .Perform patch diffing, root-cause analysis, reverse engineering, and exploit development against both source-available and binary-only targets .Discover and validate critical vulnerabilities including remote ...

Continuity Manager

Hiring Organisation
Dunhill Professional Search
Location
United States
Employment Type
Permanent
Salary
USD 752,000 Annual
stakeholders, including plan maturity, test results, readiness assessments, and incident or recovery outcomes. Lead or support investigations following incidents or disruptions; facilitate ITIL-aligned root cause analysis and guide implementation of corrective and preventive actions. Serve as a subject matter expert in resilience, continuity, and recovery best … third-party providers to incorporate continuity and recovery requirements into service design and operations. Business Continuity Planning (BCP) & IT Disaster Recovery (ITDR) Business Impact Analysis (BIA) & Risk Assessment Recovery Planning (RTO/RPO) & Continuity Plan Development Disaster Recovery Testing, Tabletop Exercises & Failover Validation Incident Management, Root Cause ...

Sr. Network Engineer

Hiring Organisation
Eclaro
Location
Melbourne, Florida, United States
Employment Type
Permanent
Salary
USD 70 Annual
compliance with cabling standards, optimized rack layouts, and industry best practices for reliability and scalability. Diagnose and resolve complex hardware and software issues, driving root cause analysis, and corrective action to minimize downtime. Operations & Maintenance: Oversee global infrastructure, leveraging monitoring systems to detect and remediate issues before … they impact customers Provide 24/7 support for critical network events and emergencies, leading root cause analysis and restoration efforts Maintain accurate documentation, network diagrams, and performance logs to ensure operational continuity Conduct after-hours maintenance, risk analysis, and infrastructure upgrades to minimize disruption during ...

Staff Cloud Engineer (IaC/Terraform/Python)

Hiring Organisation
STAND 8
Location
Tempe, Arizona, United States
Employment Type
Permanent
Salary
USD Annual
detection mechanisms Monitor infrastructure performance and cost efficiency, implementing optimizations and creating dashboards and alerts Troubleshoot and resolve cloud infrastructure and application issues, performing root cause analysis Lead post-incident reviews and implement corrective actions Provide thought leadership on cloud technologies, infrastructure as code, and industry best … infrastructure - building and maintaining IT systems that are scalable, reliable, and secure Mastery of AWS Well Architected Framework principles Expert with troubleshooting and root cause analysis Ability to analyze systems with a high degree of detail and impact awareness Hands-on experience with at least 2 programming ...

End User Computing Engineer JBLE1 NI

Hiring Organisation
MCS Group
Location
Belfast, UK
tooling. Champion innovation initiatives, including emerging technologies and AI-driven improvements across the support function. Lead investigations into recurring or high-impact issues, performing root cause analysis and implementing long-term solutions. Support technology projects, office moves, deployments, and regional or global rollouts, ensuring successful delivery … technical concepts for non-technical audiences. Understanding of IT Security, Risk, Compliance, and Business Continuity requirements. Excellent troubleshooting and analytical problem-solving skills, including root cause analysis methodologies. Desirable Skills Experience within financial services, investment banking, trading, or other highly regulated sectors. Exposure to supporting front-office ...

ICT Senior Networking & Security Engineer

Hiring Organisation
Great Ormond Street Hospital for Children NHS Foundation Trust
Location
London, WC1N 3HZ, United Kingdom
Salary
£58133.00 to £65261.00
written communications skills in English and is highly articulate - very able to express technical ideas to a non -technical audience Able to undertake root cause analysis of simple to highly complex security issues Analytical and proven ICT based technical skills being very attentive to detail ensuring accuracy … operating systems and mobile devices (smartphones and tablets, etc.) Excellent practical knowledge of CISCO networks and equipment - able to interrogate and undertake root cause analysis to a highly complex level Excellent aptitude for sharing knowledge and skills with other ICT team members Good ability to re -prioritise ...

Power Systems Integration Engineer - Directed Energy - 27110

Hiring Organisation
HII Mission Technologies Division
Location
Syracuse, New York, United States
Employment Type
Permanent
Salary
USD Annual
power quality, transients, and thermal behavior. Conduct FAT, integration testing, fault insertion, and regression testing. Verify performance against electrical, thermal, and control requirements. Troubleshooting & Root Cause Analysis Diagnose hardware, wiring, firmware, and controls issues during bring up and testing. Perform structured root cause analysis … Follow and enforce high voltage and high energy electrical safety procedures. Support compliance with applicable military, industrial, and laboratory safety standards. Participate in hazard analysis and risk mitigation activities. What we are looking for (minimum requirements) 9 years relevant experience with Bachelors in related field; 7 years relevant experience ...

Performance Insights & Reporting Manager (Digital Operations) - Contract Inside IR35

Hiring Organisation
Bodhi
Location
Chertsey, England, United Kingdom
KPIs and performance metrics. Maintain metric definitions, calculation methodologies, aggregation rules, and data governance documentation. Drive consistency across markets and operational teams. Operational Performance Analysis Analyse delivery performance including SLA adherence, throughput, backlog trends, ticket ageing, and operational efficiency. Conduct root-cause analysis and identify performance … experience in Performance Reporting, Analytics, Business Intelligence, or Operational Insights. Strong analytical capability with experience interpreting operational datasets and performance metrics. Proven experience conducting root-cause analysis and translating findings into actionable recommendations. Ability to convert business requirements into structured reporting and automation specifications. Experience validating analytical ...

Trainee QA Engineer

Hiring Organisation
Hudson Shribman
Location
South East, United Kingdom
Employment Type
Permanent
Salary
£28,000
career in quality engineering within an engineering or manufacturing environment.As a trainee Quality Engineer you will gain hands-on experience in inspection, quality systems, root cause analysis, and continuous improvement while working towards becoming a fully competent Quality Engineer. Working across areas like automotive and aerospace, this … Measuring Machines (CMM) (training provided) - Record inspection data and ensure traceability Non-Conformance & Problem Solving - Assist in the investigation of non-conforming products - Support root cause analysis using tools such as: - Help implement corrective and preventive actions (CAPA) Process Improvement - Support continuous improvement initiatives across manufacturing processes ...

Trainee QA Engineer

Hiring Organisation
Hudson Shribman
Location
United Kingdom
Employment Type
Permanent
Salary
GBP 26,000 - 28,000 Annual
quality engineering within an engineering or manufacturing environment. As a trainee Quality Engineer you will gain hands-on experience in inspection, quality systems, root cause analysis, and continuous improvement while working towards becoming a fully competent Quality Engineer. Working across areas like automotive and aerospace, this … Measuring Machines (CMM) (training provided) Record inspection data and ensure traceability Non-Conformance & Problem Solving Assist in the investigation of non-conforming products Support root cause analysis using tools such as: Help implement corrective and preventive actions (CAPA) Process Improvement Support continuous improvement initiatives across manufacturing processes ...

DevOps Technical Lead

Hiring Organisation
Data Careers
Location
South East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Implement progressive delivery practices Reliability & Observability Define and track SLIs/SLOs Enhance monitoring, alerting and incident response processes Lead post-incident reviews and root cause analysis Drive reduction of operational toil Security & Compliance Embed DevSecOps controls into pipelines Implement least-privilege IAM models Support … tooling experience (GitHub Actions, GitLab CI, Jenkins) Experience operating production SaaS environments Strong observability tooling knowledge (Datadog, Prometheus, ELK etc.) Incident management and root cause analysis experience Experience in regulated or security-conscious environments is highly desirable ...

IT Application Delivery Analyst

Hiring Organisation
Robert Walters
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
timely and efficient manner Perform routine system checks and monitoring to ensure optimal application performance and security Identify, troubleshoot, and resolve technical issues through root cause analysis Assist with application testing, deployment, and release activities Collaborate with IT teams to develop and implement new software applications … experience working within a legal firm environment Proven experience supporting legal technology applications Strong hands-on experience with: iManage Intapp Excellent analytical, troubleshooting, and root cause analysis skills Experience using ServiceNow or similar ITSM platforms Basic PowerShell and SQL scripting experience for automation and support tasks Experience ...

Lead, Site Reliability Engineer

Hiring Organisation
Mastercard
Location
O Fallon, Missouri, United States
Employment Type
Permanent
Salary
USD Annual
teams to influence standards, roadmaps, troubleshooting approaches, and end-to-end system design -Act as a senior escalation point for complex incidents, lead root cause analysis, and mentor engineers through shared documentation, runbooks, and best practices All About You -Advanced expertise in Site Reliability Engineering, platform engineering … service level objectives -Hands-on experience with DevOps practices, including CI/CD pipelines, automation, and container-based deployments, along with strong troubleshooting and root cause analysis skills -Recognized as a technical expert who works independently on complex problems, influences outcomes across teams, and mentors others through ...

Software Engineer II - AI and Observability

Hiring Organisation
Disney Entertainment and ESPN Product & Technology Careers
Location
New York, United States
Employment Type
Permanent
Salary
USD 157,500 Annual
streaming ecosystem. You will develop agentic systems, machine learning models, and real-time pipelines that transform telemetry, logs, and user signals into automated detection, root cause analysis, and proactive insights. In this role, you will contribute to the development of autonomous agents capable of reasoning over complex … health and reliability Develop end-to-end data and decisioning pipelines that transform telemetry, logs, and user signals into actionable insights, automated detection, and root cause analysis Create and deploy scalable APIs and services that deliver predictive signals, explainability, and insights to engineering teams, operational tools ...

Software Engineer II - AI and Observability

Hiring Organisation
Disney Entertainment and ESPN Product & Technology Careers
Location
Seattle, Washington, United States
Employment Type
Permanent
Salary
USD 157,500 Annual
streaming ecosystem. You will develop agentic systems, machine learning models, and real-time pipelines that transform telemetry, logs, and user signals into automated detection, root cause analysis, and proactive insights. In this role, you will contribute to the development of autonomous agents capable of reasoning over complex … health and reliability Develop end-to-end data and decisioning pipelines that transform telemetry, logs, and user signals into actionable insights, automated detection, and root cause analysis Create and deploy scalable APIs and services that deliver predictive signals, explainability, and insights to engineering teams, operational tools ...