Root Cause Analysis Job Vacancies

1 to 25 of 870 Root Cause Analysis Jobs

Senior SOC Analyst

Glasgow, United Kingdom
Applicable Limited
look at all the evidence available and support the client on the appropriate action to contain and remediate any security incident. They will need to be able to provide root cause analysis and liaise with the customer and the Service Delivery Manager, as well as ensuring the actions of the SOC Analysts follow best practice. Security Monitoring … Monitoring SIEM tools to assure a high level of security operations delivery function. Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incidents and escalate as necessary. Support other teams on investigations into incidents, determining the root cause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly. Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes …
Employment Type: Permanent
Salary: GBP Annual

Senior SOC Analyst L3

Birmingham, Staffordshire, United Kingdom
Applicable Limited
look at all the evidence available and support the client on the appropriate action to contain and remediate any security incident. They will need to be able to provide root cause analysis and liaise with the customer and the Service Delivery Manager, as well as ensuring the actions of the SOC Analysts follow best practice. Job Duties … Monitoring SIEM tools to assure a high level of security operations delivery function. Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incidents and escalate as necessary. Support other teams on investigations into incidents, determining the root cause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly. Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes …
Employment Type: Permanent
Salary: GBP Annual

Senior Cloud Engineer - Azure

England, United Kingdom
H2 Performance Consulting
access control (RBAC), and ensuring compliance with DoD standards. Assist in the automation of operational tasks using Infrastructure-as-Code tools like Terraform or Bicep. Participate in incident response, root cause analysis, and post-incident reviews to improve system reliability. Provide helpdesk support by taking ownership of tickets in the Remedy ticketing solution, resolving issues, and managing …
Employment Type: Permanent
Salary: GBP Annual

Cloud Engineer - Azure

England, United Kingdom
Falconwood, Inc
access control (RBAC), and ensuring compliance with DoD standards. Assist in the automation of operational tasks using Infrastructure-as-Code tools like Terraform or Bicep. Participate in incident response, root cause analysis, and post-incident reviews to improve system reliability. Provide helpdesk support by taking ownership of tickets in the Remedy ticketing solution, resolving issues, and managing …
Employment Type: Permanent
Salary: GBP Annual

Computer Systems Engineer II with Security Clearance

Falls Church, Virginia, United States
Hybrid / WFH Options
Epsilon Inc
teams to optimize data pipelines for AI/ML initiatives, automation, and productization Lead efforts to integrate security best practices, ensuring compliance with relevant regulations and standards Conduct performance analysis, capacity planning, and system tuning to maximize uptime and reliability Guide junior team members in troubleshooting techniques, documentation, and adherence to best practices Drive continuous improvement by reviewing existing … for secure system architecture Familiarity with data engineering concepts, including ETL/ELT pipelines, big data tools, and AI/ML workflows Ability to troubleshoot complex system issues, perform root-cause analysis, and implement effective solutions Excellent communication, teamwork, and organizational skills, with a focus on innovation and continuous improvement One or more of the following certifications …
Employment Type: Permanent
Salary: USD Annual

Application Support Manager

London, United Kingdom
Just Group plc
application support strategies Key Responsibilities: Own Application Support Lifecycle: Ensure end-to-end support for critical business applications, meeting SLAs and availability targets. Incident & Problem Management: Lead resolution and root cause analysis for all Retail application incidents, including major (P1/P2) issues. Escalation & Crisis Leadership: Act as the escalation point for major incidents and provide direction … containerization experience with Azure, Docker, and AKS. Familiarity with modern web technologies, including React, REST APIs, and SOAP architectures. Skilled in managing P1/P2 incidents, business impact analysis, root cause investigations, and change coordination. Strong grasp of IT service management practices; ITIL v4 certification or equivalent preferred. Proactive Monitoring: Hands-on experience with tools like …
Employment Type: Permanent
Salary: GBP Annual

Principal SRE Engineer

London, South East, England, United Kingdom
Robert Walters
to the overall success of the FX desk's technology platform. * Respond rapidly to production incidents using data-driven decision making to minimise downtime and financial impact while leading root cause analysis and conducting blameless post-mortems. * Enhance application health monitoring by implementing robust observability solutions and automating manual processes to improve system resilience. * Drive cost optimisation …
Employment Type: Full-Time
Salary: £110,000 - £125,000 per annum

Network Security Lead

London, United Kingdom
Hybrid / WFH Options
Pertemps
maintain systems according to approved design. Service Delivery & Operations: Lead key service management processes (Continuity, Capacity, Availability). Attend incident/problem bridges as the subject matter expert. Review root cause analyses (RCAs) and oversee corrective actions. Provide accurate monthly service performance reports across IT and OT. Supplier & Financial Management: Lead and manage suppliers to meet agreed SLAs … change management experience. Ability to simplify complex network architecture for non-technical audiences. Desirable Technical Skills & Qualifications: Knowledge of network security technologies and strategic supplier management. Experience in stakeholder analysis and business case development. Familiarity with cloud integration (Azure and AWS). What's in it for you? Competitive salary up to £75,000 per annum, depending on experience …
Employment Type: Permanent
Salary: GBP Annual

IT Service Delivery Manager (Defence)

Farnborough, Hampshire, United Kingdom
Positiv Cohort
Proactively identify areas for improvement and implement preventive measures. Service Improvement: Continuously assess the IT service delivery process and implement improvements that enhance efficiency, effectiveness, and customer satisfaction. Lead root cause analysis for service delivery issues and define corrective actions. Change Management: Ensure that changes to the IT environment are implemented smoothly with minimal disruption to service. …
Employment Type: Permanent
Salary: GBP Annual

AWS Cloud Engineer with Security Clearance

Washington, Washington DC, United States
Gridiron IT Solutions
for improvement and minimizing the wastage Encouraging and building automated processes wherever possible Identifying and deploying security measures by continuously performing vulnerability assessment and risk management Incident management and root cause analysis Coordination and communication with team and with customers both external and internal Selecting and deploying appropriate CI/CD tools Managing periodic reporting on the …
Employment Type: Permanent
Salary: USD 140,000 Annual

Database Administrator I with Security Clearance

Falls Church, Virginia, United States
Hybrid / WFH Options
Epsilon Inc
of data between systems by helping with Extract, Transform, Load (ETL) processes and ensuring data consistency across different platforms. Monitor and Troubleshoot Database Performance Issues - Identify potential bottlenecks, perform root cause analysis, and work with senior architects to implement solutions that enhance database reliability and efficiency. Support Compliance and Regulatory Requirements - Ensure database structures and data management …
Employment Type: Permanent
Salary: USD Annual

Information Security Analyst II with Security Clearance

Falls Church, Virginia, United States
Hybrid / WFH Options
Epsilon Inc
assessments and provide actionable recommendations for mitigation. Experience supporting security for data pipelines, AI/ML environments, or cloud-based infrastructures. Excellent incident response skills, including triage, containment, and root cause analysis. Strong communication and collaboration abilities to partner with cross-functional teams and stakeholders. One or more of the following certifications are desired: Certified Cloud Security Professional …
Employment Type: Permanent
Salary: USD Annual

Cloud Engineer - TS/SCI with Security Clearance

Washington, Washington DC, United States
OneGlobe LLC
for improvement and minimizing the wastage • Encouraging and building automated processes wherever possible • Identifying and deploying security measures by continuously performing vulnerability assessment and risk management • Incident management and root cause analysis • Coordination and communication with team and with customers both external and internal • Selecting and deploying appropriate CI/CD tools • Managing periodic reporting on the …
Employment Type: Permanent
Salary: USD Annual

Global IT Software Engineer Senior Manager

London, United Kingdom
The Boston Consulting Group GmbH
best practices, cloud strategies, and platform engineering. Team Leadership: Guide and coach a team of engineers, technical specialists, and architects, encouraging the adoption of innovative technologies and practices. Technical Analysis: Lead technical analysis and estimation efforts for custom-built applications. Best Practices: Drive the adoption of release management and automation best practices. Incident Management: Ensure thorough root cause analysis and prompt remediation during any incidents or outages. Vendor Coordination: Work with external vendors to supplement team capacity and expertise when necessary. YOU'RE GOOD AT You bring solid development and program leadership experience to drive technical governance, innovation, integrations, and cloud strategies using emerging technologies like Gen AI. You thrive in environments that demand …
Employment Type: Permanent
Salary: GBP Annual

Senior Cloud Engineer with Security Clearance

Chantilly, Virginia, United States
Arion Systems, inc
deployment, monitoring, and scaling. • Continuously evaluate and improve the cloud infrastructure to align with evolving technology trends and business requirements. • Respond to and resolve cloud-related incidents, providing detailed root cause analysis and long-term solutions. • Work with other teams to ensure robust disaster recovery and business continuity planning. • Stay current with emerging cloud technologies and propose …
Employment Type: Permanent
Salary: USD Annual

IT Application Support Analyst

London, United Kingdom
Hybrid / WFH Options
Kurt Geiger
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including root cause analysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer (SRE)

Wokingham, Berkshire, United Kingdom
Leap29
cloud and hybrid environments. Architect observability solutions (monitoring, logging, alerting) that detect and prevent failures before they impact users. Own and improve incident response workflows, including runbooks, communications, and root cause analysis. Define and enforce SLIs, SLOs, and error budgets to balance innovation with operational stability. Mentor engineers and advise teams on best practices for scalability, security, deployment … efforts, reliability reviews, and cross-functional reliability programs. Core Responsibilities Operations Leadership Act as a senior escalation point for major incidents and production outages. Lead post-incident reviews, coordinate root cause analysis, and drive remediation plans. Communicate platform health, risk, and improvement plans with technical and non-technical stakeholders. Design and build robust CI/CD workflows …
Employment Type: Permanent
Salary: GBP Annual

Security Analyst (Splunk Enterprise Security) with Security Clearance

Chantilly, Virginia, United States
Arion Systems, inc
Job Summary: As a Security Analyst, you will provide day-to-day security monitoring, incident response, and threat analysis leveraging Splunk Enterprise Security (ES) and SOAR platforms. You will also play an active role in the ongoing buildout, configuration, and engineering of our Splunk ES environment, including onboarding new data sources, creating detection content, and developing automated response workflows. … fast-paced government setting. Key Responsibilities: • Monitor and analyze security events using Splunk Enterprise Security (ES) dashboards, alerts, and correlation searches. • Investigate and respond to security incidents, including triage, root cause analysis, containment, and remediation support. • Develop and fine-tune correlation rules, alerts, and dashboards in Splunk ES to improve threat detection capabilities. • Design, build, and maintain … onboarding new data sources, tuning correlation rules, and developing new detection use cases. • Collaborate with other teams to support incident response, vulnerability management, and threat hunting activities. • Conduct threat analysis, log analysis, and data enrichment using Splunk and other security tools. • Participate in regular security reviews and audits, providing evidence and reporting as needed. • Contribute to documentation and …
Employment Type: Permanent
Salary: USD Annual

Threat Hunter - National Security - Leeds

Leeds, Yorkshire, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
hybrid and flexible working arrangements available. Please consult your recruiter for details. Grade: GG10 - GG11 Referral Bonus: £5,000 Job Description Serve as the point of escalation for intrusion analysis, forensics, and incident response queries. Provide root cause analysis for complex, non-standard findings and anomalies without existing playbooks. Mentor team members and share knowledge proactively. … red team and pentest findings to improve detection rules. Provide forensic support and threat emulation to improve alert triage and accuracy. Identify gaps in SOC processes, data collection, and analysis, demonstrating the need for improvements through scenarios and red teaming. Perform complex threat hunting, automation, and analytic enrichment tasks. Set vision and milestones for emulation and detection capabilities, influencing …
Employment Type: Permanent
Salary: GBP Annual

Software Test Engineer with Security Clearance

Annapolis, Maryland, United States
Solutions Technology, Inc
testing. Work closely with development teams to integrate testing into the software development lifecycle (SDLC). Identify, document, and track defects using issue-tracking tools such as JIRA. Conduct root cause analysis and provide insights to improve product quality. Collaborate with cross-functional teams to ensure adherence to quality standards and best practices. Mentor and guide junior …
Employment Type: Permanent
Salary: USD Annual

Technical Operations Specialist

London, United Kingdom
Aztec
Knowledge Management: Maintain up-to-date technical documentation, including API/interface catalogues, data flow diagrams, environment runbooks, and integration design patterns Incident and Service Request Administration: Assist in root cause analysis for integration-related issues, serving as the primary point of contact for documenting, triaging, and coordinating the resolution of incidents and service requests. Change Coordination … a conduit between the development team and project teams to ensure consistent, transparent, and professional communication Education and Experience: Bachelor's degree in computer science, information-technology, engineering, system analysis or a related study, or equivalent experience A minimum of three years in a technology-related capacity with direct exposure to software development or IT project environments. At least …
Employment Type: Permanent
Salary: GBP Annual

Solace Messaging Administrator

London, UK
BGC Group
Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low …

Solace Messaging Administrator

City of London, Greater London, UK
BGC Group
Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low …

Software Engineer

Sheffield, Yorkshire, United Kingdom
Hybrid / WFH Options
Experis - ManpowerGroup
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor …
Employment Type: Permanent
Salary: GBP Annual

Software Engineer

Sheffield, South Yorkshire, United Kingdom
Hybrid / WFH Options
Experis
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor …
Employment Type: Contract
Rate: £395 - £430/day
Root Cause Analysis
10th Percentile: £29,188
25th Percentile: £41,250
Median: £52,500
75th Percentile: £67,500
90th Percentile: £83,750