to ensure secure-by-design principles are embedded in systems and products. Maintain and update security tools including SIEM, endpoint protection, and identity management systems. Investigate security incidents, perform root cause analysis, and recommend corrective actions. Assist in compliance efforts with standards such as ISO 27001, NIST, and MOD-specific frameworks (e.g., JSPs, DEFSTANs). Contribute to ...
optimising system performance based on key metrics. Deploy updates and fixes, and perform regular maintenance. Provide second-line technical support. Build tools and automation to reduce errors and enhance customer experience. Perform root cause analysis for production errors and implement long-term solutions. Troubleshoot and resolve technical issues efficiently. Automate tasks, including visualisation and reporting processes. Design and implement troubleshooting and maintenance procedures. Collaborate ...
Day-to-day IT service delivery including in-house systems. USUP, Incident Management (Level 5): Owns the incident process and ensures restoration of service. PBMG, Problem Management (Level 5): Root cause analysis and avoidance of recurring incidents. CHMG, Change Management (Level 5): Authorises, schedules, and reviews IT changes. ASMG, Asset Management (Level 5): Owns and manages the ...
also enjoys seeing projects through to completion, solving problems at scale, and collaborating with leadership to help grow and improve IT operations. Key Responsibilities: Provide expert-level support and root-cause analysis for persistent or critical technical issues. Support infrastructure-related tasks such as servers, networking, identity management, and security tools (e.g., EDR/MDR/SOC ...
distribution in SAP HANA. Proven experience with performance trace tools like ST12, ST05, SAT, HANA Studio PlanViz. Experience in analysing dump logs, database statistics, and job performance metrics. Excellent troubleshooting, RCA (Root Cause Analysis), and optimization mindset. Secondary Skills (Desired): Familiarity with SAP Business Technology Platform (BTP), AWS, or Google Cloud Platform for performance analysis. Working knowledge of JVM tuning ...
Azure). Experience with embedded Linux kernel programming. Full-stack development expertise (front-end & back-end). CI/CD pipeline implementation & optimisation. Testing frameworks: Selenium, TDD, Xray, Zephyr Scale, Cucumber. Root cause analysis & debugging proficiency. WHY JOIN US? Be part of a team working on innovative quantum technology for a fast-growing business in Birmingham. Work on diverse ...
architectural principles, working closely with enterprise architecture and data teams. Maintain technical service roadmaps and monitor new Microsoft platform releases. Provide 3rd line support for live digital services. Conduct root cause analysis and identify solutions. Assess impact of new Microsoft releases. Skills/Experience: Extensive experience with the Microsoft stack, including Power Platform and Dynamics 365. Experience ...
Employment Type: Permanent
Salary: £350.0 - £400.0 per day + £400 inside IR35
performance. The Role: Maintain and monitor real-time and batch data pipelines using Flink, Kafka, Python, and AWS. Act as an escalation point for critical data incidents and lead root cause analysis. Optimise system performance, define SLIs/SLOs, and drive reliability. Work closely with various other departments and teams to architect scalable, fault-tolerant data solutions ...
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
performance. The Role: * Maintain and monitor real-time and batch data pipelines using Flink, Kafka, Python, and AWS * Act as an escalation point for critical data incidents and lead root cause analysis * Optimise system performance, define SLIs/SLOs, and drive reliability * Work closely with various other departments and teams to architect scalable, fault-tolerant data solutions ...
Gloucestershire, United Kingdom Hybrid / WFH Options
Robert Half
of device access, ensuring starters and leavers are processed effectively. Document administrative tasks and procedures. Essential Requirements: Good standard of education. Full driving licence. Analytical thinking, problem-solving and root cause analysis skills. Attention to detail. Excellent organisation and communication skills. Excellent time management with a flexible approach. The ability to manage a busy workload whilst thriving ...
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
performance. The Role: * Maintain and monitor real-time and batch data pipelines using Flink, Kafka, Python, and AWS * Act as an escalation point for critical data incidents and lead root cause analysis * Optimise system performance, define SLIs/SLOs, and drive reliability * Work closely with various other departments and teams to architect scalable, fault-tolerant data solutions ...
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Robert Half
of device access, ensuring starters and leavers are processed effectively. Document administrative tasks and procedures. Essential Requirements: Good standard of education. Full driving licence. Analytical thinking, problem-solving and root cause analysis skills. Attention to detail. Excellent organisation and communication skills. Excellent time management with a flexible approach. The ability to manage a busy workload whilst thriving ...
qualification, audits, and corrective actions. Review and approve quality documentation to ensure compliance with specifications and regulatory standards. Monitor product performance and customer feedback for areas of improvement. Conduct root cause analysis and implement corrective and preventive actions. Collaborate with R&D, manufacturing, and supply chain teams to embed quality throughout the product lifecycle. Manage internal and ...
of collaboration, innovation, and continuous improvement. Assist with the design, implementation, and maintenance of systems to ensure high availability, scalability, and performance. Develop and implement strategies for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and ...
Newport, Gwent, Wales, United Kingdom Hybrid / WFH Options
Yolk Recruitment
leadership role where you'll take ownership of incident and problem management across a critical national infrastructure environment. You'll oversee the governance of best practice frameworks, ensuring timely root cause analysis and preventative actions, while leading a collaborative team and influencing service delivery across a multi-vendor landscape. This is an opportunity to create tangible improvements ...
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
Other responsibilities will encompass proactive monitoring of production environments, design and implementation of automation and processes to improve efficiency and effectiveness, and taking a lead in incident response, troubleshooting and root cause analysis activities to mitigate future issues. You'll collaborate with senior business stakeholders to gather requirements, address concerns and provide updates on projects and systems status ...
Fetcham, Leatherhead, Surrey, England, United Kingdom Hybrid / WFH Options
Recruitvirt Ltd
data protection, recovery, and disaster recovery (DR) across on-prem and hybrid workloads. - Manage incidents, service requests, and change controls via standard ITIL-based processes. - Lead and participate in root cause analysis for infrastructure-related incidents and issues. - Maintain and update detailed technical documentation and configuration records. - Act as a senior point of contact for customers, attending ...
Gloucester, Gloucestershire, England, United Kingdom Hybrid / WFH Options
IMT Resourcing Solutions
endpoint onboarding, policy deployment, and software packaging. Overseeing server patching processes and infrastructure monitoring, including WSUS/AUM configuration and third-party update scheduling. Supporting high-priority incident resolution, root cause analysis, and the documentation of best practices and knowledge articles. Driving continual service improvement and supporting major infrastructure and cloud transformation projects. What We’re Looking ...
lifetime of an incident or problem. Undertake small-scale development tasks, primarily creating checks, automation and data reconstructions. Participate in or run relevant meetings, e.g. team ceremonies, root cause analysis, long-running problem handover, etc. Ensure that the Incident and Problem Management process is followed. Establish, share and follow operational procedures and controls. Participate in ...
Build scalable, testable, and reliable systems with a strong focus on performance. Collaborate with global development and business teams to design and implement effective technical solutions. Provide technical support, root cause analysis, and issue resolution. Job requirements: 3-7 years' experience working with C++ (C++11 or later; C++20 preferred). Understanding of Linux-based development environments. Excellent problem ...
Salisbury, Wiltshire, United Kingdom Hybrid / WFH Options
Sopra Steria Group
not limited to Cisco Routing, Switching, Security, SDN, Unified Communications and Wireless technologies). Identify and explore opportunities for enhancing efficiency, leveraging orchestration technologies to streamline and automate. Lead 'Root Cause Analysis' investigations into network faults, security and performance issues. Support the Principal NetOps Engineer and Architects with project implementation. Liaise with third party service providers for ...
Portsmouth, Hampshire, South East, United Kingdom Hybrid / WFH Options
Sopra Steria Limited
not limited to Cisco Routing, Switching, Security, SDN, Unified Communications and Wireless technologies). Identify and explore opportunities for enhancing efficiency, leveraging orchestration technologies to streamline and automate. Lead 'Root Cause Analysis' investigations into network faults, security and performance issues. Support the Principal NetOps Engineer and Architects with project implementation. Liaise with third party service providers for ...
Manage and report on SLA adherence, escalations, and ticket backlog. Implement and refine standard operating procedures (SOPs) for incident response, change control, and communications. Analyse incident trends and drive root cause analysis and long-term remediation. Collaboration & Escalation: Act as the escalation point for major incidents during and outside business hours as needed. Collaborate with infrastructure, cloud ...