architectural principles, working closely with enterprise architecture and data teams Maintain technical service roadmaps and monitor new Microsoft platform releases Provide 3rd line support for live digital services Conduct root cause analysis and identify solutions Assess impact of new Microsoft releases Skills/Experience: Extensive experience with the Microsoft stack, including Power Platform and Dynamics 365 Experience
performance. The Role: Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS Act as an escalation point for critical data incidents and lead root cause analysis Optimising system performance, define SLIs/SLOs, and drive reliability Working closely with various other departments and teams to architect scalable, fault-tolerant data solutions
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical data incidents and lead root cause analysis *Optimising system performance, define SLIs/SLOs, and drive reliability *Working closely with various other departments and teams to architect scalable, fault-tolerant data solutions
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical data incidents and lead root cause analysis *Optimising system performance, define SLIs/SLOs, and drive reliability *Working closely with various other departments and teams to architect scalable, fault-tolerant data solutions
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Robert Half
of device access, ensuring starters and leavers are processed effectively Document administrative tasks and procedures Essential Requirements: Good standard of education Full driving licence Analytical thinking, problem-solving and root cause analysis skills Attention to detail Excellent organisation and communication skills. Excellent time management with a flexible approach The ability to manage a busy workload whilst thriving
of collaboration, innovation, and continuous improvement. Assist with the design, implementation, and maintenance of systems to ensure high availability, scalability, and performance. Develop and implement strategies for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and
qualification, audits, and corrective actions. Review and approve quality documentation to ensure compliance with specifications and regulatory standards. Monitor product performance and customer feedback for areas of improvement. Conduct root cause analysis and implement corrective and preventive actions. Collaborate with R&D, manufacturing, and supply chain teams to embed quality throughout the product lifecycle. Manage internal and
of collaboration, innovation, and continuous improvement. Assist with the design, implementation, and maintenance of systems to ensure high availability, scalability, and performance. Develop and implement strategies for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
Other responsibilities will encompass proactive monitoring of production environments, design and implementation of automation and processes to improve efficiency and effectiveness, and taking a lead in incident response, troubleshooting and root cause analysis activities to mitigate future issues. You'll collaborate with senior business stakeholders to gather requirements, address concerns and provide updates on projects and systems status
Fetcham, Leatherhead, Surrey, England, United Kingdom Hybrid / WFH Options
Recruitvirt Ltd
data protection, recovery, and disaster recovery (DR) across on-prem and hybrid workloads. - Manage incidents, service requests, and change controls via standard ITIL-based processes. - Lead and participate in root cause analysis for infrastructure-related incidents and issues. - Maintain and update detailed technical documentation and configuration records. - Act as a senior point of contact for customers, attending
Leatherhead, Surrey, England, United Kingdom Hybrid / WFH Options
Recruitvirt Ltd
data protection, recovery, and disaster recovery (DR) across on-prem and hybrid workloads. Manage incidents, service requests, and change controls via standard ITIL-based processes. Lead and participate in root cause analysis for infrastructure-related incidents and issues. Maintain and update detailed technical documentation and configuration records. Act as a senior point of contact for customers, attending
lifetime of an incident or problem Undertake small scale development tasks, primarily creating checks and automation and data reconstructions Participate in or run relevant meetings, e.g. team ceremonies, root cause analysis, long running problem handover, etc. Ensure that the Incident and Problem Management process is followed Establish, share and follow operational procedures and controls Participate in
Salisbury, Wiltshire, United Kingdom Hybrid / WFH Options
Sopra Steria Group
not limited to Cisco Routing, Switching, Security, SDN, Unified Communications and Wireless technologies). Identify and explore opportunities for enhancing efficiency, leveraging orchestration technologies to streamline and automate. Lead 'Root Cause Analysis' investigations into network faults, security and performance issues. Support the Principal NetOps Engineer and Architects with project implementation. Liaise with third party service providers for
Portsmouth, Hampshire, South East, United Kingdom Hybrid / WFH Options
Sopra Steria Limited
not limited to Cisco Routing, Switching, Security, SDN, Unified Communications and Wireless technologies). Identify and explore opportunities for enhancing efficiency, leveraging orchestration technologies to streamline and automate. Lead 'Root Cause Analysis' investigations into network faults, security and performance issues. Support the Principal NetOps Engineer and Architects with project implementation. Liaise with third party service providers for
and automation to support agile and resilient delivery. Establish observability, quality assurance, and performance metrics across the engineering lifecycle. Provide leadership in incident and problem management, ensuring resilient runbooks, root cause analysis, and strong operational readiness. Collaborate with Finova and other vendors to manage technical delivery, integration, and cost performance. Develop, lead, and mentor multidisciplinary engineering teams
source, validate and update rate cards for international carriers. Loss Reductions: minimise loss-related expenses such as Item Not Received (INR) claims and arrived-damaged issues. Use data-driven root cause analysis to recommend and implement systemic fixes. Cross-Functional Leadership: Collaborate with Product, Legal, Customer Service, Seller Management, Regional Ops teams, Finance and External Partners to
Doncaster, South Yorkshire, Yorkshire, United Kingdom
DFS Furniture Ltd
manage automation scripts to streamline processes and reduce manual effort. Collaborate with development, data, and security teams to understand and meet their platform requirements. Lead incident resolution efforts, including root cause analysis, fixes, documentation, and preventative solutions. Apply strong analytical and problem-solving skills to resolve complex technical issues. Monitor and manage platform performance, ensuring high standards
understanding of Agile development and the role of testing throughout the sprint lifecycle. Comfort with writing clear, testable acceptance criteria using Gherkin or similar syntax. Excellent debugging, investigation, and root-cause analysis skills. A collaborative, detail-oriented mindset and strong communication across teams. BSc in a related field such as Computer Science, Computer Engineering, or other software
Security Devices: Configure, maintain, and troubleshoot firewall technologies and other security devices to safeguard the network from potential threats. Incident Response & Troubleshooting: Respond to network and security incidents, conduct root cause analysis, and ensure rapid resolution to minimize downtime. Collaboration & Integration: Work closely with our IT and software engineering teams to integrate security into system development and
Cheltenham, Gloucestershire, South West, United Kingdom
LM RECRUITMENT SOLUTIONS LTD
Lead the adoption of proactive monitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence: ticket discipline, root cause analysis, and continuous improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and
Cheltenham, Gloucestershire, South West, United Kingdom
LM RECRUITMENT SOLUTIONS LTD
Lead the adoption of proactive monitoring and automation tools to help transition the business from reactive support to predictive, streamlined operations. Lead on service management excellence: ticket discipline, root cause analysis, and continuous improvement. Ensure all backup strategies (on-premises and cloud) are fit for purpose, with robust monitoring and management to maintain data integrity and
processes for continuous security monitoring and detection of security events, including application-specific security events. Lead the investigation and resolution of security incidents, including those related to application vulnerabilities, root cause analysis, and implementation of corrective actions. Reporting: Provide regular reports on the organization's security posture, including application security vulnerabilities and risks, and compliance status
Security Champion Enablement: Collaborate with engineering teams to build security awareness and develop a network of Security Champions. Incident & Response Readiness: Support Smarsh SOC and security incident response, including root cause analysis and post-mortem reviews for your product(s). Security Compliance & Governance: Ensure alignment with regulatory requirements (SOC 2, ISO 27001, etc.) and support audit
Reliability: Establish deep observability into cloud network paths, health indicators, and latency measurements. Apply SRE practices to ensure uptime, fast incident response, and continuous improvement. Drive performance optimization and root cause analysis through telemetry, analytics, and runbooks. Define and monitor SLAs, SLOs, and KPIs related to cloud connectivity experience. Security, Compliance & Governance: Ensure secure design and enforcement
best practice. Development of strategies and initiatives related to C.I and Lean Manufacturing to drive operational performance, quality improvement, and standardisation across manufacturing. Support the incident management process, including root cause analysis and remediation activity. Functional leadership of C.I activity and resources, working with operational leadership teams to ensure consistent delivery of improvement and standardisation initiatives. Lead … process mapping, analysis, documentation (SOP), and re-engineering to optimise workflows and eliminate inefficiencies. Supporting the assessment and onboarding of new business opportunities/wins including training and procedural documentation requirements. Identify and implement industry best practices and methodologies, such as Lean, Six Sigma, and Agile. Establish KPIs to monitor process performance and deliver measurable improvements. Drive cultural change … and implementation of the Operational Strategy suggesting opportunities for further improvement or refinement to the Strategy. You will develop and implement a programme of Continuous Improvement activity, conduct process analysis, and foster a culture of continuous improvement across Manufacturing. The Head of Manufacturing Excellence plays an integral role in helping to foster a culture of sustainable change through the