solutions across physical, virtual, and Azure environments Lead or contribute to infrastructure projects: migrations, upgrades, and deployments Optimise performance, reliability, and capacity across client estates Manage major incidents and rootcauseanalysis Conduct estate reviews and proactively address risks Maintain technical documentation (HLDs, LLDs, SOPs) Use service tools to manage tasks and meet SLAs Support onboarding and More ❯
Peterborough, Cambridgeshire, England, United Kingdom
Charis Grants Limited
with the design, deployment, and management of cloud-based Azure infrastructure solutions. Ensure the efficient operation, monitoring, and maintenance of cloud environments, virtual machines, storage, networking, and databases. Perform rootcauseanalysis of recurring infrastructure issues and provide solutions to improve system reliability. Collaborate with the 1st line support teams to escalate and resolve complex technical issues. More ❯
pipeline monitoring, alerting, and logging to detect failures and performance bottlenecks. Build automation to ensure data quality, lineage tracking, and schema evolution management. Participate in incident response, troubleshooting, and rootcauseanalysis for data issues. Advocate for DataOps best practices, driving automation, reproducibility, and scalability. Document infrastructure, data workflows, and operational procedures. What are we looking for More ❯
for network configurations, procedures, and policies. • Plan and execute network and virtualization projects, ensuring timely delivery and adherence to budget and quality standards. • Responsible for troubleshooting network incidents, providing rootcauseanalysis and documenting information in ticketing system (ServiceNow) and knowledge repositories (MS SharePoint). • Willing to work nights for network Change Request (CR) implementations and be More ❯
is responsible for support, administration, monitoring, troubleshooting and participating in various projects to enhance service and productivity. Job Responsibilities Provide L3+ support to delivery teams on escalated issues. Generate root-causeanalysis related to critical incidents and problems. Engage with vendor on getting support, updates, and planning as required. Take ownership of designated technologies covering Citrix environment More ❯
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
respond to changes in system behaviour as they arise. Support and troubleshooting: Second and third-line support, responding directly to business issues and questions. Problem escalation and incident response. Rootcauseanalysis and proactive problem solving. The team is empowered to deploy changes in response to arising requirements. Business-as-usual maintenance: Use of automation tools and More ❯
london (city of london), south east england, United Kingdom
Circana
is responsible for support, administration, monitoring, troubleshooting and participating in various projects to enhance service and productivity. Job Responsibilities Provide L3+ support to delivery teams on escalated issues. Generate root-causeanalysis related to critical incidents and problems. Engage with vendor on getting support, updates, and planning as required. Take ownership of designated technologies covering Citrix environment More ❯
issues. Help building tools to support client integrations. Provide support to FlexTrade clients globally and advise on questions and queries. Liaise with clients/brokers to interpret queries for rootcauseanalysis and propose appropriate workarounds and solutions Professional experience in FIX protocol Excellent analytical and problem-solving skills. Accuracy and attention to detail. Programming experience preferred More ❯
mitigation strategies. Develop and deliver real-time risk dashboards and MI for senior leadership and governance committees. Conduct control testing and assurance activities on design and operational effectiveness. Drive rootcauseanalysis of operational incidents and ensure control enhancements are implemented. Collaborate with internal audit, compliance, and technology on cross-functional risk initiatives. Prepare high-quality risk More ❯
vulnerabilities are exploited or identified in real-time. Work with relevant teams to contain and mitigate security breaches, ensuring minimal impact on the business. Develop post-incident reports, including rootcauseanalysis and remediation strategies. Security Strategy & Improvement: Stay up-to-date on the latest security trends, tools, techniques, and frameworks. Continuously evaluate and improve the organisation More ❯
Response Ace: When things get a bit wobbly, you'll be on the front lines, resolving incidents fast to minimize downtime. After the dust settles, you'll lead the rootcauseanalysis to prevent similar issues from popping up again. Automation Whizz: Got a repetitive task? You'll be the one automating it away! From environment setup More ❯
Derby, Derbyshire, United Kingdom Hybrid / WFH Options
Cooper Parry
employee experience, and directly influence margin and value creation across the firm. You'll be getting stuck into a wide range of tasks, including; Process Optimisation Delivery Leading discovery, analysis, and redesign of core processes across multiple service lines and operational functions. Mapping current and future-state processes with clear visuals, evidence-based insights, and financial impact analysis. Identifying … service leads and the Transformation Team. Collaboration & Engagement Working across the business to understand operational challenges and co-designing solutions that work in practice. Facilitating workshops, user interviews, and root-causeanalysis sessions. Partnering with Technology, Business Operations, and Finance to align process changes with automation and digital capability. Acting as a key point of engagement with More ❯
projects and building custom tools. Key job responsibilities Technical Support and Problem Resolution • Diagnose and resolve complex production software issues across multiple products and services • Perform comprehensive troubleshooting and rootcauseanalysis for technical challenges • Provide timely and effective support through ticket management and customer communication Software Development and Maintenance • Develop and implement operational tools and automation More ❯
and maintain quality management system documentation Produce and maintain Quality Management Plans for new and existing projects Support project non-conformance recovery and closure through defined QMS procedures. Facilitate rootcauseanalysis and corrective action implementation; driving timely completion of actions. Review and approve supplier First Article Inspection Reports and compile internal First Article Inspection Reports in More ❯
Doncaster, Yorkshire, United Kingdom Hybrid / WFH Options
DFS Furniture PLC
manage automation scripts to streamline processes and reduce manual effort. Collaborate with development, data, and security teams to understand and meet their platform requirements. Lead incident resolution efforts, including rootcauseanalysis, fixes, documentation, and preventative solutions. Apply strong analytical and problem-solving skills to resolve complex technical issues. Monitor and manage platform performance, ensuring high standards More ❯
a plus, but not required High proficiency with at least one programming/scripting language (e.g., Go, Python, C) and ability to learn additional languages quickly Ability to perform rootcauseanalysis Strong verbal and written communication skills, including the ability to communicate effectively and efficiently with both coworkers and third-party vendors Strong collaboration skills with More ❯
architecture will help you. What you'll be doing Diagnose and resolve complex application and infrastructure issues Participate in our 24x7 on-call rotation, SCRUM, and deployment planning Perform RootCauseAnalysis (RCA) and provide recommendations for application teams Improve availability and reduce customer impact using Industry best observability tools Ensure best-practice and security-minded architecture More ❯
hazard recognition, and incident prevention. Provide mentorship and oversight to site safety managers and coordinators; assist in hiring and training safety staff. Investigate incidents, near misses, and injuries; perform rootcauseanalysis and recommend corrective actions. Collaborate with operations team members to develop project-specific accident prevention plans (APPs), activity hazard analyses (AHAs), and site-specific high More ❯
project proposals and for the development of actionable performance measures, metrics, and standard operating procedures. Work with other contractor staff to develop design compliance performance measures and participate in rootcauseanalysis of design compliance problems. Attend DoS and inter-bureau collaborative working groups as a Subject Matter Expert (SME) on DoS Technical Security Systems installation requirements More ❯
quality and accurate reports for a wide range of stakeholders. Collaborate with Security Engineers and cross-functional teams to investigate and remediate large scale security incidents. Support security incident rootcauseanalysis, identify control gaps, and recommend mitigation strategies. Collaborate with cross-functional teams to drive improvements to security tools, policies and processes. Improve the effectiveness and More ❯
Technical Support: Provide second-level technical support for complex IT issues escalated from the 1st level support team. Troubleshoot and resolve hardware, software, network, and system-related problems. Perform rootcauseanalysis to identify underlying issues and implement appropriate solutions. IT Infrastructure Management: Manage and maintain the hotel's IT infrastructure, including servers, switches, routers, firewalls, and More ❯
Technical Support: Provide second-level technical support for complex IT issues escalated from the 1st level support team. Troubleshoot and resolve hardware, software, network, and system-related problems. Perform rootcauseanalysis to identify underlying issues and implement appropriate solutions. IT Infrastructure Management: Manage and maintain the hotel's IT infrastructure, including servers, switches, routers, firewalls, and More ❯
Incident Response Diagnose, analyse, and resolve network failures and recurring issues in LAN, WAN, and wireless environments. Respond to network-related incidents and service tickets within agreed SLAs. Conduct rootcauseanalysis (RCA) for network outages and implement permanent fixes. Use network monitoring and diagnostic tools (e.g., Wireshark, SolarWinds, PRTG) to proactively detect and resolve performance bottlenecks. More ❯
Produce and distribute clear, actionable reports and dashboards on: Planned vs Actual manned hours Shift fulfilment and attendance rates SLA performance across all contracts Provided vs Invoiced hours , with rootcauseanalysis for variances Deliver reports daily (tactical), weekly (trend), and monthly (strategic/board level). Collaborate with the Data Manager and Invoicing Manager to ensure More ❯
improvement and development of support processes. Day-to-day Diagnose and resolve high-priority, complex technical issues reported by customers, ensuring timely resolution and high customer satisfaction Conduct thorough rootcauseanalysis of recurring issues to identify and implement preventive measures Ensure high ticket productivity while maintaining a minimal backlog Manage and prioritize incidents and service requests More ❯