maintain both on-premises and cloud-based infrastructure Serve as a technical escalation point for 1st and 2nd Line Support Engineers Troubleshoot and resolve complex infrastructure incidents and perform rootcauseanalysis Support a wide range of systems including Exchange, Microsoft 365, Windows Server, VMware, and core networking Lead on system upgrades, patch management, capacity planning, and More ❯
external customers. Maintain impeccable case hygiene and customer-related files and records. Customer Advocate providing support to users/administrators of our platform. Experience assessing, troubleshooting, resolving, and providing rootcauseanalysis for Product issues related to upgrades, cloning, tables, reporting, performance analytics etc. Excellent written and verbal communication skills with the ability to clearly articulate solutions More ❯
Amersham, Buckinghamshire, South East, United Kingdom Hybrid / WFH Options
Anson Mccade
for in an Operations Support Analyst: 2-3 years' experience in operations support across enterprise platforms (HR, ERP, Finance, Integrations). Strong analytical skills, with proven experience in reporting, rootcauseanalysis, and process improvement. Excellent stakeholder management, communication, and vendor engagement skills. Comfort working in complex, multi-vendor IT environments with global teams. Familiarity with ITIL More ❯
Python, Powershell, or Go, which are essential for automating routine tasks and system deployments. Incident Management and Troubleshooting - The ability to manage incidents effectively, troubleshoot issues swiftly, and perform rootcauseanalysis to prevent future incidents. Systems Engineering and Automation - A deep understanding of systems engineering, including operating systems, networking, and cloud infrastructure. Proficiency in automation tools … reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts to automate operational processes, reducing manual workload, increasing … aligned support areas to keep up to speed with business activity and the business strategies. Create solutions based on sophisticated analytical thought comparing and selecting complex alternatives. In-depth analysis with interpretative thinking will be required to define problems and develop innovative solutions. Adopt and include the outcomes of extensive research in problem solving processes. Seek out, build and More ❯
s degree with professional experience developing software. Significant professional experience developing software for Senior positions. Demonstrable record of developing high-quality embedded software for end-user products from requirements analysis to delivery. Demonstrable experience in a technical leadership role within a software team. Expert knowledge and experience with implementation skills such as object-oriented programming, pair programming, memory management … knowledge of C++ is required. Experience with Agile/Scrum development practices. Advanced knowledge of verification skills such as test-driven development, unit testing, code reviews, defect characterization, requirements analysis and planning skills such as user story development, backlog management, high-level estimation, and detailed estimation. Advanced configuration management skills such as revision control, continuous integration, continuous deployment, automated … SCONS, FitNesse, Lauterbach debuggers. Excellent interpersonal, written, and oral communication skills, with an ability to influence and build strong, credible relationships. Aptitude in engineering excellence processes such as FMEA, RootCauseAnalysis, Design of Experiments, etc. is a plus, but not required FPGA (VHDL) programming experience is a plus, but not required. Experience in an industrial printing More ❯
other proactive or reactive tasks. Monitor and log application and infrastructure health, troubleshooting and optimising system performance based on key metrics Deploy updates, fixes, and perform regular maintenance Perform rootcauseanalysis for production errors and implement long-term solutions Troubleshoot and resolve technical issues efficiently Automate tasks, including visualisation and reporting processes Undertake development activity in More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom Hybrid / WFH Options
Lorien
the Group IT Service Desk Ensuring all changes to production services are subject to change control Producing and update documentation to improve the efficiency and effectiveness of systems Conducting rootcauseanalysis following unplanned disruptions to improve system availability Conduct Problem Management of repeat incidents affecting business services Experience with incident management systems, identify incident trends and More ❯
Aldershot, Hampshire, South East, United Kingdom Hybrid / WFH Options
JLA Resourcing Ltd
across platforms, leading scanning, remediation, and reporting activities Design and enforce endpoint security policies, access controls, and malware protections Lead incident identification and response efforts, ensuring swift containment and rootcauseanalysis Oversee system hardening and compliance with industry standards (CIS, STIG, NIST) Contribute to strategic security planning and advise engineering and programme leadership on cyber risks More ❯
Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Toyota Financial Services, KINTO and KINTO JOIN
Maintain and support existing Miles interface/connector inventory (for example: CCS Interface, Mileage uploads, Mazda Contract Loader). Monitor interface/connector performance and batch runs Perform technical analysis to problem solve issues with interfaces Communicate with 3rd party vendors for other systems connecting to the application Define the solution design for new data interfaces Lead the implementation … ability to assess documents and systems for conformity to internal and external standards. Familiarity with principles of good data governance, cyber security and data protection Excellent problem solving and rootcauseanalysis skills Proven ability to work with minimal supervision Attributes and Behaviours Self-starter with an analytical mind and natural problem solver Natural relationship builder with More ❯
Gloucester, Gloucestershire, England, United Kingdom Hybrid / WFH Options
IMT Resourcing Solutions
endpoint onboarding, policy deployment, and software packaging. Overseeing server patching processes and infrastructure monitoring, including WSUS/AUM configuration and third-party update scheduling. Supporting high-priority incident resolution, rootcauseanalysis, and the documentation of best practices and knowledge articles. Driving continual service improvement and supporting major infrastructure and cloud transformation projects. What We’re Looking More ❯
lifetime of an incident or problem Undertake small scale development tasks, primarily creating checks and automation and data reconstructions Participate in or run relevant meetings, e.g. including team ceremonies, rootcauseanalysis, long running problem handover, etc. Ensure that the Incident and Problem Management process is followed Establishing, sharing and follow operational procedures and controls Participate in More ❯
develop standards for coding, testing, debugging, and implementation Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions Mandatory Skills: Relevant Experience in Core-Java, J2EE Spring-Boot, Microservices, Hibernate/jpa Strong experience in SQL & Database … development tracks at the same time is required Experience in leading smaller development teams is necessary Adhere to the Agile process, & also ensure others follow process too. Should have rootcauseanalysis skills, able to quickly analyze and interpret code and ensure that issues are resolved or escalated appropriately. What we'll provide you: By joining Citi More ❯
local and remote staff, ensuring swift resolution of complex issues Provide specialised support to VIP users, ensuring high availability and exceptional service quality Participate in incident management processes, including rootcauseanalysis and resolution of infrastructure-related incidents Evaluate and support third-party solutions Lead and contribute to multiple IT projects Work closely with technology teams to More ❯
Doncaster, South Yorkshire, Yorkshire, United Kingdom
DFS Furniture Ltd
manage automation scripts to streamline processes and reduce manual effort. Collaborate with development, data, and security teams to understand and meet their platform requirements. Lead incident resolution efforts, including rootcauseanalysis, fixes, documentation, and preventative solutions. Apply strong analytical and problem-solving skills to resolve complex technical issues. Monitor and manage platform performance, ensuring high standards More ❯
and a communicative candidate, this is the role for you. Core responsibilities Incident and Problem Management: Own the end-to-end problem and incident lifecycle, ensuring timely resolution and rootcause analysis. Drive proactive problem management, identifying recurring issues and working with Product and Engineering to implement preventive measures. Track and report on key metrics like MTTR, % problems … with diagnosed rootcause, and incident volume. Continuous improvement: Monitor and manage service levels, ensuring adherence to SLAs and KPIs. Identify areas for improvement and implement changes to optimize performance. Collaborate with team leads to ensure effective resource allocation and workload distribution. Foster a culture of continuous improvement and knowledge sharing within the team. Identify training needs across More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Tate Professional
Application Support Analyst Based in London Hybrid working We are seeking an experienced Application Support Analyst to liaise with vendors, business users, and product teams to perform installations, identify root causes, and deliver fixes or enhancements. The candidate would ideally have knowledge of commodity trading and a delivery-focused mindset. Experience working within Agile (SCRUM) development and delivery teams … SQL skills, including the ability to write and optimize complex JOIN queries across platforms such as SQL Server and MySQL. Excellent problem-solving skills with a structured approach to rootcause analysis. Experience in incident and problem management, including ownership from identification to resolution. Ability to manage multiple priorities in a fast-paced environment. Strong interpersonal and communication More ❯
distribution lifecycle with automated and scalable CI/CD tools including Jenkins, Maven, Stash is a must. Experience with third-party libraries and APIs. Demonstrated proficiency in troubleshooting, and root-cause analysisin a complex and interconnect systems. Hands-on experience in AI, particularly Generative AI, is highly preferred. A go-getter who is not overwhelmed by new technology More ❯
Newport, Gwent, Wales, United Kingdom Hybrid / WFH Options
Hays
governance and execution of incident and problem management processes, ensuring they align with organisational goals and industry best practices. You'll be responsible for driving service improvements through effective rootcauseanalysis and corrective actions, while managing a team of internal staff and external partners. Your leadership will foster a high-performance culture focused on operational excellence … IT service delivery, specialising in incident and problem management In-depth knowledge of ITIL, with hands-on experience in incident, problem, and service processes Strong analytical skills, focused on RootCauseAnalysis and implementing corrective actions Clear communicator, adept at engaging both technical and non-technical audiences Proactive in process improvement, streamlining incident and problem management workflows … Service Management: Proficient with tools like ServiceNow to track and resolve incidents and problems. Vendor Coordination: Experienced in managing outsourced IT partners to ensure high-quality service under pressure. RootCauseAnalysis: Skilled in identifying and addressing underlying issues through RCAs. ITIL 4 Practitioner in Incident & Problem Management. What you need to do now If you're More ❯
and Risk Director, you will handle a wide range of tasks including data reviews, chargeable project work, investigating errors and omissions, reviewing calculation issues, managing change through support teams, rootcauseanalysis, change management, and client reporting and interaction. As a leader in the marketplace, WTW offers excellent opportunities for experienced pensions administrators. We offer a competitive … opportunities for chargeable activities. Address individual member-related errors and complete rectification activities. Draft client notifications and project estimate reports. Perform chargeable tasks, including calculation automation, pension rectification, data analysis, letter review, and strategic change. Create change requests to improve processes and introduce automation, supporting testing and system error investigations. Provide rootcauseanalysis to help … ensure quality and completeness. Coach on errors and hold feedback sessions. Produce test packs for automation and sign-off calculations for DB and DC members. Assist in data quality analysis during due diligence exercises. Manage data cleaning and large mailing projects. Scope costs for non-regular projects and monitor budgets. Provide professional, client-focused service to promote the WTW More ❯
production ramp-up of our SPAD IC products. The ideal candidate will have a proven track record in product engineering and test development, including experience with ATE platforms, yield analysis, production support, and qualification processes, especially within automotive-grade environments. While a deep understanding of SPAD or optoelectronic device behavior is preferred, the emphasis of this role is on … and Support: Develop, validate, and maintain ATE test programs for SPAD ICs. Optimize test coverage, throughput, and yield performance. Support test program bring-up and production correlation activities. Yield Analysis Improvement: Analyze production test data to drive yield improvements. Investigate low-yield lots, identify root causes, and implement corrective actions. Work with fab, assembly, and test partners to … execution of qualification plans in compliance with AEC-Q100, ISO26262, and customer-specific requirements. Collaborate with reliability engineers and test teams to implement qualification hardware and test flows. Failure Analysis Debug: Perform bench-level and ATE-based debug of production and qualification failures. Guide rootcauseanalysis efforts and implement solutions to improve product robustness. Cross More ❯
Team, including workload management Ensure time management and quality delivery from different stakeholders Client-oriented strong stakeholder management, providing frequent updates (externally, internally) to ensure continuous clarity Conduct thorough root-causeanalysis and implement lessons learned to drive continuous improvement Involvement in training development for electrical systems, ensuring adherence to company standards and regional requirements Acting as … circuit and arc flash studies is desirable Proven track record of electrical optimization and efficiency improvements, preferably in Data Centers Excellent stakeholder management and communication skills Ability to conduct root-causeanalysis and implement lessons learned Experience in developing and delivering training programs Familiarity with SOP/EOP development and the CAB process Ability to act as More ❯
Hemel Hempstead, Hertfordshire, South East, United Kingdom
NTT Global Data Centers EMEA UK ltd
Team, including workload management Ensure time management and quality delivery from different stakeholders Client-oriented strong stakeholder management, providing frequent updates (externally, internally) to ensure continuous clarity Conduct thorough root-causeanalysis and implement lessons learned to drive continuous improvement Involvement in training development for electrical systems, ensuring adherence to company standards and regional requirements Acting as … circuit and arc flash studies is desirable Proven track record of electrical optimization and efficiency improvements, preferably in Data Centers Excellent stakeholder management and communication skills Ability to conduct root-causeanalysis and implement lessons learned Experience in developing and delivering training programs Familiarity with SOP/EOP development and the CAB process Ability to act as More ❯
as they are rolling-up their sleeves and working on implementation. They have demonstrated the ability to build customized logistics solutions, drive process improvement, standardize procedures and eliminate the rootcause of defects within current systems. This job will require strong communication skills while having the ability to work independently in a high pressure, and often ambiguous, work … the team to uncover insights and patterns of defects that will help us build initiatives to reduce defects at scale Ability to pull data and perform deep-dives and root-causeanalysis as needed. Lead initiatives to identify and eliminate root causes of defects in order to reduce defects and improve the delivery experience for Amazon More ❯
Applications Team Foundation Server Jenkins Excellent communication skills at all levels Motivated and Proactive Superb analytical and research skills. Advantageous Skills and Knowledge: Ability to interpret and perform defect rootcauseanalysis on program code Database: Oracle, DB2, PL/SQL Knowledge of C C# Reporting skills, and ability to analyse metrics To be considered for shortlisting More ❯
WHS stakeholders for the pillar(s) that you directly support. • Establish incident investigation, escalation and reporting structures within your area of responsibility to ensure timely notification and collaboration on rootcause analysis. • Ensure risks of maintenance operations are assessed and the reduction measures are implemented, policies and requirements are effectively communicated, sites have the ability to ask questions … starting with our customer needs and partner with stakeholders to identify and mitigate high-risk safety hazards, ensuring maintenance operations remain safe. We analyse incidents, review KPIs, and conduct rootcause investigations to drive continuous improvement. Through training, policy development, and engagement initiatives, we influence safe behaviours and strengthen Amazon's safety culture. Our work ensures that risk … influence safe behaviours by analysing human factors and fostering a positive safety culture through engagement initiatives. A core part of our role is examining KPIs, incidents, and events, conducting rootcauseanalysis to ensure lessons learned are adopted. Where applicable, we use these insights to shape safety interventions and proactively mitigate risks. We support RME teams by More ❯