pipeline monitoring, alerting, and logging to detect failures and performance bottlenecks. Build automation to ensure data quality, lineage tracking, and schema evolution management. Participate in incident response, troubleshooting, and root cause analysis for data issues. Advocate for DataOps best practices, driving automation, reproducibility, and scalability. Document infrastructure, data workflows, and operational procedures. What are we looking for …
opportunities to influence areas throughout the organization. Responsibilities: - Measure and improve vendor on-time performance, reduce operations defects, and improve systems to streamline operations between vendors and Amazon. - Drive root-cause analysis and reporting on operations issues, develop action plans, and project-manage improvements. - Collaborate with internal logistics teams to achieve best-in-class delivery time and …
for network configurations, procedures, and policies. • Plan and execute network and virtualization projects, ensuring timely delivery and adherence to budget and quality standards. • Responsible for troubleshooting network incidents, providing root cause analysis, and documenting information in the ticketing system (ServiceNow) and knowledge repositories (MS SharePoint). • Willing to work nights for network Change Request (CR) implementations and be …
is responsible for support, administration, monitoring, troubleshooting and participating in various projects to enhance service and productivity. Job Responsibilities Provide L3+ support to delivery teams on escalated issues. Generate root-cause analysis related to critical incidents and problems. Engage with vendor on getting support, updates, and planning as required. Take ownership of designated technologies covering Citrix environment …
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
respond to changes in system behaviour as they arise. Support and troubleshooting: Second and third-line support, responding directly to business issues and questions. Problem escalation and incident response. Root cause analysis and proactive problem solving. The team is empowered to deploy changes in response to arising requirements. Business-as-usual maintenance: Use of automation tools and …
london (city of london), south east england, United Kingdom
Circana
is responsible for support, administration, monitoring, troubleshooting and participating in various projects to enhance service and productivity. Job Responsibilities Provide L3+ support to delivery teams on escalated issues. Generate root-cause analysis related to critical incidents and problems. Engage with vendor on getting support, updates, and planning as required. Take ownership of designated technologies covering Citrix environment …
mitigation strategies. Develop and deliver real-time risk dashboards and MI for senior leadership and governance committees. Conduct control testing and assurance activities on design and operational effectiveness. Drive root cause analysis of operational incidents and ensure control enhancements are implemented. Collaborate with internal audit, compliance, and technology on cross-functional risk initiatives. Prepare high-quality risk …
vulnerabilities are exploited or identified in real-time. Work with relevant teams to contain and mitigate security breaches, ensuring minimal impact on the business. Develop post-incident reports, including root cause analysis and remediation strategies. Security Strategy & Improvement: Stay up-to-date on the latest security trends, tools, techniques, and frameworks. Continuously evaluate and improve the organisation …
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Just Eat Takeaway.com
Windows services (IIS, etc.) Support of Containerisation technologies Configuration Management tooling such as Ansible, Puppet or Chef Interest in working within an Agile/Scrum environment Problem Solving and root cause analysis of issues in complex environments At JET, this is on the menu: Our teams forge connections internally and work with some of the best-known …
Response Ace: When things get a bit wobbly, you'll be on the front lines, resolving incidents fast to minimize downtime. After the dust settles, you'll lead the root cause analysis to prevent similar issues from popping up again. Automation Whizz: Got a repetitive task? You'll be the one automating it away! From environment setup …
and maintain quality management system documentation Produce and maintain Quality Management Plans for new and existing projects Support project non-conformance recovery and closure through defined QMS procedures. Facilitate root cause analysis and corrective action implementation, driving timely completion of actions. Review and approve supplier First Article Inspection Reports and compile internal First Article Inspection Reports in …
Doncaster, Yorkshire, United Kingdom Hybrid / WFH Options
DFS Furniture PLC
manage automation scripts to streamline processes and reduce manual effort. Collaborate with development, data, and security teams to understand and meet their platform requirements. Lead incident resolution efforts, including root cause analysis, fixes, documentation, and preventative solutions. Apply strong analytical and problem-solving skills to resolve complex technical issues. Monitor and manage platform performance, ensuring high standards …
a plus, but not required High proficiency with at least one programming/scripting language (e.g., Go, Python, C) and ability to learn additional languages quickly Ability to perform root cause analysis Strong verbal and written communication skills, including the ability to communicate effectively and efficiently with both coworkers and third-party vendors Strong collaboration skills with …
architecture will help you. What you'll be doing Diagnose and resolve complex application and infrastructure issues Participate in our 24x7 on-call rotation, SCRUM, and deployment planning Perform Root Cause Analysis (RCA) and provide recommendations for application teams Improve availability and reduce customer impact using industry-best observability tools Ensure best-practice and security-minded architecture …
hazard recognition, and incident prevention. Provide mentorship and oversight to site safety managers and coordinators; assist in hiring and training safety staff. Investigate incidents, near misses, and injuries; perform root cause analysis and recommend corrective actions. Collaborate with operations team members to develop project-specific accident prevention plans (APPs), activity hazard analyses (AHAs), and site-specific high …
project proposals and for the development of actionable performance measures, metrics, and standard operating procedures. Work with other contractor staff to develop design compliance performance measures and participate in root cause analysis of design compliance problems. Attend DoS and inter-bureau collaborative working groups as a Subject Matter Expert (SME) on DoS Technical Security Systems installation requirements …
quality and accurate reports for a wide range of stakeholders. Collaborate with Security Engineers and cross-functional teams to investigate and remediate large-scale security incidents. Support security incident root cause analysis, identify control gaps, and recommend mitigation strategies. Collaborate with cross-functional teams to drive improvements to security tools, policies and processes. Improve the effectiveness and …
Technical Support: Provide second-level technical support for complex IT issues escalated from the 1st level support team. Troubleshoot and resolve hardware, software, network, and system-related problems. Perform root cause analysis to identify underlying issues and implement appropriate solutions. IT Infrastructure Management: Manage and maintain the hotel's IT infrastructure, including servers, switches, routers, firewalls, and …
Incident Response Diagnose, analyse, and resolve network failures and recurring issues in LAN, WAN, and wireless environments. Respond to network-related incidents and service tickets within agreed SLAs. Conduct root cause analysis (RCA) for network outages and implement permanent fixes. Use network monitoring and diagnostic tools (e.g., Wireshark, SolarWinds, PRTG) to proactively detect and resolve performance bottlenecks. …
systems. Collaborate with engineering teams/OEMs on continuous improvement, upgrades, and technical documentation. Assist in the installation, integration, and commissioning of new ROV systems and subsystems. Conduct root cause analysis for critical failures and recommend corrective/preventive actions. We offer You can make your mark as Technical Support Engineer (Onshore) if you have: Minimum …
Produce and distribute clear, actionable reports and dashboards on: Planned vs Actual manned hours Shift fulfilment and attendance rates SLA performance across all contracts Provided vs Invoiced hours, with root cause analysis for variances Deliver reports daily (tactical), weekly (trend), and monthly (strategic/board level). Collaborate with the Data Manager and Invoicing Manager to ensure …
improvement and development of support processes. Day-to-day Diagnose and resolve high-priority, complex technical issues reported by customers, ensuring timely resolution and high customer satisfaction Conduct thorough root cause analysis of recurring issues to identify and implement preventive measures Ensure high ticket productivity while maintaining a minimal backlog Manage and prioritize incidents and service requests …
independently managing technical projects with minimal supervision. Excellent executive-level communication skills (written and verbal) for translating technical concepts into business outcomes. Strong troubleshooting and problem-solving abilities, including root cause analysis and resolution of complex technical issues. Ability to build and maintain relationships with both technical and non-technical stakeholders. Experience in working cross-functionally with …