visibility into data quality metrics and trends. Collaborating closely with data engineering teams, you will design and implement automated data quality checks and monitoring processes. Additionally, you will perform rootcauseanalysis for data quality issues, identifying underlying causes and proposing effective solutions. You will also assist product teams in developing and executing test plans, cases, and More ❯
catalog. Implementation of digital solutions in shopfloor and leading PLC integration, cybersecurity processes Provide technical expertise on advanced automated manufacturing/production processes. Programme and troubleshoot PLC systems; conduct rootcauseanalysis on equipment issues. Use structured problem-solving techniques (e.g., 5 Whys, Kaizen) to drive improvements. Skills and attributes: Recognised Apprenticeship in Electrical, Instrumentation/Controls More ❯
Farringdon, Mansion House. The impact you will have: Supporting Senior members of the Group ERM team in driving the ERM agenda; Support product, outsourcing and commercial risks assessments; and rootcauseanalysis resulting from operational incidents; Support the maintenance of the firm's risk registers and Governance, Risk and Compliance (GRC) system; Submission of reports to Regulators More ❯
Due Diligence Prior experience in a Quality Control or Quality Assurance function within AML/KYC/EDD operations, with a strong understanding of review frameworks, error typologies, and rootcauseanalysis to ensure consistent adherence to regulatory and internal quality standards (preferred) Excellent communication skills across all levels of seniority Strong interpersonal skills, with good relationship More ❯
deliver new features and capabilities using AI/ML, preparing estimates for upcoming deliverables, document proposed solutions, reviewing code of other members, writing well structured and optimized code, performing rootcauseanalysis on operational events, providing project updates to leadership and other team members. This position involves on-call responsibilities. As part of this team you will More ❯
contact and subject matter expert (SME) for TMS-related inquiries and escalations. Collaborate closely with Compliance and other key stakeholders to resolve case ambiguities and drive process clarity. Conduct RootCause Analyses (RCA) based on QC/QA findings and implement continuous improvement initiatives. Drive operational excellence through process optimization and cross-functional program management, including coordination with … Strong interpersonal, analytical, and communications (verbal and written) skills Excellent grasp of English grammar and strong English language writing skills Nice to haves: Proficient in Google Suite and blockchain analysis (e.g. Chainalysis). G2791 Pay Transparency Notice: The target annual salary for this position can range as detailed below. Full time offers from Coinbase also include target bonus + More ❯
of SLIs/SLOs, alert tuning, dashboard creation). Have proven experience leading incident response for high severity outages or service disruptions. Able to perform real time incident coordination, rootcauseanalysis, and drive follow ups (post mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis. Have strong skills More ❯
across different regions, requiring an understanding of diverse cultural perspectives and effective communication in a global environment. • You possess a solid understanding of support concepts such as ticketing systems, rootcauseanalysis, and task prioritization, ensuring that all issues are tracked and resolved in a timely manner. Key job responsibilities As a Global Field Operations Command Center More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. - Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). - Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. - Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). - Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
leadership, customer engagement, and lifecycle product management. You will engage in a wide range of activities-from evaluating COTS components and system-level architecture to managing risk and leading rootcause analyses for fielded systems. You'll work closely with stakeholders across Program Management, Business Development, Manufacturing, Engineering, and Navy customer organizations. Responsibilities: Provide technical leadership for the … Procedures. Review and interpret customer requirements and specifications, leading the team through requirements decomposition, traceability, and validation. Participate in and lead Design Reviews (PDR, CDR), Failure Mode and Effects Analysis (FMEA), and formal verification/validation test events. Identify obsolescence risks and develop mitigation strategies such as alternative component evaluations, upgrades, or redesigns. Assist in developing and delivering technical More ❯
emergency services on a variety of critical systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc. - Perform rootcauseanalysis of equipment failures, and troubleshooting of facility within internal Service Level Agreements (SLA). - Provide training and guidance to Engineering Operations Technicians and responsible for More ❯
and result logging. Understanding of testing life cycle. Collaborate with development, business, and QA teams to understand requirements and ensure test coverage. Understanding of defect life cycle. Experience of rootcause analysis. Good stakeholder management skills Familiarity with Agile and DevOps methodologies. Strong Interpersonal Skills - Mandatory More ❯
functional teams to gather requirements, define project scopes, and deliver storage solutions aligned with business objectives. Provide technical expertise and support for storage-related issues, including incident resolution and rootcause analysis. Develop and maintain documentation, including system configurations, procedures, and operational guidelines. Implement and enforce data management policies, ensuring compliance with regulatory requirements and internal standards. Stay More ❯
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid / WFH Options
Reed
ensuring alignment with business and technical requirements. Support smooth transitions from development to operations, ensuring operational readiness. Provide ongoing monitoring, troubleshooting, and optimisation of systems, including incident resolution and rootcause analysis. Required Skills & Qualifications: Previous experience as a DevOps engineer, designing, implementing, and maintaining scalable, secure, and reliable cloud solutions. Proficiency with Azure DevOps for coding and More ❯
Use of Force policy, and meet all federal, state, and local requirements for armed security personnel prior to starting Services under this SOW. • Escalates and documents security incidents, conducts rootcause analyses and after-action reports. QUALIFICATIONS: • US Citizenship. • Minimum of 18 years of age. • High School Diploma or GED. • Impeccable customer service skills. • Excellent verbal communication skills More ❯
working with System Integrators & partner resources. Conduct threat modeling and risk assessments on network infrastructure and recommend mitigations. Support incident response teams during network-related security incidents and perform rootcause analysis. Evaluate and recommend security tools and technologies, and stay informed on emerging threats and vulnerabilities. Required Skills and Qualifications: Proven experience as a Network Security Architect More ❯
as Order Gateway, Execution Algo, Market Data, Risk, and Middle Office. Help monitor and improve test coverage across services. Assist in bug reproduction, investigation, and work with developers on rootcause analysis. Participate in regular test cycle planning and release validation processes. Requirements Must-Have Skills Basic understanding of crypto trading workflows, including exchange matching, order lifecycle, and More ❯
alarms to support ITSM performance, ensuring resolutions are within SLA. Build and nurture strong relationships both internally and externally to enhance service delivery for our customers. Complete and document RootCause Analyses (RCAs) and Post Incident Reviews (PIRs), recommending improvements where necessary. Contribute to ITSM-driven initiatives, collaborating as a chapter to implement positive changes. Create and maintain More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Pontoon
alarms to support ITSM performance, ensuring resolutions are within SLA. Build and nurture strong relationships both internally and externally to enhance service delivery for our customers. Complete and document RootCause Analyses (RCAs) and Post Incident Reviews (PIRs), recommending improvements where necessary. Contribute to ITSM-driven initiatives, collaborating as a chapter to implement positive changes. Create and maintain More ❯
tools across diverse hardware and software environments. Create and maintain feature pipelines, feature stores, and model stores. Develop frameworks to enable scalable, reproducible research. Proactively troubleshoot performance bottlenecks, conduct root-cause analyses, and implement solutions to optimize GPU or CPU resource usage. Qualifications: Bachelor's or advanced degree in Computer Science, Machine Learning, Artificial Intelligence, or a related More ❯