Helm charts, and pod definition. • Kubernetes Administration: Manage and configure Kubernetes clusters for high availability, scalability, and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective rootcauseanalysis and debugging techniques. • GPU Configuration and Support: Configure and optimize GPU resources using CUDA or other technologies for compute-intensive workloads. • Automated Testing and Deployment … Strong analytical and problem-solving mindset • Excellent verbal and written communication skills • Adaptability and a drive for continuous learning and improvement Bonus If You Have: • Understanding of RF signal analysis or satellite communications systems Engineer smarter, Build bolder. Apply now and learn more about our extensive benefits and customizable compensation packages More ❯
Burke, Virginia, United States Hybrid / WFH Options
ALTA IT Services
and ensure accurate data visualization. • Design and maintain dashboards for network health, application performance, and security insights. • Integrate LiveAction solutions with SIEMs, orchestration tools, and other monitoring platforms. • Conduct root-causeanalysis of performance bottlenecks, latency, and packet loss across large-scale networks. • Create custom reports and alerts for performance metrics, compliance monitoring, and capacity planning. • Assist … with an understanding of cybersecurity frameworks. Preferred Qualifications: • LiveAction product certifications (e.g., LiveAction Certified Professional). • Experience integrating LiveAction with Splunk, Elastic, or other SIEMs. • Familiarity with packet-level analysis tools like Wireshark or Riverbed. • Experience with scripting and automation (Python, Ansible, PowerShell). • Prior experience supporting agencies such as DoD, DHS, VA, or IC. More ❯
Loughborough, Leicestershire, East Midlands, United Kingdom Hybrid / WFH Options
Oscar Associates (UK) Limited
migration, working with project teams and senior stakeholders. Drive process improvement, automation, and security best practice using PowerShell, deployment tools (SCCM/Intune/PDQ), and cloud technologies. Conduct rootcauseanalysis for critical incidents and coordinate long-term resolutions. Develop and maintain comprehensive technical documentation, knowledge base articles, and asset records. Build strong relationships with senior … and best practice. Participate in an on-call rota (with enhanced allowance) to ensure 24/7 service continuity. Ideal Candidate: Significant 2nd/3rd line support or application analysis experience in a complex/regulated environment. Deep knowledge of Microsoft technologies: Windows Server, Active Directory, PowerShell, Exchange, SharePoint, Teams, Azure. Track record delivering major upgrades, migrations, or automation More ❯
test plans (ITPs), ensuring proper verification and validation of hardware components and systems against design requirements and customer specifications. Develop quality metrics and dashboards to monitor process health, drive rootcauseanalysis, and support strategic quality planning. Collaborate with Engineering and Production to evaluate design feasibility, manufacturability, and adherence to quality standards during reviews and milestone phases. … Skills: Demonstrated experience with ISO 9001 QMS, along with working knowledge of additional standards such as ISO 45001, 14001, 27001, 31000, and ITIL. Strong analytical skills with experience in rootcauseanalysis, corrective actions, and statistical process control (SPC) Proven leadership in guiding teams to adhere to internal procedures and government/industry regulations. Experience writing, revising More ❯
project progress, identifying and resolving issues and risks to ensure on-time and high-quality delivery. Responsible for prioritization of product backlog items based on input from stakeholders, market analysis, and user feedback Provide clarity and guidance to the development and run teams on product requirements, acceptance criteria, service level objectives and desired outcomes. Drive a culture of continuous … improvement by implementing best practices, fostering innovation, and promoting experimentation within the value stream. Lead and facilitate regular retrospectives, rootcauseanalysis, and problem-solving sessions to identify improvement opportunities and drive actionable outcomes. Build and maintain strong relationships with key senior RGM & TPM stakeholders, including business leaders, customers, and partners. Communicate value stream performance, challenges, and More ❯
talented team that enjoys exploring and designing new technologies. Aufgaben Lead problem management efforts, including external moderation and team improvements Handle P1/P2 incident management and conduct 8D rootcauseanalysis and lessons learned Oversee a specific cost center regarding forecasting, purchase orders, and bills Ensure proper cost center separation and adherence to SLAs Manage upgrades More ❯
on-prem environments. What Youll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
who is passionate about discovering and solving complicated problems, learning complex systems, working with numbers, and organizing and communicating data and reports. The Research Analyst will perform Big data analysis to identify patterns, train model to generate product to product relationship and product to brand & model relationship. The Research Analyst is also expected to continuously improve the ML/… Research Analyst should be able to write clear and detailed functional specifications based on business requirements. Key job responsibilities • Scoping, driving and delivering complex projects across multiple teams. • Performs rootcauseanalysis by understanding the data need, get data/pull the data and analyze it to form the hypothesis and validate it using data. • Build programs … team(s)/partners to meet program and/or product goals. • Coordinates design effort between internal team and External team to develop optimal solutions. • Performs supporting research, conduct analysis of the bigger part of the projects and effectively interpret reports to identify opportunities, optimize processes, and implement changes. • Ability to convince and interact with stakeholders at all level More ❯
who is passionate about discovering and solving complicated problems, learning complex systems, working with numbers, and organizing and communicating data and reports. In compatibility program, RA perform Big data analysis to identify patterns, train model to generate product to product relationship and product to brand & model relationship. RA also continuously improve the ML solution for higher solution accuracy, efficiency … and detailed functional specifications based on business requirements as well as writes and reviews business cases. Key job responsibilities • Scoping, driving and delivering complex projects across multiple teams. • Performs rootcauseanalysis by understand the data need, get data/pull the data and analyze it to form the hypothesis and validate it using data. • Conducting a … thorough analysis of large datasets to identify patterns, trends, and insights that can inform the development of NLP applications. • Developing and implementing machine learning models and deep learning architectures to improve NLP systems. • Designing and implementing core NLP tasks such as named entity recognition, classification and part-of-speech tagging. • Dive deep to drive product pilots, build and analyze More ❯
Ansible, Terraform, GitHub Operate and support Kubernetes (Tanzu preferred), VMware, and Cisco UCS environments Administer Linux (RHEL 7-9), storage (NFS, object), and backup systems (CommVault) Handle incident response, rootcauseanalysis, and continuous improvement Collaborate across teams to align with DevOps best practices Contribute to infrastructure standards, documentation, and disaster recovery planning Preferred Experience: 5+ years More ❯
administration (Windows 10/11). Proven experience leading and executing Tanium solution deployments. Hands-on experience with endpoint lifecycle management, patching, and compliance. Strong analytical, problem-solving, and rootcauseanalysis skills. Ability to work independently and within distributed teams. Excellent oral, written, and interpersonal communication skills. Detail-oriented with strong organizational and task management capabilities. More ❯
Weybridge, Surrey, England, United Kingdom Hybrid / WFH Options
Proactive Appointments
to ensure technical feasibility and alignment with business goals.• Create data models, user journeys, UI/UX Design, process flows, and system diagrams to support solution design.• Conduct gap analysis, impact assessments, and rootcause analysis.• Support QA and UAT by defining acceptance criteria, test cases and validating outcomes. Technical Business Analyst Due to the volume of More ❯
rotations to support critical systems and address incidents. Collaborate with development teams to ensure that reliability concerns are addressed throughout the software lifecycle. Lead post-incident reviews and drive rootcauseanalysis and follow-up actions to mitigate future incidents. Collaboration and Planning: Work with cross-functional teams to understand requirements and develop effective testing strategies. Participate More ❯
fixed income data model to deliver a consistent, cross-platform client experience. Collaborate with Product, Engineering, Ontologists, and Fixed Income SMEs to co-design an interconnected data model supporting analysis across multiple datasets. Translate business and product requirements into clear, maintainable data modelling artifacts. Define and document metadata standards, entity relationships, and model schemas to support semantic alignment and … and maintain metadata inventories. Communicate data modelling requirements to stakeholders, and drive alignment across metadata/modelling functions to ensure practices are well understood & followed. Perform data profiling and rootcauseanalysis to guide objective, data-driven modelling decisions. Promote FAIR data principles across the modelling lifecycle. You'll need to have: Please note we use years More ❯
automated testing is an integral part of the IT delivery process Execute automated test scripts, analyse test results, and report defects, providing accurate and detailed information to aid in rootcauseanalysis and issue resolution Actively support the control environment, maintaining control effectiveness across existing controls, being mindful of emergent risks across IT Delivery functions Ensure Testing More ❯
as STRIDE, PASTA, MITRE ATT&CK, and DREAD. Build and refine detection and response capabilities using logs, alerts, and behavioral signals. Lead or support incident response activities, including log analysis, querying, forensic investigation, threat mitigation, and rootcause analysis. Conduct internal security reviews, network scans, and targeted penetration tests of applications and infrastructure using common security tooling … architectures and modern frameworks (e.g., Django, Node.js , React). Expert-level scripting and automation skills (e.g., Python, Bash, PowerShell) for workflow automation, tooling, and log analysis. Proficient in log analysis, SIEM usage/configuration, threat hunting, and querying tools to support detection and response. Familiarity with static and dynamic analysis techniques and vulnerability mitigation. Strong understanding of modern … automation platforms. Prior experience driving security engineering for a SaaS-based company. Experience leveraging automation or AI/ML tools to improve secure development, detection, incident response, or code analysis workflows. Benefits: (US-ONLY) 100% of medical, dental, and vision covered including 75% for dependents Flexible vacation days and quarterly mental health days so you can recharge Enjoy a More ❯
Doncaster, Yorkshire, United Kingdom Hybrid / WFH Options
Hiya Technology Ltd
and Reporting: Define and track key quality metrics to measure the effectiveness of QA processes. Provide regular reports and updates on testing progress, coverage, and results to stakeholders. Conduct rootcauseanalysis on defects, working closely with development teams to resolve issues and prevent recurrence. Skills and Experience of a Lead QA Analyst; Experience in quality assurance More ❯
Central London, London, United Kingdom Hybrid / WFH Options
Halian Technology Limited
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
capacity, and availability - and planning smart enhancements. Supporting compliance with SOx, audit and security standards such as ISO27001 and Cyber Essentials. Investigating and resolving incidents, supporting users, and ensuring rootcauseanalysis is actioned. Mentoring junior colleagues and shaping the multi-year IT strategy with your SAP expertise. What we're looking for: Proven experience with SAP More ❯
and dashboards to identify potential control failures as part of the control testing process. Ensure the accuracy and timely completion of control testing, providing peer review. Document findings, including rootcauseanalysis and applicable recommendations for remediation. Be the primary liaison with partners, delivering clear progress updates and results. Contribute lessons learned by integrating partner feedback to … and organizational requirements. Experience applying governance, risk, and control principles. Experience in automated and manual testing of security controls. Experience facilitating meetings and conveying complex ideas. Data collection, validation, analysis, and interpretation. Experience Researching and applying latest technologies. Experience with Agile methodology. Big 4 accounting experience. Hold a professional certification such as CISA, CISM, CISSP, PCI QSA, ISO More ❯
Newport, Gwent, Wales, United Kingdom Hybrid / WFH Options
Yolk Recruitment
leadership role where you'll take ownership of incident and problem management across a critical national infrastructure environment. You'll oversee the governance of best practice frameworks, ensuring timely rootcauseanalysis and preventative actions, while leading a collaborative team and influencing service delivery across a multi-vendor landscape. This is an opportunity to create tangible improvements More ❯
Strong understanding of distributed systems, fault tolerant design, and high availability architectures. Knowledge of CI/CD pipelines and infrastructure as code tools (Terraform, Ansible, CloudFormation). Experience in rootcauseanalysis and implementing systemic improvements. Preferred: Significant experience with UX/UI writing or design Knowledge of regulatory standards and compliance (e.g., PCI DSS, HIPAA). More ❯
Storage (Block, Object, SQL, NOSQL) • Authentication, Authorisation, Identity Platforms • Information Security, Privacy and Regulatory Compliance • Performance Tuning, Hardening and Troubleshooting • Problem Solving Skills to Methodically Find Faults and perform RootCauseAnalysis • Able to evaluate multiple courses of action, achieving goals by non-standard means if necessary • System Regression • Protocol Analysis • Load Testing • Availability and Resilience More ❯
smarter operations across the nation. Service Delivery Manager - What you'll do: Manage incident and service escalations, ensuring SLA and KPI adherence Conduct daily reviews of closed incidents and rootcauseanalysis of SLA failures Maintain accurate, up-to-date service documentation Create and deliver customer SLA performance reports Identify and implement service improvement initiatives with internal More ❯
in Human Capital Management (HCM), specifically within Workforce Management modules. The ideal candidate will serve as the team's T&L subject matter expert and be responsible for requirements analysis, fit/gap, configuration, customization designs, development, testing, user training, and team knowledge transfer. This techno-functional candidate will work with the existing HCM team to identify and report … Work closely with functional teams - HR, Benefits, and Payroll to understand system requirements and ensure system stability and performance Consider impact of application upgrades, patches, and system enhancements Perform rootcauseanalysis and resolve issues related to Time and Labor data processing, approvals, and configurations Collaborate with cross-functional teams to improve system functionality and streamline business More ❯