Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
teams to optimize data pipelines for AI/ML initiatives, automation, and productization Lead efforts to integrate security best practices, ensuring compliance with relevant regulations and standards Conduct performance analysis, capacity planning, and system tuning to maximize uptime and reliability Guide junior team members in troubleshooting techniques, documentation, and adherence to best practices Drive continuous improvement by reviewing existing … for secure system architecture Familiarity with data engineering concepts, including ETL/ELT pipelines, big data tools, and AI/ML workflows Ability to troubleshoot complex system issues, perform root-causeanalysis, and implement effective solutions Excellent communication, teamwork, and organizational skills, with a focus on innovation and continuous improvement One or more of the following certifications More ❯
Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
of data between systems by helping with Extract, Transform, Load (ETL) processes and ensuring data consistency across different platforms. Monitor and Troubleshoot Database Performance Issues - Identify potential bottlenecks, perform rootcauseanalysis, and work with senior architects to implement solutions that enhance database reliability and efficiency. Support Compliance and Regulatory Requirements - Ensure database structures and data management More ❯
Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
assessments and provide actionable recommendations for mitigation. Experience supporting security for data pipelines, AI/ML environments, or cloud-based infrastructures. Excellent incident response skills, including triage, containment, and rootcause analysis. Strong communication and collaboration abilities to partner with cross-functional teams and stakeholders. One or more of the following certifications are desired: Certified Cloud Security Professional More ❯
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including rootcauseanalysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working More ❯
San Antonio, Texas, United States Hybrid / WFH Options
BridgePhase, LLC
tracing-to support 24/7 mission awareness. Automate platform operations, including system provisioning, patching, and recovery, to reduce manual effort and increase uptime. Monitor system performance and lead rootcauseanalysis and incident response for infrastructure-related issues. Collaborate with development and cybersecurity teams to ensure deployments are secure, compliant, and aligned with COSC and DoD More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
BAE Systems (New)
hybrid and flexible working arrangements available. Please consult your recruiter for details. Grade: GG10 - GG11 Referral Bonus: £5,000 Job Description Serve as the point of escalation for intrusion analysis, forensics, and incident response queries. Provide rootcauseanalysis for complex, non-standard findings and anomalies without existing playbooks. Mentor team members and share knowledge proactively. … red team and pentest findings to improve detection rules. Provide forensic support and threat emulation to improve alert triage and accuracy. Identify gaps in SOC processes, data collection, and analysis, demonstrating the need for improvements through scenarios and red teaming. Perform complex threat hunting, automation, and analytic enrichment tasks. Set vision and milestones for emulation and detection capabilities, influencing More ❯
Reston, Virginia, United States Hybrid / WFH Options
CGI
methodologies, assumption, validation techniques and findings to align with regulatory expectations and internal governance standards Support the Funds Transfer Pricing and Enterprise Financial Analytics teams with any ad-hoc analysis projects or reporting Experience working within Capital Markets, Treasury or balance sheet management preferred Proficient in MS Excel technical skills, i.e. Python, R, SAS and using BI tools for … financial analysis desired Required qualifications to be successful in this role: 8-9 years of relevant experience Proficiency in Microsoft Excel; familiarity with Python, R, SAS, and BI tools (e.g., Power BI, Tableau) for financial analysis Strong experience in financial modeling, documentation, and regulatory compliance Experience in Capital Markets, Treasury, or balance sheet management Excellent planning and organizational … skills using tools like Microsoft Project Strong facilitation, communication, and relationship-building skills Ability to manage and coordinate project teams and resolve technical issues Skilled in process mapping, rootcauseanalysis, and structured problem-solving Familiarity with project management methodologies and risk management practices Education: Bachelors degree in Business, Computer Science, Information Systems, or a related field More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
and GCP , ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform rootcauseanalysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor More ❯
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
and GCP , ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform rootcauseanalysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor More ❯
Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
of data between systems by helping with Extract, Transform, Load (ETL) processes and ensuring data consistency across different platforms. Monitor and Troubleshoot Database Performance Issues - Identify potential bottlenecks, perform rootcauseanalysis, and work with senior architects to implement solutions that enhance database reliability and efficiency. Support Compliance and Regulatory Requirements - Ensure database structures and data management More ❯
hands-on role supporting high-availability systems, rapid deployments, and production incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform rootcauseanalysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes … Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with Splunk - Strong problem-solving and analytical thinking More ❯
Shrivenham, Oxfordshire, United Kingdom Hybrid / WFH Options
Gold Group
Collaborate with engineering teams to support unified access devices (UADs), endpoint management, and virtualized environments. * Provide hands-on support for automation scripts, workflows, and infrastructure monitoring tools. * Contribute to rootcauseanalysis efforts for recurring platform incidents. * Support capacity planning and performance optimization by analysing system usage and trends. * Offer feedback on tools and processes, identifying improvements More ❯
and legacy systems/technical debt activities Collaborate with Senior Engineers to improve delivery automation and enhance DevEx and self-servicing Aligns to effective incident response processes, helping with rootcauseanalysis and problem resolution during incident management sessions Take ownership and pride in the work you deliver, ensure what is delivered is of quality and takes More ❯
our tools and platforms Collaborate with the team to troubleshoot and resolve issues, shadowing and learning from Mid and Senior-level Engineers Aligns to incident response processes, helping with rootcauseanalysis and problem resolution during incident management sessions Take ownership and pride in the work delivered, ensure what is delivered is of quality and takes into More ❯
members, stakeholders, and customers. Manage major incident bridges with calmness and experience, ensuring timely resolution, formalized communication of impact, and minimal impact to the business. Drive Lessons Learned and RootCauseAnalysis (RCA) on all P1/P2 incidents and some business-impacting P3 incidents to prevent recurrence. Develop and maintain the strategy for Operational Support to More ❯
Newcastle Upon Tyne, United Kingdom Hybrid / WFH Options
NHS Business Services Authority
platforms, ensuring the availability and stability of NHSBSA services.o Carrying out proactive support activities, such as evaluation of performance, tuning and running backup/recovery schedules.o Providing troubleshooting and rootcauseanalysis to identify issues, understand underlying cause and suggest future improvements.o Evaluating and interpreting technical data to resolve complex issues when performance is impaired.o Maintaining … to clinicians, NHS bodies and citizens. 2. Carry out proactive support activities, such as evaluation of performance, tuning and running backup/recovery schedules. 3. Carry out troubleshooting and rootcauseanalysis to identify issues, understand their underlying cause and suggest improvements for the future. 4. Carry out impact analysis to understand how change will … roles: Understanding of DevOps concepts such as version control, test automation, continuous integration; continuous deployment; infrastructure as code, containerisation, and pipeline orchestration. A strong focus on customer service Technical rootcauseanalysis skills Self-motivated, with an ability to work independently as well as part of an effective team. Proactive Desirable Strong Knowledge of a variety of More ❯
San Antonio, Texas, United States Hybrid / WFH Options
BridgePhase, LLC
systems to identify anomalous or malicious activity. Support incident response activities by conducting initial investigations and escalating issues as needed. Lead investigations into high-priority security incidents, including malware analysis and reverse engineering to determine intent and impact, and provide rootcauseanalysis and remediation guidance to system teams. Leverage SIEM platforms and threat intelligence feeds … looking for analysts who are adaptable, curious, and eager to support cyber defense in a mission-focused environment. Preferred Experience and Qualifications: 3-5 years of experience in cybersecurity analysis or security operations, including defending AWS-hosted environments and Internet-facing web services. Hands-on experience with SIEM platforms, log analysis, and basic incident response techniques. Experience developing … detection content such as alerts, dashboards, and correlation rules to support threat monitoring. Familiarity with malware analysis and reverse engineering techniques to determine impact and intent. Ability to produce rootcauseanalysis reports and remediation guidance following security incidents. Understanding of common cybersecurity frameworks such as RMF, NIST SP 800-53, and DISA STIGs. Working knowledge More ❯
Arlington, Virginia, United States Hybrid / WFH Options
CGI
frameworks and metrics. - Assist in developing, tracking, and refining outcomes and driver metrics, including creating driver trees and updating functional and technical data definitions. - Support cross-functional teams with rootcauseanalysis, corrective actions, and process improvement initiatives. - Provide support for P2P forums, including preparing executive-level briefs and summaries and updating task management systems. - Monitor progress … for performance improvement initiatives through strategic communication and change management efforts. - Support cross-functional teams by applying process improvement tools and methodologies to address performance deficiencies and assist with rootcause analysis. - Benchmark and incorporate best practices from industry to recommend correction actions and implementation timelines. - Assist in creating workflows, dashboards, and analytics to optimize performance management activities. … improvement frameworks such as Change Management, Lean Six Sigma, Theory of Constraints, Agile or Scrum methodologies, and/or P2P. - Experience in developing and tracking metrics, driver trees, conducting cause-and-effect analysis, and reporting structures. - Proven ability to conduct rootcauseanalysis, recommend, and implement corrective action plans. - Exceptional written and verbal communication skills More ❯
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, rootcauseanalysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking rootcauseanalysis including prevention orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such More ❯
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, rootcauseanalysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking rootcauseanalysis including prevention orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such More ❯
Leicester, Leicestershire, United Kingdom Hybrid / WFH Options
Oliver James Associates Ltd
Key Responsibilities: Lead and manage the Application Support team in resolving incidents, service requests, and change requests. Serve as an escalation point for complex technical issues requiring in-depth analysis and resolution. Perform hands-on troubleshooting, rootcauseanalysis, and issue resolution using SQL and system diagnostics tools. Design and execute test cases for application upgrades More ❯
Huntsville, Alabama, United States Hybrid / WFH Options
Gridiron IT Solutions
and artifacts Experience with SIEM technologies, including Splunk, Microsoft Sentinel, or Elastic Experience with forensics tools, including Magnet Axiom and FTK Experience performing forensic imaging, remote collection, and forensic analysis Experience with malware analysis, including static, dynamic, and reverse engineering Experience performing rootcauseanalysis and following through with all phases of the incident response … lifecycle Top Secret clearance Bachelor's degree Additional Qualifications Experience acquiring memory from the host and performing memory analysis with tools, including Volatility Experience with Endpoint Detection and Response (EDR) tools, including CrowdStrike Falcon and FireEye HX Experience performing analysis of packet capture using tools, including Wireshark Experience with Python or PowerShell Experience performing Incident Response and Forensics More ❯
Oxfordshire, South East, United Kingdom Hybrid / WFH Options
Network IT
and critical platform services Develop and manage automation scripts and workflows using Ansible , Terraform , or PowerShell Collaborate with engineering teams to support infrastructure upgrades and issue resolution Contribute to rootcauseanalysis and implement preventative measures Document support procedures and maintain a comprehensive knowledge base Participate in on-call rotations and incident response efforts as needed Critical More ❯
Shrivenham, Swindon, Wiltshire, England, United Kingdom Hybrid / WFH Options
Network IT
and critical platform services Develop and manage automation scripts and workflows using Ansible , Terraform , or PowerShell Collaborate with engineering teams to support infrastructure upgrades and issue resolution Contribute to rootcauseanalysis and implement preventative measures Document support procedures and maintain a comprehensive knowledge base Participate in on-call rotations and incident response efforts as needed Critical More ❯
San Diego, California, United States Hybrid / WFH Options
SAIC
System life cycle software engineering support for our NAVWAR and NIWC customers. Focus primarily on software components loaded on supported tactical networks to sustain currently fielded CANES systems. Includes analysis and modification to replace commercial off the shelf (COTS) components that are end of support (EOS) modifications to support new interfaces to other systems, and modifications to existing systems … distance support, and emergent onsite Casualty Report (CASREP) support as needed for the warfighters. Engineering solutions will primarily be for Windows and Redhat/Linux based operating systems. Provide rootcauseanalysis and be comfortable recommending permanent configuration changes when necessary. Develop, integrate, test, debug, and tune complex Software solutions designed to satisfy customer requirements. Experience in … environment and is familiar with Agile practices. 2 yrs. of experience with scripting program languages and automation using PowerShell scripting and XML development. 2 years' experience in the design, analysis and support of local area networks. Current IAT Level II Compliant (Security +) and OS Cert. Would need to obtain within 6 months if does candidate does not currently More ❯