Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
teams to optimize data pipelines for AI/ML initiatives, automation, and productization Lead efforts to integrate security best practices, ensuring compliance with relevant regulations and standards Conduct performance analysis, capacity planning, and system tuning to maximize uptime and reliability Guide junior team members in troubleshooting techniques, documentation, and adherence to best practices Drive continuous improvement by reviewing existing … for secure system architecture Familiarity with data engineering concepts, including ETL/ELT pipelines, big data tools, and AI/ML workflows Ability to troubleshoot complex system issues, perform root-cause analysis, and implement effective solutions Excellent communication, teamwork, and organizational skills, with a focus on innovation and continuous improvement One or more of the following certifications
maintain systems according to approved design. Service Delivery & Operations: Lead key service management processes (Continuity, Capacity, Availability). Attend incident/problem bridges as the subject matter expert. Review root cause analyses (RCAs) and oversee corrective actions. Provide accurate monthly service performance reports across IT and OT. Supplier & Financial Management: Lead and manage suppliers to meet agreed SLAs … change management experience. Ability to simplify complex network architecture for non-technical audiences. Desirable Technical Skills & Qualifications: Knowledge of network security technologies and strategic supplier management. Experience in stakeholder analysis and business case development. Familiarity with cloud integration (Azure and AWS). What's in it for you? Competitive salary up to £75,000 per annum, depending on experience
Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
of data between systems by helping with Extract, Transform, Load (ETL) processes and ensuring data consistency across different platforms. Monitor and Troubleshoot Database Performance Issues - Identify potential bottlenecks, perform root cause analysis, and work with senior architects to implement solutions that enhance database reliability and efficiency. Support Compliance and Regulatory Requirements - Ensure database structures and data management
Falls Church, Virginia, United States Hybrid / WFH Options
Epsilon Inc
assessments and provide actionable recommendations for mitigation. Experience supporting security for data pipelines, AI/ML environments, or cloud-based infrastructures. Excellent incident response skills, including triage, containment, and root cause analysis. Strong communication and collaboration abilities to partner with cross-functional teams and stakeholders. One or more of the following certifications are desired: Certified Cloud Security Professional
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including root cause analysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
BAE Systems (New)
hybrid and flexible working arrangements available. Please consult your recruiter for details. Grade: GG10 - GG11 Referral Bonus: £5,000 Job Description Serve as the point of escalation for intrusion analysis, forensics, and incident response queries. Provide root cause analysis for complex, non-standard findings and anomalies without existing playbooks. Mentor team members and share knowledge proactively. … red team and pentest findings to improve detection rules. Provide forensic support and threat emulation to improve alert triage and accuracy. Identify gaps in SOC processes, data collection, and analysis, demonstrating the need for improvements through scenarios and red teaming. Perform complex threat hunting, automation, and analytic enrichment tasks. Set vision and milestones for emulation and detection capabilities, influencing
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor
Reading, Berkshire, United Kingdom Hybrid / WFH Options
Pertemps
you'll be doing as a Senior Cyber Security Analyst: Security Incident Response: Investigate security alerts from SIEM and third-party MSSPs, triage and respond to incidents, and support root cause analysis to drive remediation. Stakeholder Engagement: Work closely with technology and business teams to communicate cyber risks, recommend actions, and ensure controls are proportionate and effective.
hands-on role supporting high-availability systems, rapid deployments, and production incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes … Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with Splunk - Strong problem-solving and analytical thinking
Milton Keynes, Buckinghamshire, South East, United Kingdom Hybrid / WFH Options
In Technology Group Limited
with IT and development teams to ensure secure system architecture and application development. Maintain and enhance incident response procedures and disaster recovery plans. Investigate and document security breaches, providing root cause analysis and remediation plans. Conduct security awareness training for staff and ensure compliance with internal policies and regulatory requirements (e.g., FCA, GDPR, ISO 27001). Stay
Shrivenham, Oxfordshire, United Kingdom Hybrid / WFH Options
Gold Group
Collaborate with engineering teams to support unified access devices (UADs), endpoint management, and virtualized environments. * Provide hands-on support for automation scripts, workflows, and infrastructure monitoring tools. * Contribute to root cause analysis efforts for recurring platform incidents. * Support capacity planning and performance optimization by analysing system usage and trends. * Offer feedback on tools and processes, identifying improvements
and legacy systems/technical debt activities Collaborate with Senior Engineers to improve delivery automation and enhance DevEx and self-servicing Aligns to effective incident response processes, helping with root cause analysis and problem resolution during incident management sessions Take ownership and pride in the work you deliver, ensure what is delivered is of quality and takes
our tools and platforms Collaborate with the team to troubleshoot and resolve issues, shadowing and learning from Mid and Senior-level Engineers Aligns to incident response processes, helping with root cause analysis and problem resolution during incident management sessions Take ownership and pride in the work delivered, ensure what is delivered is of quality and takes into
Newcastle Upon Tyne, United Kingdom Hybrid / WFH Options
NHS Business Services Authority
platforms, ensuring the availability and stability of NHSBSA services. Carrying out proactive support activities, such as evaluation of performance, tuning and running backup/recovery schedules. Providing troubleshooting and root cause analysis to identify issues, understand underlying cause and suggest future improvements. Evaluating and interpreting technical data to resolve complex issues when performance is impaired. Maintaining … to clinicians, NHS bodies and citizens. 2. Carry out proactive support activities, such as evaluation of performance, tuning and running backup/recovery schedules. 3. Carry out troubleshooting and root cause analysis to identify issues, understand their underlying cause and suggest improvements for the future. 4. Carry out impact analysis to understand how change will … roles: Understanding of DevOps concepts such as version control, test automation, continuous integration, continuous deployment, infrastructure as code, containerisation, and pipeline orchestration. A strong focus on customer service Technical root cause analysis skills Self-motivated, with an ability to work independently as well as part of an effective team. Proactive Desirable Strong Knowledge of a variety of
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking root cause analysis including prevention-orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such
Leicester, Leicestershire, United Kingdom Hybrid / WFH Options
Oliver James Associates Ltd
Key Responsibilities: Lead and manage the Application Support team in resolving incidents, service requests, and change requests. Serve as an escalation point for complex technical issues requiring in-depth analysis and resolution. Perform hands-on troubleshooting, root cause analysis, and issue resolution using SQL and system diagnostics tools. Design and execute test cases for application upgrades
Huntsville, Alabama, United States Hybrid / WFH Options
Gridiron IT Solutions
and artifacts Experience with SIEM technologies, including Splunk, Microsoft Sentinel, or Elastic Experience with forensics tools, including Magnet Axiom and FTK Experience performing forensic imaging, remote collection, and forensic analysis Experience with malware analysis, including static, dynamic, and reverse engineering Experience performing root cause analysis and following through with all phases of the incident response … lifecycle Top Secret clearance Bachelor's degree Additional Qualifications Experience acquiring memory from the host and performing memory analysis with tools, including Volatility Experience with Endpoint Detection and Response (EDR) tools, including CrowdStrike Falcon and FireEye HX Experience performing analysis of packet capture using tools, including Wireshark Experience with Python or PowerShell Experience performing Incident Response and Forensics
Oxfordshire, South East, United Kingdom Hybrid / WFH Options
Network IT
and critical platform services Develop and manage automation scripts and workflows using Ansible, Terraform, or PowerShell Collaborate with engineering teams to support infrastructure upgrades and issue resolution Contribute to root cause analysis and implement preventative measures Document support procedures and maintain a comprehensive knowledge base Participate in on-call rotations and incident response efforts as needed Critical
Shrivenham, Swindon, Wiltshire, England, United Kingdom Hybrid / WFH Options
Network IT
and critical platform services Develop and manage automation scripts and workflows using Ansible, Terraform, or PowerShell Collaborate with engineering teams to support infrastructure upgrades and issue resolution Contribute to root cause analysis and implement preventative measures Document support procedures and maintain a comprehensive knowledge base Participate in on-call rotations and incident response efforts as needed Critical
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
Maxwell Bond
resilient hybrid infrastructure solutions across Azure and traditional platforms. Collaborate with DevOps, SecOps, and development teams to support deployments and maintain secure, reliable environments. Support incident response and perform root cause analysis of infrastructure-related issues. Contribute to disaster recovery and business continuity planning. Lead infrastructure product evaluations and take part in implementing new technologies. Ensure solutions
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Interface Recruitment UK
setting Excellent communication skills, both written and verbal Ability to work independently and under pressure Self-motivated with good time management Experience in customer-facing roles Ability to perform root cause analysis Technical documentation skills Professional attitude Knowledge of: Microsoft Office 2010+ Windows Server 2016/2019 Active Directory Administration Microsoft Exchange Server Office 365 Desirable skills
checks to identify process defects Reporting Support the creation of routine reporting packs and dashboards for internal stakeholders, utilising and defining performance metrics - Service Level Agreements (SLAs) etc. Conduct Analysis utilising tools such as Excel or PowerBI, to identify trends and opportunities for both system optimisation and improvement in operational performance Continuous Improvement - Operations process optimisation Proactively identify opportunities … generating and maintaining a knowledgeable Problem Solving Critically assess and collaboratively work alongside the function's operations team, managed service vendors and enterprise IT team to identify/support root cause analysis and remediation of issues, incidents and escalation. Bridge the gap by translating business requirements to the Tech team and vice versa Vendor Management Maintain a