Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
Maxwell Bond
resilient hybrid infrastructure solutions across Azure and traditional platforms. Collaborate with DevOps, SecOps, and development teams to support deployments and maintain secure, reliable environments. Support incident response and perform rootcauseanalysis of infrastructure-related issues. Contribute to disaster recovery and business continuity planning. Lead infrastructure product evaluations and take part in implementing new technologies. Ensure solutions More ❯
Washington, Washington DC, United States Hybrid / WFH Options
General Dynamics Information Technology
Make an Impact: Analyzes customer requirements and provides highly innovative technical expertise on cloud computing techniques, technologies, infrastructure, DevOps and related cloud architecture Recognized Subject Matter Expert in systems analysis Maintains current knowledge of relevant technology as assigned May serve as a team or task lead Work with project managers and developers to plan and implement database projects Systems … roles/features, or backing up system configurations) Monitor system performance, CPU, memory, disk usage and event logs via Azure Monitor or custom scripts Respond to alerts and conduct rootcauseanalysis for any downtime or performance degradation Maintain data backups and recovery procedures for critical systems Periodically test restore processes to ensure COOP and disaster recovery More ❯
Huntsville, Alabama, United States Hybrid / WFH Options
SAIC
proactively identify and resolve operational, tooling and process inefficiencies. On-Call after hours support may be required for critical systems. The candidate will collaborate with the customer to determine rootcauseanalysis and corrective actions. Key Responsibilities: Lead Red Hat Enterprise Linux (RHEL) administration and provide principal level leadership. Support IaaS environments with RHEL systems engineering, administration More ❯
checks to identify process defects Reporting Support the creation of routine reporting packs and dashboards for internal stakeholders, utilising and defining performance metrics - Service Level Agreements (SLAs) etc Conduct Analysis utilising tools such as Excel or PowerBI, to identify trends and opportunities for both system optimisation and improvement in operational performance Continuous Improvement - Operations process optimisation Proactively identify opportunities … generating and maintaining a knowledgeable Problem Solving Critically assess and collaboratively work alongside the function's operations team, managed service vendors and enterprise IT team to identify/support rootcauseanalysis and remediation of issues, incidents and escalation. Bridge the gap by translating business requirements to the Tech team and vice versa Vendor Management Maintain a More ❯
Accrington, England, United Kingdom Hybrid / WFH Options
World Options Ltd
governance across the UK operations and ensuring that every technology investment delivers tangible, measurable benefits that positively impact revenue, margin, and EBITDA. Key Responsibilities Requirements Management: Lead the collection, analysis, and prioritisation of functional and non-functional requirements across the three UK business units. Translate approved requirements into clear user stories, detailed acceptance criteria, and well-defined delivery plans … IT Manager. Establish and monitor effective Service Level Agreements (SLAs) and Operational Level Agreements (OLAs), curate a comprehensive knowledge base, measure user satisfaction (CSAT, NPS), and drive thorough incident root-cause analysis. Stakeholder Engagement & Communication: Act as a trusted advisor and key liaison for UK franchise partners, country management, and functional leads. Produce clear, data-driven status reports … UK IT Manager & Help Desk Team Development partners (internal & external) supporting UK systems UK Franchise partners & store owners Skills & Experience Proven track record of 7+ years in IT business analysis, product ownership, or IT governance roles, ideally within multi-site or franchise organisations operating in the UK. Demonstrable success in managing technology initiatives within complex, multi-platform environments (experience More ❯
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, rootcauseanalysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Tate Professional
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, rootcauseanalysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting More ❯
Be proficient in Linux server and system administration (e.g., package management, kernel updates, filesystems, volume management) Have experience managing containerized workloads using Docker or Kubernetes Be an expert in RootCauseAnalysis Have a strong desire to learn new skills and technologies, with proven research capabilities and adaptability Possess at least two years of experience training and More ❯
Waterwells Business Park, Quedgeley, Gloucester, Gloucestershire, England, United Kingdom Hybrid / WFH Options
IMT Resourcing Solutions
and infrastructure (RMS, mobile and CAD platforms). Key Responsibilities Validate, cleanse and enrich large, operational datasets; fix anomalies before they hit production. Profile data, uncover patterns and perform root-causeanalysis using T-SQL and BI/visualisation tools. Own data-quality KPIs (completeness, accuracy, timeliness) and present clear insights to stakeholders. Maintain data dictionaries, quality More ❯
Salisbury, Wiltshire, United Kingdom Hybrid / WFH Options
Sopra Steria Group
not limited to Cisco Routing, Switching, Security, SDN, Unified Communications and Wireless technologies). Identify and explore opportunities for enhancing efficiency, leveraging orchestration technologies to streamline and automate. Lead 'RootCauseAnalysis' investigations into network faults, security and performance issues. Support the Principal NetOps Engineer and Architects with project implementation. Liaise with third party service providers for More ❯
Chantilly, Virginia, United States Hybrid / WFH Options
Edgesource
like Grafana and Prometheus. Ensure comprehensive monitoring, logging, and alerting for all services. Reliability and Performance: Ensure high availability and performance of services. Conduct capacity planning, performance tuning, and rootcauseanalysis for incidents. Implement and maintain service level objectives (SLOs) and service level indicators (SLIs). Operational Excellence: Develop and enforce best practices for incident management More ❯
features. Authors and maintains comprehensive technical documentation including detailed system configurations, governance models, and operational procedures. Acts as a senior escalation point for Level 3/4 support, performing rootcauseanalysis and driving long-term resolution of complex issues. Manages the technical scope, delivery timelines, and risk mitigation strategies for cloud engineering initiatives. Tracks and reports More ❯
and operational health across critical payment systems. You will collaborate with our application support DevOps Labs to utilize observability platforms such as Splunk and Dynatrace for monitoring, incident response, rootcauseanalysis, and system performance optimization. About us We are an innovative bank committed to shaping finance as a force for good, empowering our people to innovate … SLAs and performance indicators. Knowledge of configuring alerts, reducing noise, and automating alert routing in collaboration with Technical Recovery Managers. Proficiency in Splunk SPL, report scheduling, app management, and rootcause analysis. Ability to conduct proactive system performance analysis and generate reports for stakeholders. Development of scripts for automation and integration of observability tools with CI/ More ❯
partnership with our application support DevOps Labs to deliver deep insights using observability platforms such as Splunk and Dynatrace. Your responsibilities will involve providing direct support for incident response, rootcauseanalysis, performance optimization, and system performance improvement! About us If you think all banks are the same, you'd be wrong. We're an innovative, fast … develop automated alert routing. Hold advanced knowledge of Splunk SPL, dashboard development, report scheduling and app management. Proficient in crafting service-level dashboards, setting up custom metrics, and conducting rootcauseanalysis using advanced technology. Analysis & Reporting: Conduct proactive analysis on system performance, availability and failures. Generate regular reports for senior stakeholders, summarising trends, anomalies More ❯
Roseville, California, United States Hybrid / WFH Options
LHH Recruitment Solutions
in system-level validation, test plan development, and hands-on debugging. Key Responsibilities Lead product validation activities, including test plan creation, execution, and failure analysis. Conduct performance characterization and root-causeanalysis of anomalies in ASIC and storage systems. Develop and maintain detailed test documentation, including test plans, matrices, and reports for new features and regression testing. …/programming (Python preferred, Bash, C/C++ also valuable). Solid understanding of system architecture, power management flows, and platform integration. Experience with debug tools and methodologies for root-cause analysis. Comfortable operating in Linux-based test environments. Preferred Qualifications Familiarity with security protocols such as TLS 1.3, IPsec, Secure Boot, Secure Firmware Download. Experience with Wireshark … or other traffic analysis tools. Background in ASIC validation or firmware/hardware integration. Strong automation skills and experience integrating tests into CI/CD pipelines. Prior experience working in Agile/Scrum teams. Why Work with Us? Opportunity to join a respected industry leader with a track record of innovation. Collaborative environment that values problem-solving and continuous More ❯
Leicester, Leicestershire, England, United Kingdom Hybrid / WFH Options
Oliver James
satisfaction.Key Responsibilities: Lead and manage the Application Support team in resolving incidents, service requests, and change requests. Serve as an escalation point for complex technical issues requiring in-depth analysis and resolution. Perform hands-on troubleshooting, rootcauseanalysis, and issue resolution using SQL and system diagnostics tools. Design and execute test cases for application upgrades More ❯
Burke, Virginia, United States Hybrid / WFH Options
ALTA IT Services
and ensure accurate data visualization. • Design and maintain dashboards for network health, application performance, and security insights. • Integrate LiveAction solutions with SIEMs, orchestration tools, and other monitoring platforms. • Conduct root-causeanalysis of performance bottlenecks, latency, and packet loss across large-scale networks. • Create custom reports and alerts for performance metrics, compliance monitoring, and capacity planning. • Assist … with an understanding of cybersecurity frameworks. Preferred Qualifications: • LiveAction product certifications (e.g., LiveAction Certified Professional). • Experience integrating LiveAction with Splunk, Elastic, or other SIEMs. • Familiarity with packet-level analysis tools like Wireshark or Riverbed. • Experience with scripting and automation (Python, Ansible, PowerShell). • Prior experience supporting agencies such as DoD, DHS, VA, or IC. More ❯
Loughborough, Leicestershire, East Midlands, United Kingdom Hybrid / WFH Options
Oscar Associates (UK) Limited
migration, working with project teams and senior stakeholders. Drive process improvement, automation, and security best practice using PowerShell, deployment tools (SCCM/Intune/PDQ), and cloud technologies. Conduct rootcauseanalysis for critical incidents and coordinate long-term resolutions. Develop and maintain comprehensive technical documentation, knowledge base articles, and asset records. Build strong relationships with senior … and best practice. Participate in an on-call rota (with enhanced allowance) to ensure 24/7 service continuity. Ideal Candidate: Significant 2nd/3rd line support or application analysis experience in a complex/regulated environment. Deep knowledge of Microsoft technologies: Windows Server, Active Directory, PowerShell, Exchange, SharePoint, Teams, Azure. Track record delivering major upgrades, migrations, or automation More ❯
Weybridge, Surrey, England, United Kingdom Hybrid / WFH Options
Proactive Appointments
to ensure technical feasibility and alignment with business goals.• Create data models, user journeys, UI/UX Design, process flows, and system diagrams to support solution design.• Conduct gap analysis, impact assessments, and rootcause analysis.• Support QA and UAT by defining acceptance criteria, test cases and validating outcomes. Technical Business Analyst Due to the volume of More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Agile Defense, Inc
required in Washington, DC. 5x per week SUMMARY This federal program has a Network Security Operations Center and requires a dedicated analyst to join the SOC team to perform analysis of cyber threats. Monitor and analyze network traffic, Intrusion Detection Systems (IDS), security events and logs and provide a technical resource and escalation point for tier 1 analyst. JOB … to be able to determine between false and true positives events, prioritizing them appropriately and see them through from end to end. Additionally, the candidate will perform or review rootcauseanalysis efforts following incident recovery. The candidate will compose security alert notifications and other communications on behalf of the SOC QUALIFICATIONS Education, Background, and Years of … PowerShell scripts) • Familiarity with Splunk Enterprise Security Strong understanding of networking (TCP Flags, TCP Handshake, IP addressing, Firewalls, Proxy, IDS, IPS) • Ability to perform Netflow/packet capture (PCAP) analysis • Experience with cyber threat hunting WORKING CONDITIONS Environmental Conditions • Schedule: Monday - Friday. Currently hybrid, remote and onsite with the expectation to be onsite 2-3 days a week in More ❯
as STRIDE, PASTA, MITRE ATT&CK, and DREAD. Build and refine detection and response capabilities using logs, alerts, and behavioral signals. Lead or support incident response activities, including log analysis, querying, forensic investigation, threat mitigation, and rootcause analysis. Conduct internal security reviews, network scans, and targeted penetration tests of applications and infrastructure using common security tooling … architectures and modern frameworks (e.g., Django, Node.js , React). Expert-level scripting and automation skills (e.g., Python, Bash, PowerShell) for workflow automation, tooling, and log analysis. Proficient in log analysis, SIEM usage/configuration, threat hunting, and querying tools to support detection and response. Familiarity with static and dynamic analysis techniques and vulnerability mitigation. Strong understanding of modern … automation platforms. Prior experience driving security engineering for a SaaS-based company. Experience leveraging automation or AI/ML tools to improve secure development, detection, incident response, or code analysis workflows. Benefits: (US-ONLY) 100% of medical, dental, and vision covered including 75% for dependents Flexible vacation days and quarterly mental health days so you can recharge Enjoy a More ❯
Chantilly, Virginia, United States Hybrid / WFH Options
Peraton
complexity from architecture and processes Configure and use state-of-the-art monitoring tools to gather insights and then act upon the results Conduct incident response and in-depth rootcauseanalysis This position is hands-on, requiring the ability to provide first-level system and network support and problem resolution identification Responsible for the monitoring the … daily software and network operations in a distributed environment Responsible for monitoring, working with users on fault isolation and resolution, as well as system analysis and reporting This job will include shift work to allow for complete 24x7 monitoring of software systems. Will need to have flexibility to work multiple shifts (day, mid, swing), as needed. Job is on More ❯
Doncaster, Yorkshire, United Kingdom Hybrid / WFH Options
Hiya Technology Ltd
and Reporting: Define and track key quality metrics to measure the effectiveness of QA processes. Provide regular reports and updates on testing progress, coverage, and results to stakeholders. Conduct rootcauseanalysis on defects, working closely with development teams to resolve issues and prevent recurrence. Skills and Experience of a Lead QA Analyst; Experience in quality assurance More ❯
Central London, London, United Kingdom Hybrid / WFH Options
Halian Technology Limited
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
and dashboards to identify potential control failures as part of the control testing process. Ensure the accuracy and timely completion of control testing, providing peer review. Document findings, including rootcauseanalysis and applicable recommendations for remediation. Be the primary liaison with partners, delivering clear progress updates and results. Contribute lessons learned by integrating partner feedback to … and organizational requirements. Experience applying governance, risk, and control principles. Experience in automated and manual testing of security controls. Experience facilitating meetings and conveying complex ideas. Data collection, validation, analysis, and interpretation. Experience Researching and applying latest technologies. Experience with Agile methodology. Big 4 accounting experience. Hold a professional certification such as CISA, CISM, CISSP, PCI QSA, ISO More ❯