talented team that enjoys exploring and designing new technologies. Aufgaben Lead problem management efforts, including external moderation and team improvements Handle P1/P2 incident management and conduct 8D rootcauseanalysis and lessons learned Oversee a specific cost center regarding forecasting, purchase orders, and bills Ensure proper cost center separation and adherence to SLAs Manage upgrades More ❯
on-prem environments. What Youll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
RMF) Understanding of NIST Cybersecurity Framework (CSF) Knowledge of information assurance, cybersecurity, and privacy policies disciplines and methodologies Understanding of CSAM reporting and controls management Comprehensive understanding of Systems Analysis, Business Analysis, and Business Intelligence principles Ability to design, manage, and deploy data systems, reports, and dynamic dashboards Knowledge of specialized Business Intelligence software, SQL query language, Microsoft … Power BI, Hyperion, SQL Reporting Services preferred) Proficiency in Microsoft tools: Word, Excel, Project, PowerPoint, and Visio Ability to present analytic findings, extract data from multiple sources, and conduct rootcauseanalysis Excellent oral and written communication skills Work Location Primary work location: 1200 New Jersey Ave SE, Washington, DC 20590 (onsite) Situational telework may be approved More ❯
Ansible, Terraform, GitHub Operate and support Kubernetes (Tanzu preferred), VMware, and Cisco UCS environments Administer Linux (RHEL 7-9), storage (NFS, object), and backup systems (CommVault) Handle incident response, rootcauseanalysis, and continuous improvement Collaborate across teams to align with DevOps best practices Contribute to infrastructure standards, documentation, and disaster recovery planning Preferred Experience: 5+ years More ❯
the implementation, troubleshooting and maintenance of IT systems. Rapidly distinguish isolated user problems from enterprise-wide application/system problems. Coordinate with customers and stakeholders to collect data, conduct analysis, develop, and implement solutions associated with incident tickets and requirements. Develop solutions to complex technical issues. Provide follow-up reports (technical findings, feedback, resolution steps taken) for RootCauseanalysis, engineering technical assessment and process improvement initiatives. Support customer requirements in a 24/7/365 environment and be able to provide on-call support during outages occurring after hours. Update operations and monitoring documentation for 24/7/365 Operations Watch personnel. Successful candidates must possess the following skills: Foundational knowledge of … or MacOS Experience with Scripting/Automation through Linux Shell scripts, PowerShell, PowerCLI, or other scripting languages Experience troubleshooting issues in a growing environment Experience with log reviews, incident analysis, and identification of issue trends Strong experience with server patch management methodologies Time management skills with the ability to work within an IT Service Management/ticketing system (ServiceNow More ❯
administration (Windows 10/11). Proven experience leading and executing Tanium solution deployments. Hands-on experience with endpoint lifecycle management, patching, and compliance. Strong analytical, problem-solving, and rootcauseanalysis skills. Ability to work independently and within distributed teams. Excellent oral, written, and interpersonal communication skills. Detail-oriented with strong organizational and task management capabilities. More ❯
Weybridge, Surrey, England, United Kingdom Hybrid / WFH Options
Proactive Appointments
to ensure technical feasibility and alignment with business goals.• Create data models, user journeys, UI/UX Design, process flows, and system diagrams to support solution design.• Conduct gap analysis, impact assessments, and rootcause analysis.• Support QA and UAT by defining acceptance criteria, test cases and validating outcomes. Technical Business Analyst Due to the volume of More ❯
Assist in the development and documentation of technical standards and ticketing system Partner with others to deliver and maintain configuration that supports Cybersecurity Maturity Model Certification (CMMC) compliance Perform rootcauseanalysis for outages and other business impacting issues Oversee advanced/tier 3 complicated, urgent and high priority support issues, incidents and problems Act as escalation More ❯
fixed income data model to deliver a consistent, cross-platform client experience. Collaborate with Product, Engineering, Ontologists, and Fixed Income SMEs to co-design an interconnected data model supporting analysis across multiple datasets. Translate business and product requirements into clear, maintainable data modelling artifacts. Define and document metadata standards, entity relationships, and model schemas to support semantic alignment and … and maintain metadata inventories. Communicate data modelling requirements to stakeholders, and drive alignment across metadata/modelling functions to ensure practices are well understood & followed. Perform data profiling and rootcauseanalysis to guide objective, data-driven modelling decisions. Promote FAIR data principles across the modelling lifecycle. You'll need to have: Please note we use years More ❯
TCP/IP fundamentals (e.g. DNS, FTP, SSH, ACL, VLAN, DMZ, BGP), general networking technologies, network architecture and connectivity troubleshooting -Ability to diagnose and resolve complex network issues, perform rootcauseanalysis, and implement preventive measures to ensure continuous network performance. -Oversee daily monitoring and maintenance of network performance, using network management tools to detect, mitigate, and … Ex. ASRs, ISRs, Catalysts, NXOS, Nexus -Ability to work weekends and evening hours as needed -Ability to work independently with little direction and guidance -Thorough experience performing packet capture analysis or running Wireshark is highly desirable. -Review and/or approve Network engineering documentation to ensure that processes and specifications meet Network needs and are accurate, comprehensive, and complete More ❯
and analyze security events. Investigate and escalate security incidents, including malware infections, phishing attempts, and unauthorized access. Respond to and mitigate cybersecurity incidents following established incident response protocols. Perform rootcauseanalysis of security breaches and recommend remediation strategies. Coordinate with internal and external stakeholders to contain and recover from incidents. Conduct vulnerability scans using tools like … assessments and security audits of systems, applications, and networks. Support security accreditation and certification processes. Validate system configurations and ensure alignment with organizational cybersecurity policies. Document incident reports, threat analysis findings, and remediation steps. Prepare and deliver security metrics, reports, and dashboards for leadership and stakeholders. Maintain detailed records of security operations to support audits and compliance. Assist in More ❯
vulnerabilities or known flaws to ensure that critical missions are resilient to cyber exploits and attacks. Implement coding foundation in various languages to create tools and techniques, perform code analysis, conduct code manipulation and develop coding solutions tailored to the area of need. Offensive development responsibilities include vulnerability research and analysis, reversing engineering threats to determine methods of … alerts and events using Security Information and Event Management (SIEM) tools. Investigate and respond to security incidents, including malware infections, phishing attempts, and unauthorized access attempts. Assist in conducting rootcauseanalysis (RCA) and implementing corrective actions to prevent future incidents. Collaborate with the security operations center (SOC) to escalate and resolve complex security events. Conduct regular More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Agile Defense, Inc
required in Washington, DC. 5x per week SUMMARY This federal program has a Network Security Operations Center and requires a dedicated analyst to join the SOC team to perform analysis of cyber threats. Monitor and analyze network traffic, Intrusion Detection Systems (IDS), security events and logs and provide a technical resource and escalation point for tier 1 analyst. JOB … to be able to determine between false and true positives events, prioritizing them appropriately and see them through from end to end. Additionally, the candidate will perform or review rootcauseanalysis efforts following incident recovery. The candidate will compose security alert notifications and other communications on behalf of the SOC QUALIFICATIONS Education, Background, and Years of … PowerShell scripts) • Familiarity with Splunk Enterprise Security Strong understanding of networking (TCP Flags, TCP Handshake, IP addressing, Firewalls, Proxy, IDS, IPS) • Ability to perform Netflow/packet capture (PCAP) analysis • Experience with cyber threat hunting WORKING CONDITIONS Environmental Conditions • Schedule: Monday - Friday. Currently hybrid, remote and onsite with the expectation to be onsite 2-3 days a week in More ❯
as STRIDE, PASTA, MITRE ATT&CK, and DREAD. Build and refine detection and response capabilities using logs, alerts, and behavioral signals. Lead or support incident response activities, including log analysis, querying, forensic investigation, threat mitigation, and rootcause analysis. Conduct internal security reviews, network scans, and targeted penetration tests of applications and infrastructure using common security tooling … architectures and modern frameworks (e.g., Django, Node.js , React). Expert-level scripting and automation skills (e.g., Python, Bash, PowerShell) for workflow automation, tooling, and log analysis. Proficient in log analysis, SIEM usage/configuration, threat hunting, and querying tools to support detection and response. Familiarity with static and dynamic analysis techniques and vulnerability mitigation. Strong understanding of modern … automation platforms. Prior experience driving security engineering for a SaaS-based company. Experience leveraging automation or AI/ML tools to improve secure development, detection, incident response, or code analysis workflows. Benefits: (US-ONLY) 100% of medical, dental, and vision covered including 75% for dependents Flexible vacation days and quarterly mental health days so you can recharge Enjoy a More ❯
Chantilly, Virginia, United States Hybrid / WFH Options
Peraton
complexity from architecture and processes Configure and use state-of-the-art monitoring tools to gather insights and then act upon the results Conduct incident response and in-depth rootcauseanalysis This position is hands-on, requiring the ability to provide first-level system and network support and problem resolution identification Responsible for the monitoring the … daily software and network operations in a distributed environment Responsible for monitoring, working with users on fault isolation and resolution, as well as system analysis and reporting This job will include shift work to allow for complete 24x7 monitoring of software systems. Will need to have flexibility to work multiple shifts (day, mid, swing), as needed. Job is on More ❯
Doncaster, Yorkshire, United Kingdom Hybrid / WFH Options
Hiya Technology Ltd
and Reporting: Define and track key quality metrics to measure the effectiveness of QA processes. Provide regular reports and updates on testing progress, coverage, and results to stakeholders. Conduct rootcauseanalysis on defects, working closely with development teams to resolve issues and prevent recurrence. Skills and Experience of a Lead QA Analyst; Experience in quality assurance More ❯
Central London, London, United Kingdom Hybrid / WFH Options
Halian Technology Limited
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
and solution evaluation, testing, integration, and security certification relevant to DNS, IPAM, Load Balancer, and Webproxy services. Support Tier-4 production troubleshooting of the services above. Develop pro-active analysis and solutions for the NMS and Operations teams. Leverage complex network analysis tools to identify and predict relevant services problems and aid in the post-mortem analysis … as well as using modern tools and methods such as Ansible, Python, Terraform, and/or Windows PowerShell. Troubleshoot, diagnose, and resolve complex system and services issues to include RootCauseAnalysis (RCA). System performance analysis and tuning to ensure efficient and optimally configured solutions. Leading engineering projects and teams through the Technical Processes of More ❯
capacity, and availability - and planning smart enhancements. Supporting compliance with SOx, audit and security standards such as ISO27001 and Cyber Essentials. Investigating and resolving incidents, supporting users, and ensuring rootcauseanalysis is actioned. Mentoring junior colleagues and shaping the multi-year IT strategy with your SAP expertise. What we're looking for: Proven experience with SAP More ❯
and dashboards to identify potential control failures as part of the control testing process. Ensure the accuracy and timely completion of control testing, providing peer review. Document findings, including rootcauseanalysis and applicable recommendations for remediation. Be the primary liaison with partners, delivering clear progress updates and results. Contribute lessons learned by integrating partner feedback to … and organizational requirements. Experience applying governance, risk, and control principles. Experience in automated and manual testing of security controls. Experience facilitating meetings and conveying complex ideas. Data collection, validation, analysis, and interpretation. Experience Researching and applying latest technologies. Experience with Agile methodology. Big 4 accounting experience. Hold a professional certification such as CISA, CISM, CISSP, PCI QSA, ISO More ❯
and dashboards to identify potential control failures as part of the control testing process. Ensure the accuracy and timely completion of control testing, providing peer review. Document findings, including rootcauseanalysis and applicable recommendations for remediation. Be the primary liaison with partners, delivering clear progress updates and results. Contribute lessons learned by integrating partner feedback to … and organizational requirements. Experience applying governance, risk, and control principles. Experience in automated and manual testing of security controls. Experience facilitating meetings and conveying complex ideas. Data collection, validation, analysis, and interpretation. Experience Researching and applying latest technologies. Experience with Agile methodology. Big 4 accounting experience. Hold a professional certification such as CISA, CISM, CISSP, PCI QSA, ISO More ❯
Newport, Gwent, Wales, United Kingdom Hybrid / WFH Options
Yolk Recruitment
leadership role where you'll take ownership of incident and problem management across a critical national infrastructure environment. You'll oversee the governance of best practice frameworks, ensuring timely rootcauseanalysis and preventative actions, while leading a collaborative team and influencing service delivery across a multi-vendor landscape. This is an opportunity to create tangible improvements More ❯
Newport, Wales, United Kingdom Hybrid / WFH Options
Yolk Recruitment
leadership role where you'll take ownership of incident and problem management across a critical national infrastructure environment. You'll oversee the governance of best practice frameworks, ensuring timely rootcauseanalysis and preventative actions, while leading a collaborative team and influencing service delivery across a multi-vendor landscape. This is an opportunity to create tangible improvements More ❯
Strong understanding of distributed systems, fault tolerant design, and high availability architectures. Knowledge of CI/CD pipelines and infrastructure as code tools (Terraform, Ansible, CloudFormation). Experience in rootcauseanalysis and implementing systemic improvements. Preferred: Significant experience with UX/UI writing or design Knowledge of regulatory standards and compliance (e.g., PCI DSS, HIPAA). More ❯
Storage (Block, Object, SQL, NOSQL) • Authentication, Authorisation, Identity Platforms • Information Security, Privacy and Regulatory Compliance • Performance Tuning, Hardening and Troubleshooting • Problem Solving Skills to Methodically Find Faults and perform RootCauseAnalysis • Able to evaluate multiple courses of action, achieving goals by non-standard means if necessary • System Regression • Protocol Analysis • Load Testing • Availability and Resilience More ❯