look at all the evidence available and support the client on the appropriate action to contain and remediate any security incident. They will need to be able to provide rootcauseanalysis and liaise with the customer and the Service Delivery Manager as well and ensuring the actions of the SOC Analysts follow best practice. Security Monitoring … Monitoring SIEM tools to assure high a level of security operations delivery function Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incident and escalate as necessary Support other teams on investigations into incidents, determining the rootcause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes More ❯
look at all the evidence available and support the client on the appropraite action to contain and remediate any security incident. They will need to be able to provide rootcauseanalysis and liaise with the custiomer and the Service Delivery Manager as well and ensuring the actions of the SOC Analysts follow best practice. Job Duties … Monitoring SIEM tools to assure high a level of security operations delivery function Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incident and escalate as necessary Support other teams on investigations into incidents, determining the rootcause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes More ❯
access control (RBAC), and ensuring compliance with DoD standards. Assist in the automation of operational tasks using Infrastructure-as-Code tools like Terraform or Bicep. Participate in incident response, rootcauseanalysis, and post-incident reviews to improve system reliability. Provide helpdesk support by taking ownership of tickets in the Remedy ticketing solution, resolving issues, and managing More ❯
Dashboards. Contribute to process and technical capabilities (e.g., Data Modeling, Data Visualizations, Artificial Intelligence (AI), Machine Learning (ML to enhance identification of service improvement opportunities. Complex data mining, trend analysis, metric and report production will be required. Identify and review service improvement opportunities with stakeholders based on TR enterprise-wide performance metrics. Proactive collaboration with stakeholders to create and … improvement initiatives. Be responsive to internal stakeholder needs and engage with stakeholders across multiple functions. Typical daily work may include but is not limited to complex data mining, trend analysis, metric and report production, process flow charting, and iterative service improvement activities (e.g. daily standups, data quality checks, change reviews, tool enhancement design and review). Contribute to proactive … enhance service reliability and availability. Support for Service Management activities to ensure a consistent standard of incident, problem, change and other practice areas for enhanced accuracy of data quality, rootcauseanalysis and identification of preventative measures. Support the recurring service performance reporting cycle (e.g., weekly, monthly, quarterly). About You: Experience in enterprise problem management, application More ❯
maintain systems according to approved design. Service Delivery & Operations: Lead key service management processes (Continuity, Capacity, Availability). Attend incident/problem bridges as the subject matter expert. Review rootcause analyses (RCAs) and oversee corrective actions. Provide accurate monthly service performance reports across IT and OT. Supplier & Financial Management: Lead and manage suppliers to meet agreed SLAs … change management experience. Ability to simplify complex network architecture for non-technical audiences. Desirable Technical Skills & Qualifications: Knowledge of network security technologies and strategic supplier management. Experience in stakeholder analysis and business case development. Familiarity with cloud integration (Azure and AWS). What's in it for you? Competitive salary up to £75,000 per annum, depending on experience More ❯
application support strategies Key Responsibilities: Own Application Support Lifecycle: Ensure end-to-end support for critical business applications, meeting SLAs and availability targets. Incident & Problem Management: Lead resolution and rootcauseanalysis for all Retail application incidents, including major (P1/P2) issues. Escalation & Crisis Leadership: Act as the escalation point for major incidents and provide direction … containerization experience with Azure , Docker , and AKS . Familiarity with modern web technologies, including React , REST APIs , and SOAP architectures. Skilled in managing P1/P2 incidents , business impact analysis, rootcause investigations, and change coordination. Strong grasp of IT service management practices; ITIL v4 certification or equivalent preferred. Proactive Monitoring : Hands-on experience with tools like More ❯
within internal andexternal service operations. Requirements Key Responsibilities Incident and ServiceManagement Act as the escalation point for complex incidentsand service requests, ensuring timely resolution in accordance with agreedSLAs. Perform root-causeanalysis and drive resolution ofrecurring problems. Monitor service delivery performance, proactivelyidentifying potential disruptions and coordinating corrective actions. Technical Support and Analysis Provide advanced technical support … Incident, Problem, Change, and Service Level Management. Demonstrable experience using ITSM tools (e.g.,ServiceNow, Zendesk, Jira Service Management, Freshdesk). Excellent analytical and problem-solving skills,with experience conducting root-cause analyses and recommending effectivesolutions. Strong technical background with proficiency incommon enterprise technologies (Microsoft 365, Azure/AWS, networktroubleshooting, databases, application support). Outstanding communication and interpersonal skills More ❯
systems to detect data anomalies, system failures, and performance issues and leverage advanced scripting and orchestration tools (e.g., Python, Bash, Apache Airflow) to automate workflows and reduce operational overhead. RootCauseAnalysis & Incident Management: Lead post-incident reviews, perform rootcauseanalysis for data disruptions, and implement corrective actions, while creating detailed reports and More ❯
systems to detect data anomalies, system failures, and performance issues and leverage advanced scripting and orchestration tools (e.g., Python, Bash, Apache Airflow) to automate workflows and reduce operational overhead. RootCauseAnalysis & Incident Management: Lead post-incident reviews, perform rootcauseanalysis for data disruptions, and implement corrective actions, while creating detailed reports and More ❯
availability Develop common framework components (to be leveraged by enterprise applications), define standards for configuration, monitoring, reliability, and performance engineering Work with Technology teams to resolve major incidents Conduct rootcauseanalysis (RCA) for incidents and implement preventive measures. Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets. Continuously improve automated remediation … Managers (GTMs), Local Traffic Managers (LTMs) Hands on experience on configuring Splunk, Grafana dashboards, Kibana, Elasta alerts etc. Working experience on network rules creation, load balancer configurations, network packet analysis Analytical knowledge and exposure on rootcause identification using analyzer tools like IBM support assistant, Splunk etc. Good understanding of Linux OS internals, performance tools, Core commands More ❯
Knowledge Management: Maintain up-to-date technical documentation, including API/interface catalogues, data flow diagrams, environment runbooks, and integration design patterns Incident and Service Request Administration: Assist in rootcauseanalysis for integration-related issues, serving as the primary point of contact for documenting, triaging, and coordinating the resolution of incidents and service requests. Change Coordination … a conduit between the development team and project teams to ensure consistent, transparent, and professional communication Education and Experience: Bachelor's degree in computer science, information-technology, engineering, system analysis or a related study, or equivalent experience A minimum of three years in a technology-related capacity with direct exposure to software development or IT project environments. At least More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
Us3 Consulting
internal and/or 3rd party support teams Ensure resolution of incidents according to agreed SLA's Apply problem solving skills to recreate, debug, identify and resolve issues Perform rootcauseanalysis of issues to prevent reoccurrence Form part of the on-call rota for out of hours critical incidents Provide proactive support & maintenance across the application More ❯
best practices, cloud strategies, and platform engineering. Team Leadership: Guide and coach, a team of engineers, technical specialists, and architects, encouraging the adoption of innovative technologies and practices. Technical Analysis:Lead technical analysis and estimation efforts for custom-built applications. Best Practices:Drive the adoption of release management and automation best practices. Incident Management:Ensure thorough rootcauseanalysis and prompt remediation during any incidents or outages. Vendor Coordination:Work with external vendors to supplement team capacity and expertise when necessary. YOU'RE GOOD AT You bring solid development and program leadership experience to drive technical governance, innovation, integrations, and cloud strategies using emerging technologies like Gen AI. You thrive in environments that demand More ❯
Proactively identify areas for improvement and implement preventive measures. Service Improvement: Continuously assess the IT service delivery process and implement improvements that enhance efficiency, effectiveness, and customer satisfaction. Lead rootcauseanalysis for service delivery issues and define corrective actions. Change Management: Ensure that changes to the IT environment are implemented smoothly with minimal disruption to service. More ❯
1st and 2nd line support teams, ensuring timely and effective resolution of complex issues interfacing with 3rd line across the wider business. Lead incident and problem management processes, including root-causeanalysis for recurring incidents, working closely with internal teams and external vendors. Own and manage the organisation’s IT Service Management (ITSM) platform (HALO), including administration … standards. Proven track record in managing SLA compliance and delivering results in a fast-paced environment. Exceptional organisational and multitasking abilities. Analytical mindset with the ability to identify trends, root causes, and implement solutions. Desirable (Not Essential): CompTIA Network+ or equivalent — to support effective collaboration with network teams. Foundation-level cloud certification (e.g., Microsoft Azure Fundamentals, AWS Cloud Practitioner More ❯
standards and procedures, and suggesting ways to improve those processes over time Delivering resolutions to live incidents within SLA, prioritising service availability Taking the lead in problem investigation/rootcauseanalysis and delivering resolutions that prevent recurrence and minimise technical debt Providing out of hours support where necessary - shifts managed via on-call rota Interpreting the … TDD, ensuring appropriate test coverage and evidencing the outcomes of testing Service onboarding and transition Taking part in proactive knowledge transfer activities with incumbent suppliers Code review and quality analysis including the review of complete services, including the implementation of code scanning tooling Reviewing and improving technical documentation such as architecture overviews, deployment process definition and incident resolution runbooks More ❯
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including rootcauseanalysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working More ❯
business - Build datasets, metrics, and KPIs supporting business - Design and develop highly available dashboards and metrics using SQL and Excel/Quicksight or other BI reporting tools - Perform business analysis and data queries using scripting languages like R, Python etc - Design, implement and support end-to-end analytical solutions that are highly available, reliable, secure, and scale economically - Collaborate … cross-functionally to recognize and help adopt best practices in reporting and analysis, data integrity, test design, analysis, validation, and documentation - Proactively identify problems and opportunities and perform rootcauseanalysis/diagnosis leading to significant business impact - Work closely with internal stakeholders such as Operations, Program Managers, Workforce, Capacity planning, machine learning, finance teams … Excel - 5+ years using data visualization tools like Tableau, Quicksight or similar tools - Experience with R, Python or other statistical/machine learning tools - Experience demonstrating problem solving and rootcauseanalysis - Experience using databases with a large-scale data set - Bachelor's degree in engineering, analytics, mathematics, statistics or a related technical or quantitative field - Detail More ❯
hybrid and flexible working arrangements available. Please consult your recruiter for details. Grade: GG10 - GG11 Referral Bonus: £5,000 Job Description Serve as the point of escalation for intrusion analysis, forensics, and incident response queries. Provide rootcauseanalysis for complex, non-standard findings and anomalies without existing playbooks. Mentor team members and share knowledge proactively. … red team and pentest findings to improve detection rules. Provide forensic support and threat emulation to improve alert triage and accuracy. Identify gaps in SOC processes, data collection, and analysis, demonstrating the need for improvements through scenarios and red teaming. Perform complex threat hunting, automation, and analytic enrichment tasks. Set vision and milestones for emulation and detection capabilities, influencing More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Infinigate Group
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process, through RootCause Analysis. Responding to and resolving security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies Understanding of Incident Response, Cyber Kill Chain, ATT&CK · Knowledge & experience of common program language e.g., Python, C++, PowerShell, JavaScript Being able to perform RootCauseAnalysis Experience with vulnerability assessments Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi More ❯
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process, through RootCause Analysis. Responding to and resolving security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies Understanding of Incident Response, Cyber Kill Chain, ATT&CK · Knowledge & experience of common program language e.g., Python, C++, PowerShell, JavaScript Being able to perform RootCauseAnalysis Experience with vulnerability assessments Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi More ❯
Modeling Develop and implement sophisticated statistical models and machine learning algorithms to forecast trends, predict outcomes, and identify opportunities for performance enhancement. Utilize advanced analytics techniques such as regression analysis, time series forecasting, and clustering to extract deeper insights from multifaceted datasets. Design and execute A/B tests to optimize strategies and validate hypotheses. Strategic Performance Analysis and Optimization Conduct in-depth analysis of KPIs, benchmarking against industry standards and historical performance. Perform multi-dimensional analysis to uncover hidden patterns and correlations in client data. Develop and maintain a comprehensive performance measurement framework, aligning metrics with client's strategic objectives. Lead rootcause analyses for complex performance issues, proposing data-driven solutions. More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
BAE Systems Applied Intelligence
range of hybrid and flexible working arrangements - please speak to your recruiter about the options for this particular role. Grade: GG10 - GG11 Job Description Point of escalation for intrusion analysis, forensics and Incident Response queries. Able to provide rootcauseanalysis of complex, non-standard analytic findings and anomaly-based detections for which a playbook does … quality review of analyst activities Provide strategy and goals of operational team exercises with full autonomy as detection needs require. Influence the formation of team requirements inclusive of engineering, analysis and continuous improvement strategy. Devise technical interview questions, conduct technical interviews and evaluate candidate responses. Experience: Demonstrable experience of security testing practises and techniques Knowledge of Azure, desirable to … best and brightest minds - can work together to achieve excellence and realise individual and organisational potential. Job Title: Threat Hunter - National Security - Leeds Job City: Gloucester Professional Area: Business Analysis and Consultancy More ❯
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯