look at all the evidence available and support the client on the appropriate action to contain and remediate any security incident. They will need to be able to provide root cause analysis and liaise with the customer and the Service Delivery Manager, as well as ensuring the actions of the SOC Analysts follow best practice. Security Monitoring … Monitoring SIEM tools to assure a high level of security operations delivery function. Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incidents and escalate as necessary. Support other teams on investigations into incidents, determining the root cause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly. Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes …
look at all the evidence available and support the client on the appropriate action to contain and remediate any security incident. They will need to be able to provide root cause analysis and liaise with the customer and the Service Delivery Manager, as well as ensuring the actions of the SOC Analysts follow best practice. Job Duties … Monitoring SIEM tools to assure a high level of security operations delivery function. Oversee and enhance security monitoring systems to detect and analyse potential security incidents. Conduct real-time analysis of security events and incidents and escalate as necessary. Support other teams on investigations into incidents, determining the root cause and impact. Document findings and lessons learned … with the Technical Teams to ensure all new and changed services are monitored accordingly. Documentation: Maintain accurate and up-to-date documentation of security procedures, incident response plans, and analysis reports. Create post-incident reports for management and stakeholders. Support the creation of monthly reporting packs as per contractual requirements. Create and document robust event and incident management processes …
access control (RBAC), and ensuring compliance with DoD standards. Assist in the automation of operational tasks using Infrastructure-as-Code tools like Terraform or Bicep. Participate in incident response, root cause analysis, and post-incident reviews to improve system reliability. Provide helpdesk support by taking ownership of tickets in the Remedy ticketing solution, resolving issues, and managing …
application support strategies Key Responsibilities: Own Application Support Lifecycle: Ensure end-to-end support for critical business applications, meeting SLAs and availability targets. Incident & Problem Management: Lead resolution and root cause analysis for all Retail application incidents, including major (P1/P2) issues. Escalation & Crisis Leadership: Act as the escalation point for major incidents and provide direction … containerization experience with Azure, Docker, and AKS. Familiarity with modern web technologies, including React, REST APIs, and SOAP architectures. Skilled in managing P1/P2 incidents, business impact analysis, root cause investigations, and change coordination. Strong grasp of IT service management practices; ITIL v4 certification or equivalent preferred. Proactive Monitoring: Hands-on experience with tools like …
to the overall success of the FX desk's technology platform. Respond rapidly to production incidents using data-driven decision making to minimise downtime and financial impact while leading root cause analysis and conducting blameless post-mortems. Enhance application health monitoring by implementing robust observability solutions and automating manual processes to improve system resilience. Drive cost optimisation …
maintain systems according to approved design. Service Delivery & Operations: Lead key service management processes (Continuity, Capacity, Availability). Attend incident/problem bridges as the subject matter expert. Review root cause analyses (RCAs) and oversee corrective actions. Provide accurate monthly service performance reports across IT and OT. Supplier & Financial Management: Lead and manage suppliers to meet agreed SLAs … change management experience. Ability to simplify complex network architecture for non-technical audiences. Desirable Technical Skills & Qualifications: Knowledge of network security technologies and strategic supplier management. Experience in stakeholder analysis and business case development. Familiarity with cloud integration (Azure and AWS). What's in it for you? Competitive salary up to £75,000 per annum, depending on experience …
best practices, cloud strategies, and platform engineering. Team Leadership: Guide and coach a team of engineers, technical specialists, and architects, encouraging the adoption of innovative technologies and practices. Technical Analysis: Lead technical analysis and estimation efforts for custom-built applications. Best Practices: Drive the adoption of release management and automation best practices. Incident Management: Ensure thorough root cause analysis and prompt remediation during any incidents or outages. Vendor Coordination: Work with external vendors to supplement team capacity and expertise when necessary. YOU'RE GOOD AT You bring solid development and program leadership experience to drive technical governance, innovation, integrations, and cloud strategies using emerging technologies like Gen AI. You thrive in environments that demand …
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including root cause analysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working …
cloud and hybrid environments. Architect observability solutions (monitoring, logging, alerting) that detect and prevent failures before they impact users. Own and improve incident response workflows, including runbooks, communications, and root cause analysis. Define and enforce SLIs, SLOs, and error budgets to balance innovation with operational stability. Mentor engineers and advise teams on best practices for scalability, security, deployment … efforts, reliability reviews, and cross-functional reliability programs. Core Responsibilities Operations Leadership Act as a senior escalation point for major incidents and production outages. Lead post-incident reviews, coordinate root cause analysis, and drive remediation plans. Communicate platform health, risk, and improvement plans with technical and non-technical stakeholders. Design and build robust CI/CD workflows …
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
BAE Systems (New)
hybrid and flexible working arrangements available. Please consult your recruiter for details. Grade: GG10 - GG11 Referral Bonus: £5,000 Job Description Serve as the point of escalation for intrusion analysis, forensics, and incident response queries. Provide root cause analysis for complex, non-standard findings and anomalies without existing playbooks. Mentor team members and share knowledge proactively. … red team and pentest findings to improve detection rules. Provide forensic support and threat emulation to improve alert triage and accuracy. Identify gaps in SOC processes, data collection, and analysis, demonstrating the need for improvements through scenarios and red teaming. Perform complex threat hunting, automation, and analytic enrichment tasks. Set vision and milestones for emulation and detection capabilities, influencing …
Modeling Develop and implement sophisticated statistical models and machine learning algorithms to forecast trends, predict outcomes, and identify opportunities for performance enhancement. Utilize advanced analytics techniques such as regression analysis, time series forecasting, and clustering to extract deeper insights from multifaceted datasets. Design and execute A/B tests to optimize strategies and validate hypotheses. Strategic Performance Analysis and Optimization Conduct in-depth analysis of KPIs, benchmarking against industry standards and historical performance. Perform multi-dimensional analysis to uncover hidden patterns and correlations in client data. Develop and maintain a comprehensive performance measurement framework, aligning metrics with the client's strategic objectives. Lead root cause analyses for complex performance issues, proposing data-driven solutions. …
dedicated team, collaborating with software engineers, AI researchers, designers and medical professionals. Responsibilities · Implement requested integrations and infrastructures/environments. · Deploy updates and fixes through all supported environments. · Perform root cause analysis for production incidents. · Develop scripts to automate CI/CD procedures. · Design procedures for system troubleshooting and maintenance. · Defining and setting development, test, release, update …
Knowledge Management: Maintain up-to-date technical documentation, including API/interface catalogues, data flow diagrams, environment runbooks, and integration design patterns. Incident and Service Request Administration: Assist in root cause analysis for integration-related issues, serving as the primary point of contact for documenting, triaging, and coordinating the resolution of incidents and service requests. Change Coordination … a conduit between the development team and project teams to ensure consistent, transparent, and professional communication. Education and Experience: Bachelor's degree in computer science, information technology, engineering, system analysis or a related study, or equivalent experience. A minimum of three years in a technology-related capacity with direct exposure to software development or IT project environments. At least …
and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low …
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor …
Sheffield, South Yorkshire, United Kingdom Hybrid / WFH Options
Experis
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Qualifications and experience The ideal candidate for this role will have the below experience and qualifications: Bachelor …
and GCP, ensuring resilience, cost-efficiency, and data security. Collaborate closely with infrastructure, architecture, and cybersecurity teams to meet internal risk, compliance, and governance requirements. Support live systems, perform root cause analysis, and implement solutions for incidents and performance bottlenecks. Required Skills/Experience The ideal candidate will have the following: Bachelor's or Master's degree …
Reading, Berkshire, United Kingdom Hybrid / WFH Options
Pertemps
you'll be doing as a Senior Cyber Security Analyst: Security Incident Response: Investigate security alerts from SIEM and third-party MSSPs, triage and respond to incidents, and support root cause analysis to drive remediation. Stakeholder Engagement: Work closely with technology and business teams to communicate cyber risks, recommend actions, and ensure controls are proportionate and effective. …
cloud subject matter expert, providing AWS best practice guidance to internal teams and project stakeholders. Investigate and resolve AWS infrastructure-related incidents, ensuring minimal downtime and impact. Participate in root cause analysis and implement preventative measures. Maintain clear, detailed documentation for AWS environments, architecture diagrams, SOPs, and runbooks. Continuously look for opportunities to improve cloud architecture, security …
solutions on AWS, ensuring scalability, reliability, and security. Collaborate with cross-functional teams to understand requirements, develop solutions, and deliver high-quality software solutions. Troubleshoot and debug issues, perform root cause analysis, and implement effective solutions. Write clean, efficient, and maintainable code in production following best practices and coding standards, such as Test Driven Development and implementing …
IT Service Management (ITSM) processes across all teams, ensuring standardized, efficient, and effective service delivery. Establish SRE-based operational metrics, including SLOs, SLIs, and error budgets. Oversee incident response, problem resolution, and root cause analysis with AI-driven remediation. Ensure high availability, performance, and security compliance for all enterprise services. Develop a follow-the-sun operational support model, ensuring 24x7 resilience and uptime across …
expectations in partnership with a member of the Project Management team or acting as project Lead. Your Responsibilities: Support incident management for the support team, ensuring robust troubleshooting and root cause analysis. Ability to support and resolve incidents effectively for the support team. This role spans both the Application Management Service (support) team and project implementation. Collaborate with functional …