Incident Response Jobs in London

201 to 225 of 247 Incident Response Jobs in London

Business Systems Analyst - FTC 6 months

London, United Kingdom
Willis Towers Watson
in line with our Mid-Market technology roadmap. The Role Technology & Systems Management Oversee the ongoing maintenance and development of Mid-Market business applications and platforms. Lead fault resolution, incident response, and ensure timely ticket management. Ensure compliance with security policies and lead on risk remediation activities. Manage cloud development, architecture, and system integrations. Coordinate licensing, certificates, and …
Employment Type: Permanent
Salary: GBP Annual

Engineering Team Lead, SRE - Real-time Data London, GBR

London, United Kingdom
Bloomberg L.P
readiness and scalability through monitoring and forecasting. System Observability - Proactively detect issues, build alerting systems, and centralize health dashboards. Production Risk Management - Ensure safe software releases, drive infrastructure improvements. Incident Response - Lead or support fast, effective remediation during live incidents; build automation for common operational issues. What We're Looking For We're seeking a leader who can …
Employment Type: Permanent
Salary: GBP Annual

Technology Data Operations Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Frontier Economics
Compliance Partner with the Technology Partnership team to uphold security standards and comply with internal policies and regulations. Implement encryption, data masking, secure transmission, and robust access controls. Support incident response and integrate threat detection into data workflows. Contribute to developing and maintaining data governance frameworks. Stakeholder Engagement & Communication Work alongside the Technology Operations Manager to deliver on …
Employment Type: Permanent, Work From Home

Staff Software Engineer

London, United Kingdom
Molten Ventures plc
proactive refactoring and system improvements Drive and approve high-impact technical decisions with long-term maintainability and scalability in mind Monitor system performance and ensure strong observability, alerting, and incident response practices Contribute to architecture documentation and facilitate system knowledge sharing Partner with engineering and product leadership to influence long-term engineering strategy and technical roadmap About You …
Employment Type: Permanent
Salary: GBP Annual

Operations Manager - Third Party Supplier and Procurement

London, United Kingdom
Hybrid / WFH Options
ZAVA
to improve financial performance from external partnerships. Key Accountabilities Own provider relationships: Act as the operational lead for key suppliers. Performance & SLA management: Define and track SLAs, KPIs, and incident response processes to ensure consistently high performance. Issue resolution: Coordinate with internal teams (Product, Tech, Finance, Legal) to quickly resolve provider-related disruptions (e.g. payment failures, ID verification …
Employment Type: Permanent
Salary: GBP Annual

Head of Engineering - Automotive & Data

London, United Kingdom
Parkopedia
s data strategy, enabling the intelligent use of mobility, behavioural, and payment data to unlock new product and commercial opportunities. Ensure platform reliability, performance, and scalability through robust observability, incident response processes, performance testing, and fault-tolerant architecture. Partner with Security, Compliance, and Infrastructure teams to meet regulatory and certification standards (e.g., PCI DSS, TISAX, ISO 27001), and …
Employment Type: Permanent
Salary: GBP Annual

Operations Manager

London, United Kingdom
VertoFX ltd
operations. Designing, documenting, and implementing new operational processes related to our activities, ensuring they are robust, compliant, and built for hyper-scalability while maintaining a positive customer experience. Leading incident responses for payment disruptions, driving them to a swift and effective close, and developing, implementing strategies to prevent recurrence. Developing, maintaining comprehensive documentation for all payment operational processes and … and translate strategic objectives into operational plans and execution. Experience in designing and implementing new operational processes, ideally within a fintech environment and with consistent changes. Solid understanding of incident management principles and best practices, specifically within a real-time payments context. Excellent data analysis skills with the ability to interpret complex payment data, draw meaningful conclusions, and make …
Employment Type: Permanent
Salary: GBP Annual

Solutions Architect

City of London, London, United Kingdom
Anson McCade
reviews to align platforms with AWS best practices. Collaborate across development teams to deliver reusable, automated migration tooling and infrastructure as code. Guide engineering teams through optimization, troubleshooting, and incident response. Promote adoption of DevOps methodologies and automated deployment pipelines. Create and maintain comprehensive documentation to support scalability and reuse. Skills & Experience Ideal candidates will bring experience in many …

Solutions Architect

London Area, United Kingdom
Anson McCade
reviews to align platforms with AWS best practices. Collaborate across development teams to deliver reusable, automated migration tooling and infrastructure as code. Guide engineering teams through optimization, troubleshooting, and incident response. Promote adoption of DevOps methodologies and automated deployment pipelines. Create and maintain comprehensive documentation to support scalability and reuse. Skills & Experience Ideal candidates will bring experience in many …

Site Reliability Engineer

London, United Kingdom
Grid Dynamics International, Inc
call responsibilities to address critical incidents and maintain system availability. Essential functions Provide support and ensure the stability of Data Platform solutions. Participate in an on-call rotation for incident response. Manage cloud resources using IaC tools like CloudFormation and Terraform on AWS and GCP. Implement data security best practices in cloud applications. Apply cloud networking knowledge (VPCs, Route …
Employment Type: Permanent
Salary: GBP Annual

Technical IAM Consultant

London, United Kingdom
Barclay Simpson
Stakeholders: Work with senior executives and business units to align IAM strategies with company objectives. IAM Transformation: Oversee the adoption of new IAM technologies and systems across the business. Incident Response: Lead IAM-related incident response strategies and ensure organizational resilience. Stay Current: Keep up with IAM trends and best practices to inform the company's …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Orgvue Limited
Enhance SRE practices across the organization Implement robust observability metrics, logs, and traces using our observability tools Guide the team in building automated, self-healing systems Own and evolve incident response processes, including on-call practices and post-mortem culture Mentor engineers on reliability, operational readiness, and scalable infrastructure best practices Drive Infrastructure as Code (IaC) initiatives using … using Terraform and knowledge of GitOps workflows Strong background in observability: metrics, visualization, logging, tracing Understanding of automation, CI/CD pipelines, deployment automation, and release strategies Experience with incident management, disaster recovery, root cause analysis, and post-incident reviews Additional Benefits: Hybrid working: 1+ days a week in London office Wellbeing initiatives: coaching, fitness sessions, webinars, Wellbeing …
Employment Type: Permanent
Salary: GBP Annual

Technology Resilience Manager

London, United Kingdom
Innovation Group
multi-faceted role supporting both a Technology Transformation Programme as well as helping to ensure current operational technology and applications are reliable and resilient. This role will suit an incident or IT disaster recovery manager, or someone with equivalent practical experience in technology operations, who is looking to broaden their skillset. After developing your specialist skills you are now … maintain risk identification frameworks. - Risk Assessment & Evaluation: Ensure compliance with governance policies, provide expertise on operational resilience, and support risk assessments for internal operations and third-party vendors. - Crisis & Incident Management: Lead the design and implementation of IT Disaster Recovery and Business Continuity plans, conduct simulations, and manage the Crisis and Major Incident Management Framework. - Risk Governance & Compliance … Management & Development: Promote awareness campaigns, research resilience strategies, and support team learning and development. Required skills & experience: - Experience in technology operations, ITSM including Service Asset and Configuration Management - Created incident response playbooks - Developed and tested recovery plans, identified and resolved gaps in resilience - Managed incidents and led responses to disruptions - Worked with external vendors and service providers to …
Employment Type: Permanent
Salary: GBP Annual

Splunk Architect

London, United Kingdom
Hybrid / WFH Options
N Consulting Limited
optimize ETL pipelines and data workflows for seamless data integration. Manage Docker/Kubernetes environments for containerized deployment. Collaborate closely with Treasury teams, especially in Wallstreet FX environments. Lead incident response efforts and conduct post-mortem analysis to improve system resilience. What We're Looking For: Strong hands-on experience with Splunk architecture and observability tooling Expertise in …
Employment Type: Permanent
Salary: GBP Annual

Staff Software Engineer, AI Reliability Engineering

London, United Kingdom
Hybrid / WFH Options
Menlo Ventures
of millions of external customers and high-traffic internal workloads Develop and manage automated failover and recovery systems for model serving deployments across multiple regions and cloud providers Lead incident response for critical AI services, ensuring rapid recovery and systematic improvements from each incident Build and maintain cost optimization systems for large-scale AI infrastructure, focusing on … evaluate your non-AI-assisted communication skills. Please indicate 'Yes' if you have read and agree. Why Anthropic? Why do you want to work at Anthropic? (We value this response highly - great answers are often 200-400 words.) Will you now or will you in the future require employment visa sponsorship to work in the country in which the …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

South West London, London, England, United Kingdom
Oscar Technology
support cloud-native infrastructure evolution Build and optimise CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins) Implement robust monitoring and alerting solutions (CloudWatch, Azure Monitor, Grafana, ELK) Own incident response processes, ensuring high availability and rapid resolution Collaborate with stakeholders to communicate solutions and technical trade-offs clearly Ideal Experience: 3-5 years SRE or DevOps experience …
Employment Type: Contractor
Rate: £450 - £500 per day

Security Coordinator

London, United Kingdom
Goldsmiths, University of London
include Implementing security protocols and procedures, managing access control systems, conducting and coordinating patrols and inspections to ensure compliance, leading and overseeing daily security operations, provide effective leadership during incident response, such as fires, evacuations, or medical incidents, ensuring emergency procedures are followed. About the Candidate The Ideal candidate will foster and maintain positive relationships with the team …
Employment Type: Permanent
Salary: GBP Annual

Systems Development Engineer - Ground Infrastructure, Project Kuiper

London, United Kingdom
Amazon
existing systems such as trouble ticketing, dashboards, and metrics tools and services - Measure and improve the performance and availability of the Kuiper Ground Infrastructure - Provide critical operations support and incident response for the service, while taking part in an on-call rotation - Design and implement scalable backend services and APIs - Front end development A day in the life …
Employment Type: Permanent
Salary: GBP Annual

Client Solutions Advisor - Privilege Access Management (PAM)

London, United Kingdom
Hybrid / WFH Options
Saviynt Inc
awareness training during onboarding and annually thereafter - Review (initially and annually thereafter), understand, and adhere to Information Security/Privacy Policies and Procedures such as (but not limited to): Incident Response Policy/Procedures Personnel Security Policy Saviynt is an equal opportunity employer and we welcome everyone to our team. All qualified applicants will receive consideration for employment …
Employment Type: Permanent
Salary: GBP Annual

Senior Engineering Manager

London, United Kingdom
dunnhumby
of developer tooling to ensure positive adoption of services across the board Observability strategy, including metrics, logging, tracing and alerting of our K8 environments Champion automation-first approaches to incident response, service recovery, and environment provisioning Collaborate with product and platform teams to embed hosting strategy and practices across the organisation Partner with Product, Security, Platform and wider …
Employment Type: Permanent
Salary: GBP Annual

Business Development Manager

London, South East, England, United Kingdom
Hybrid / WFH Options
Profectus Recruitment
hiring for an experienced Business Development Manager for our market leading Cyber Security client. Our client specialises in offering Cyber Security Solutions, including but not limited to Pen Testing, Incident Response, Investigative Services and accreditations. If you have a passion for Cyber Security and a minimum of 2 years in Cyber Security sales then this could be the … pitches and proposals to target clients. Essentials: A minimum of 2 years exceeding targets in a Business Development role specifically within Cyber Security. Knowledge of Cyber Services, such as incident response, Pen Testing or Digital Forensics. A competitive and Goals driven mindset. Well versed in the use of CRM software and additional sales software tools. Minimum of …
Employment Type: Full-Time
Salary: £30,000 - £35,000 per annum, OTE

Site Reliability Engineer (SRE) - Front-end/React Specialist

London, United Kingdom
Hybrid / WFH Options
ZILO
join our SRE team. In this role, you'll ensure the reliability, performance, and operability of our React-based user interfaces running on AWS and Kubernetes. You'll lead incident response for client-side issues, diagnose end-to-end failures in the stack, and build tooling to automate detection and self-healing. Key Responsibilities Incident Response … or deployment failures. Analyze browser logs, application metrics (e.g., Real User Monitoring), and backend traces to isolate root causes across React, Node.js services, AWS, and Kubernetes layers. Orchestrate post-incident reviews: document findings, define mitigation plans, and drive tickets to resolution. Reliability Engineering & Automation Develop and maintain robust observability for front-end components: integrate Datadog for observability. Define SLIs … Collaboration & Knowledge Sharing Serve as the React/SRE subject-matter expert: mentor engineers on best practices for building resilient front-ends. Produce and maintain runbooks, debugging guides, and incident-playbooks specific to client-side failures. Partner closely with wider backend SRE, DevOps, and product teams to ensure end-to-end reliability. Enhanced leave - 38 days inclusive of …
Employment Type: Permanent
Salary: GBP Annual

Technical Services Manager

Roehampton, London, England, United Kingdom
Insight Executive Group Limited
financial performance against budget targets. Overseeing audits, reporting, and performance improvement initiatives. Supporting project works from planning through to delivery and closeout. Participating in out-of-hours support and incident response as needed. About You You’ll be an experienced FM professional with a strong technical background, ideally from within the healthcare or PFI sectors. Essential: Proven experience … and supplier contracts. Technically knowledgeable with the credibility to engage with engineers and suppliers. IT literate, ideally familiar with CAFM systems and ISO standards. Willingness to participate in emergency response and on-call rota. Desirable: HNC/ONC in Engineering, Building Services, or a related field. NEBOSH or relevant H&S qualification. Membership of a relevant professional body (e.g. …
Employment Type: Full-Time
Salary: £70,000 - £80,000 per annum

Head of Technical Operations Centre (TOC) - Media & Advertising

London, United Kingdom
Hamilton Barnes Associates Limited
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incident response, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incident management, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows, MacOS …
Employment Type: Permanent
Salary: GBP Annual

Data Centre Day Technician - Slough

London, UK
Stott & May Professional Search Limited
Responsibilities: Operate and maintain all mechanical and electrical systems on site, including conducting HV switching (where authorised). Support or deputise for the Shift Leader when required, assisting with incident response and team coordination. Perform planned and reactive maintenance on a variety of critical infrastructure systems. Ensure compliance with method statements, risk assessments, and safe systems of work. … in a shift-based environment. Preferred Skills and Attributes: IOSH, NEBOSH, or similar health & safety training Leadership capability under pressure Familiarity with CAFM and digital PTW systems Experience with incident and change management processes This is a great opportunity to join a high-performing operations team within a world-class data centre. Ongoing training and progression opportunities are available …

Incident Response (London)
10th Percentile: £53,649
25th Percentile: £62,375
Median: £70,000
75th Percentile: £87,500
90th Percentile: £100,000