Incident Management Jobs in London

126 to 150 of 195 Incident Management Jobs in London

Front Office Support Engineer

London, United Kingdom
Hybrid / WFH Options
MARGO
for investigating and resolving database issues Good scripting ability in Python or similar for automation/troubleshooting Understanding of networking fundamentals, firewalls, and latency tuning Familiarity with support processes, incident management, and change control Bonus: Exposure to market data platforms (e.g., Bloomberg, Tradeweb, ION) ️ Tech stack includes: OS: Linux, Windows Scripting & Automation: Python, Ansible, Bladelogic Databases: SQL, Oracle More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Service Support Analyst

London, United Kingdom
Hybrid / WFH Options
Tokio Marine Kiln group
the incidentequest and to confirm the successful resolution of them. Liaise directly with internal 3rd line support teams to ensure the resolution of all incidents in line with the Incident Management Policy. Provide excellent customer service through ownership of issues and deliver effective communication and resolution of issues in line with the agreed service levels. Send clear, concise More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer - Developer Experience

London, United Kingdom
Zopa Bank Limited
master Honeycomb Developer empathy & outstanding communication skills; thrive on coaching and cross team collaboration Track record of data driven decision making and continuous improvement Familiarity with agile methodologies and incident management best practices Growth mindset; curious about GenAI and its impact on developer productivity Nice to have experience: Experience building mobile or serverless (Lambda) CI/CD pipelines More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Backend Engineer

London, United Kingdom
Hybrid / WFH Options
Etsy
used by 1M+ active daily users Take ownership of product development, from feature discovery, to the breakdown of work, and its implementation End-to-end application support, including production incident management Embrace agile methodologies and user-centred thinking Engage in a culture of continuous improvement by attending events such as blameless post-mortems, architecture reviews, and engineering guild More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Tria
platforms. This is a hands-on leadership role - you won't just guide others, you'll be the go-to expert when systems are under pressure. You'll lead incident response, own root cause analysis, and solve performance issues like memory leaks, outages, and flaky services. You will take ownership of the site reliability and drive that as a … discipline. Your focus will include: Leading incident management, post-mortems, and blameless RCAs Building scalable, resilient microservices with the dev teams Uplifting observability Improving alerting, monitoring, and system-level metrics Driving better SLOs, SLIs, and overall uptime What you'll bring: Experience in high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background … in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front The stack includes Kubernetes, Terraform, AWS, Python, and modern CI/CD tools, and it's evolving. If you understand what a good SRE practice looks like, and want to leave systems in a better place than you found them More ❯
Employment Type: Permanent
Salary: £85000 - £100000/annum
Posted:

Operations Analyst

London Area, United Kingdom
Norton Blake
review process for ongoing service improvement Assess the most effective methods and technologies to approach tasks, with a focus on automating processes where possible Establish and implement a document management framework ensuring consistent production and updating of documents that support the incident management process APPLY NOW More ❯
Posted:

Operations Analyst

City of London, London, United Kingdom
Norton Blake
review process for ongoing service improvement Assess the most effective methods and technologies to approach tasks, with a focus on automating processes where possible Establish and implement a document management framework ensuring consistent production and updating of documents that support the incident management process APPLY NOW More ❯
Posted:

Business Continuity Analyst

City of London, London, England, United Kingdom
Hybrid / WFH Options
Next Employment
people, processes or facilities. THE ROLE · Assist in the development, execution and maintenance of business continuity documents including business impact analysis and office risk assessments. · Support business recovery, crisis management, and disaster-preparedness planning within the organization. · Disseminate information on business continuity processes, standards and initiatives. · Assist in coordinating business continuity exercises and testing of the mass notification system. … Provide support throughout live events that impact the business continuity plan. · Engage, deliver and track business continuity training to new members on local crisis management teams. IDEAL CANDIATE · The ideal candidate will have excellent verbal and written communication skills with the ability to coordinate multiple projects/assignments simultaneously and complete tasks accurately and on a timely basis. · Experience … notification system platforms (AlertFind, Send Word Now, Everbridge), business continuity software (Fusion, Cobalt, LDRPS, etc.) and Power BI. · Experience in a BCM or related role · Experience with crisis/incident management and IT DR · Good understanding of BCM standards · CBCP designation a plus · Excellent verbal, written and interpersonal skills · Proficient in Microsoft Office Suite, e.g. Excel, Word, PowerPoint More ❯
Employment Type: Full-Time
Salary: £45,000 - £50,000 per annum, Inc benefits
Posted:

Product Specialist, 2nd Line Support

London, United Kingdom
Hybrid / WFH Options
Board Intelligence Limited
leaders and 3,000 organisations across the world, with clients across the Fortune 500, FTSE 100, and OMX 30. In 2024 we received substantial backing from K1 Investment Management - the leading B2B Enterprise SaaS investors. We are at the beginning of significant growth, and we're looking for superb talent to join us on this journey. The team is … queries and exceptions to pre-empt future issues and suggest product or process improvements. Champion process improvements and documentation, identifying inefficiencies in support workflows and helping refine the overall incident handling lifecycle. We are looking for a motivated individual with a strong technical background and a passion for delivering exceptional customer service. Key technical and professional skills include: Ability … to investigate and resolve complex technical issues using tools like Datadog, Bugsnag, and JIRA, and interpret log data to identify root causes and trends. Familiarity with incident management workflows using tools like PagerDuty and Bugsnag, with the ability to prioritise, document, and escalate issues appropriately based on severity and impact. Knowledge of APIs, SSO, and web technologies to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

L2 Support Engineer

London, United Kingdom
Hybrid / WFH Options
N Consulting Limited
including querying, regex, alerts, and dashboards setup (no need for Splunk admin knowledge). Application log analysis skills and prior experience troubleshooting issues in production environments. ITIL, ticket, and incident management skills with relevant work experience. Proactive approach to production issues, including notifying the team about ongoing and potential future issues. Ensuring client SLAs are met by managing … deliverables for critical applications and understanding IT/business SLAs. Collaborating with Development and Level 3 support teams on incident triages, release/change reviews, and application stabilization enhancements. Handling major incidents, engaging relevant teams, creating post-mortems, and ensuring incident closure. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineer I, Workplace Technology London

London, United Kingdom
Hybrid / WFH Options
Checkout Ltd
data and security consistency across all People Tech platforms. As an Engineer, you will triage and respond to queries from across the business. Alongside this, collaborate with People Tech Management, Product and Principal Engineers and other workday functional areas on any data requirements focusing on Workday support with additional cross-functional support opportunities in other technologies. What you will … technologies that could be used to improve effectiveness, efficiency and user experience Support/provide input to develop the HR technology systems, service road map, change control processes and incident management frameworks Provide technical content to support the development of materials to effectively train end-users on how to use the HR system On-call Support as required … developer in implementing and managing Core HCM/HRIS (specifically Workday) platforms and other systems across the employee lifecycle - such as L&D, Talent Acquisition, Service Delivery, and compensation management platforms. 1+ years of proven experience in software development with a proven track record delivering high quality software Familiarity with compliance regulations and data privacy practices related to People More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

IT Risk & Controls Manager UK based

London, United Kingdom
Hybrid / WFH Options
Compre Group
cloud estate. Responsibilities • Collaborate with stakeholders to drive security initiatives and strategy • Implement a best practice IT Controls Framework • Act as the security SME across IT, overseeing security operations, incident management and threat detection • Ensure robust third-party security, including commercial agreements • Implement security policies and standards • Manage cybersecurity risks and response to incidents • Implement plans to meet … security certifications • Champion a security awareness culture through training and engagement initiatives • Work with auditors to demonstrate control compliance and for remediation activities Candidate requirements • Experience in IT Risk Management, Compliance, Internal Audit or External Audit roles - understanding IT security standards and frameworks • Previous work experience in a regulated Financial Services environment - ideally you will have knowledge of the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Head of Technical Operations Centre (TOC) - Media & Advertising

London, United Kingdom
Hamilton Barnes Associates Limited
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incident response, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor selection. … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incident management, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Platform Engineer

London, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at Rise Technical Recruitment. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Platform Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments *Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) *Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click 'Apply Now' or contact Tommy Williams at Rise Technical Recruitment. More ❯
Employment Type: Permanent, Work From Home
Salary: £90,000
Posted:

Data Platform Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Rise Technical Recruitment Limited
Apache Flink, Kafka, and Python.This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS*Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments*Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.)*Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at Rise Technical Recruitment. More ❯
Employment Type: Full-Time
Salary: £80,000 - £90,000 per annum
Posted:

Software Engineer, Python

London, United Kingdom
Hybrid / WFH Options
Cedar Cares, Inc
and implement promising new technologies. Providing operational support for Cboe Europe's trading systems by participating in a production support rota, responding to incidents in line with Cboe's Incident Management and Response processes, and contributing to post-mortem analyses and follow-up actions. Participate in a global software development team The ideal candidate has: Solid Python knowledge … as Apache Kafka Preferred: Familiarity with data pipeline platforms such as Apache Airflow Preferred: Familiarity with Java Preferred: Experience in one or more relevant financial areas (market data, order management, algorithmic trading, financial systems integration, compliance, etc.) Benefits and Perks We value the total wellbeing of our people - including health, financial, personal and social wellness. We believe standard benefits More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Engineering Lead

London, United Kingdom
Bouygues Construction SA
data engineering team and mentoring technical talent Collaborating across functions to define data requirements, structure, and strategy Ensuring adherence to data governance, MDM standards, and nuclear safety protocols Owning incident management processes and technical documentation for live environments To succeed in this role, you'll bring a blend of strong technical expertise and people leadership. We're looking More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Salesforce Developer - Content Integrity

London, United Kingdom
Trustpilot, Inc
code reviews and ensuring solutions align with the broader Salesforce ecosystem strategy. Design and implement Salesforce best practices, focusing on scalable architecture, data integrity, security, and performance optimization. Support incident management and Content Integrity data processes directly within Salesforce through custom development and troubleshooting. Collaborate with stakeholders to gather complex requirements, translate them into technical specifications, and deliver More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
Hybrid / WFH Options
Our Future Health Limited
TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future with employer contributions of up to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (Core Data Services)

London, United Kingdom
Hybrid / WFH Options
Our Future Health
TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future with employer contributions of up to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Dedicated TOC Network Engineer - Media & Advertising

London, United Kingdom
Hamilton Barnes Associates Limited
including check-in, configuration, and monitoring. Perform advanced monitoring and troubleshooting across IP networks (Arista, Net Insight), video encoders (Mediakind, Appear), and system infrastructure (Linux/Windows). Support incident management by investigating and resolving issues, producing detailed reports, and participating in post-incident reviews. Maintain and improve documentation, workflows, and infrastructure relating to the client-specific More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

HR Systems Analyst

City of Westminster, Greater London, United Kingdom
Hybrid / WFH Options
SmartSourcing plc
productivity and performance, ensuring the most effective and efficient use of the systems. They will lead on troubleshooting system problems and customer enquiries, following agreed procedures for system and incident management, including documenting actions and outcomes. They will develop reports using varied report-writing or data extraction tools. They will be responsible for the support of Key system … interfaces Required Experience, Skills and Knowledge: Substantial experience of development and support of iTrent HR applications in particular technical configuration and management of Payroll and Core HR modules. Advanced Excel skills and a proven ability to handle and analyse large quantities of data. Proven business analysis experience including process mapping, and requirements definition. Experience of leading on Database administration More ❯
Employment Type: Contract
Rate: £45000/annum
Posted:

Wallet Operations Team Lead

London, United Kingdom
Hybrid / WFH Options
Ledger Enterprise
Wallet Operations Lead will also ensure that both Ledger & our partners performance is in line with SLA's and expectations. This role requires a proactive approach to excellent stakeholder management & communication skills, the ability to build out robust, well documented processes and governance within a dynamic and fast paced environment. Your mission : Partners & Stakeholder management: Govern partner contracts … partners. Support advertising strategies to maximize revenue and enhance user value on Ledger Live. Performance Monitoring and Response: Continuously monitor system alerts and performance for incidents. Develop and refine incident monitoring and response processes. Create playbooks for 24/7 teams, ensuring their training and confidence in handling off-hour issues. Communication and Reporting: Ensure that lines of communications … write clear, process and governance documentation & playbooks Bonus Points Proficiency in Google Workplace suite (Docs, Sheets, Slides, Forms) Familiarity with monitoring tools and software. Experience in ITIL or other incident management frameworks What's in it for you? Equity : Employees are the foundation of our success, and we award stock options so you can share in that success More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

IT Services Manager IT & Security London Improbable London Improbable

London, United Kingdom
Improbable Worlds Limited
driven to uplift and challenge those around them. You will provide high quality central services to the Improbable group and our network of venture businesses through the leadership and management of the IT Support, Tech Ops, and Technical Security functions. Your role will be to ensure that all company services are deployed, managed, and supported at the highest levels … implementation, and maintenance of security infrastructure, systems, and applications. Maintain security controls aligned with recognised international standards, and conduct internal and external audits. Own our service catalogue, internal processes, incident management and security controls. Manage IT budgets and ensure cost-effective allocation of technology resources and overseeing procurement. Ensure our suppliers and vendors are appropriately assessed against our … as ISO 27001:22 (preferred), NIST CSF or SOC2), including internal and external auditing. Have a track record of motivating and organising technical teams. Understand Networks, Security, Firewall, Vulnerability Management, SIEM and EDR technologies. This role would benefit from exposure to the following: Jira, Confluence, Google Workspace, Google Cloud Platform, Azure, Slack, and ITIL framework. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Incident Management
London
10th Percentile
£50,000
25th Percentile
£62,000
Median
£67,500
75th Percentile
£96,250
90th Percentile
£120,000