Working closely with other leaders, you'll help set strategic direction and foster a collaborative environment that prioritises engineering excellence and business impact. Key responsibilities Team Leadership and People Management Lead, manage, mentor, and develop 3-5 teams of engineers, supporting both individual growth and team effectiveness Set clear objectives for individuals and teams, monitor performance, and promote accountability … efforts and contribute to hiring guidelines, technical assessments, and shared candidate pipelines Oversee the full operational lifecycle of your teams' products and services, ensuring reliability, robust monitoring, and effective incident response Strategic Direction and Delivery Collaborate with other leaders to define and deliver on long-term team and business goals Set direction and scope for technical initiatives and work … a strong focus on coaching and feedback. Strong track record of delivering technical initiatives that support strategic business outcomes Deep understanding of the software development lifecycle, operational excellence, and incidentmanagement practices Experience navigating competing priorities and keeping teams focused and aligned Expertise in managing end-to-end delivery processes and improving team workflows Effective communicator across technical More ❯
outage time for the customer. Deployment, configuration, and maintenance of power systems (IBM AIX & IBMi) according to best practices and standards Ensuring ITIL/Agile procedures are followed, e.g., IncidentManagement and Change Management processes. Collaboration with other team members or other teams to develop, improve automation strategies and deployment processes. Knowledge sharing, documentation Internal and external … expect from you: Candidates should have expert working knowledge of: IBMi Operating System Upgrades. PTF/Technology Refresh/Service Pack Application BRMS Backup and Restore. 3rd Party System Management and Job Scheduler Package experience (Robot, Revsoft, Halcyon). LPAR configuration. Exposure to High Availability Software – Management of Mimix/ICluster Software. Administration and implementation experience from 7.3 … to 7.5. CL coding capability. VIOS Server skills SEA and NPIV understanding. System Firmware Upgrades HMC Management and Upgrades Tape Library Management Experience of supporting global IT infrastructures in a technical role Discussing technical solutions with customers/suppliers Ability to troubleshoot, research and diagnose root cause for incidents or problems, taking ownership where necessary. Excellent communication skills More ❯
payments service users. Activities will include and not limited to a variety of areas of focus including the execution of Consumer Duty outcome monitoring, FOS reporting, assist in the management of APUK's operational resilience framework, business continuity plan, board of director decks, MIC meetings, and assisting the team in the preparation of required regulatory reports/filing data … simultaneously while also overseeing the execution of daily regulated 'BAU' activities. A Typical Day: Operational Oversight Coordinate and manage the execution of APUK's regulatory obligations, including Operational Resilience, IncidentManagement, and Consumer Duty Perform horizon scanning for forthcoming regulations, supervisory guidelines, or industry initiatives which impact APUK Analyze of company roadmap and identify items which merit additional … continuity test scenarios Take ownership of APUK's operational monitoring as it relates to risk, outsourcing, and regulatory compliance Prepare input & present at APUK governance and oversight forums (including Management Internal Controls and Board of Directors) Communications Oversee the processing of incoming regulatory communications, support escalations & legal correspondences Manage communications with key stakeholder groups (internally & externally) Participate in internal More ❯
Squarepoint is a global investment management firm that utilizes a diversified portfolio of systematic and quantitative strategies across financial markets that seeks to achieve high quality, uncorrelated returns for our clients. We have deep expertise in trading, technology and operations and attribute our success to rigorous scientific research. As a technology and data-driven firm, we design and build … for end to end transaction flow across the trading platform Quickly identify, analyze and correct alerts & issues and/or escalate to minimize trade flow impact and preempt outages Incidentmanagement: create and send reports following established process Intensive interaction with trading community, back office operations, brokers, exchanges, development and infrastructure teams Support scheduled changes: application deployments, network More ❯
Squarepoint is a global investment management firm that utilizes a diversified portfolio of systematic and quantitative strategies across financial markets that seeks to achieve high quality, uncorrelated returns for our clients. We have deep expertise in trading, technology and operations and attribute our success to rigorous scientific research. As a technology and data-driven firm, we design and build … for end to end transaction flow across the trading platform Quickly identify, analyze and correct alerts & issues and/or escalate to minimize trade flow impact and preempt outages Incidentmanagement: create and send reports following established process Intensive interaction with trading community, back office operations, brokers, exchanges, development and infrastructure teams Support scheduled changes: application deployments, network More ❯
assigned customers and interpret issues and potential business impact to provide contextual technical guidance to the support engineers to expedite issue resolution Utilize resources from Performance Engineering, Professional Services, IncidentManagement, and Support Engineering, while also engaging other specialized technical experts for tasks beyond your expertise Willingness to travel regionally to customer locations, deliver on-site solutions, and … as solutions engineering, technical architecture, or data architecture consulting Experience in one of the following industries: Retail/CG, Financial Services, Healthcare, Media & Advertising Hands-on experience in database management, data engineering, and data science Exposure to the partner ecosystem as it pertains to Snowflake solutions Skilled in resolving complex escalations with senior customer executives Excellent verbal, written, communication More ❯
calm in challenging situations. Computer Operations. Systems/Web Programming (PHP/SQL). Methodical approach. Good at building relationships. Responsibilities Role specific responsibilities Work closely with the Senior Management Team and wider teams to review current systems and business processes and then identify and deliver IT solutions that improve these from efficiency, cost, and end-user perspectives. Attend … Board meetings to present IT reports as and when required. Lead cross-functional project teams to deliver improved systems, processes, and procedures. Ensure effective day-to-day management of the in-house IT support services, ensuring KPI and SLA targets are met around service requests, incidentmanagement, equipment, and IT resource requests. Define and develop a strategic … Cyber Essentials; GDPR; Data security, minimizing data risk events. Exceeding recognized good practice and standards, where possible. Drive the performance of the IT and project teams through effective leadership, management, and development of the team. Chair the Technology Advisory Committee meetings, and participate in key working groups across the organization as needed. Build effective relationships with a wide range More ❯
conferences and meetups. Your Technical Expertise Experience in Operations: You have experience managing production systems, responding to incidents, and implementing best practices. You are familiar with monitoring, logging, and incidentmanagement and have hands-on experience with deployment, configuration, and troubleshooting in live production systems. Experience with Messaging Systems: You have experience with distributed systems that use some … and supporting applications deployed on major cloud platforms such as AWS, Azure, or GCP is highly desirable. Advanced Linux Administration Skills: Familiarity with Linux system administration tasks, including package management, service configuration, performance monitoring, and basic scripting, is a significant advantage. Containerization Technologies: Experience with containerization technologies such as Docker and Kubernetes is a plus. More ❯
day supportability and maintenance of our tools and platforms Collaborate with the team to troubleshoot and resolve issues, shadowing and learning from Mid and Senior-level Engineers Aligns to incident response processes, helping with root cause analysis and problem resolution during incidentmanagement sessions Take ownership and pride in the work delivered, ensure what is delivered is More ❯
technologies Pro-actively define, optimize, automate, publish and communicate deployment patterns for migration Support engineering teams in optimizing the migration path Escalation point for engineering teams in troubleshooting and incidentmanagement Driving the adoption of automation in the platform, using common methods with other platforms Documentation of deployed systems, scripts and other working practices to enable, re-use … AWS Certified Solutions Architect Associate or Professional, AWS Speciality domains (Security, Advanced Networking) A scaled agile framework certification, such as SAFe or Scrum@Scale Managing stakeholders, including users and management Mentoring junior engineers and nurturing their passion for engineering Security Clearance is required for this vacancy. If you are not currently Security Cleared, you will need to be eligible More ❯
NE1, Newcastle upon Tyne, Tyne & Wear, Ouseburn, United Kingdom Hybrid / WFH Options
MFK Recruitment
technical support to the business, across a large customer base with bespoke environments. Providing excellent customer service and being the first point of call for IT support queries Following incidentmanagement processes and ensuring customer tickets are responded to within SLA Previous experience working with call logging and ticketing systems, this role will involve working on tickets, it More ❯
Shefford, Bedfordshire, South East, United Kingdom
Intercity Technology Limited
and resolve Azure-related problems. Qualifications, Skills & Experience: Minimum 2 years experience in a busy service desk environment. Strong knowledge of Microsoft 365, Exchange Online, and Office Suite. Proven incidentmanagement and escalation handling skills. Experience with remote administration of workstations, servers, and network devices. Familiarity with Windows server and client operating systems. Solid understanding of network topology More ❯
service, enabling them to achieve the highest levels of continuity and performance from their Snowflake implementation. Ideally, you have worked in a 24x7 environment, handled technical case escalations and incidentmanagement, worked in technical support for an RDBMS, been on-call during weekends, and are familiar with database release management. AS A SENIOR CLOUD SUPPORT ENGINEER AT SNOWFLAKE More ❯
data products through cloud web applications and APIs. Define key performance indicators (KPIs) and implement monitoring systems for deployed products to ensure continuous operational performance. Define strategy to handle incident management. Engage with RC D&A Platform Lead and Platform Product Owner to scope, plan and implement accelerators. Stay updated with the latest advancements in MLOps. Apply relevant techniques … into projects. Educate D&A on technological advancements in this area. Documentation: Maintain comprehensive documentation for model training pipelines, deployment processes, and code. Partner with the Product Management squad model and provide advice on how inflight projects can utilise ML and AI to generate additional value. What are we looking for? 5-7 years of experience working in a More ❯
data products through cloud web applications and APIs. Define key performance indicators (KPIs) and implement monitoring systems for deployed products to ensure continuous operational performance. Define strategy to handle incident management. Engage with RC D&A Platform Lead and Platform Product Owner to scope, plan and implement accelerators. Stay updated with the latest advancements in MLOps. Apply relevant techniques … into projects. Educate D&A on technological advancements in this area. Documentation: Maintain comprehensive documentation for model training pipelines, deployment processes, and code. Partner with the Product Management squad model and provide advice on how inflight projects can utilise ML and AI to generate additional value. What are we looking for? 5-7 years of experience working in a More ❯
code (IaC) using tools like Terraform. Strong written and verbal communication skills. Previous roles involving Azure administration, Azure development, and/or DevOps processes. Preferred Skills and Experience Proactive incidentmanagement with Azure-based troubleshooting. Python/Go/R. Understanding of generative AI, LLMs, NLP, and machine learning frameworks (TensorFlow, PyTorch, Hugging Face, etc.). Hands on More ❯
technologies. Pro-actively define, optimize, automate, publish and communicate deployment patterns for migration. Support engineering teams in optimizing the migration path. Escalation point for engineering teams in troubleshooting and incident management. Driving the adoption of automation in the platform, using common methods with other platforms. Documentation of deployed systems, scripts and other working practices to enable re-use and More ❯
Red Snapper Recruitment are seeking a detail-oriented and experienced Cyber IncidentManagement (CIM) and Threat and Vulnerability Management (TVM) Governance Analyst to support a robust cybersecurity governance program. This role plays a key part in enhancing and maintaining the integrity of cybersecurity operations through effective data analysis, reporting, and cross-functional collaboration. The successful candidate will … Collaborate with internal teams and external partners to support governance-related functions. Assist with distributing governance reports across organizational leadership and forums. What You Bring: Deep understanding of cyber incident response, patch management, and vulnerability assessment in large-scale environments. Strong problem-solving, analytical, and organizational skills. Ability to build strong working relationships in a global, distributed team More ❯
Red Snapper Recruitment are seeking a detail-oriented and experienced Cyber IncidentManagement (CIM) and Threat and Vulnerability Management (TVM) Governance Analyst to support a robust cybersecurity governance program. This role plays a key part in enhancing and maintaining the integrity of cybersecurity operations through effective data analysis, reporting, and cross-functional collaboration. The successful candidate will … Collaborate with internal teams and external partners to support governance-related functions. Assist with distributing governance reports across organizational leadership and forums. What You Bring: Deep understanding of cyber incident response, patch management, and vulnerability assessment in large-scale environments. Strong problem-solving, analytical, and organizational skills. Ability to build strong working relationships in a global, distributed team More ❯
Role Title: Global Incident Manager Duration: 6 Months Location: Sheffield 3 days a week on site Umbrella only £ 535 Would you like to join a global leader in consulting, technology services and digital transformation? Our client is at the forefront of innovation to address the entire breadth of opportunities in the evolving world of cloud, digital and platforms. We … are seeking an experienced and proactive Global Incident Manager to join our team in a shift-based role. This position is critical to ensuring the stability and resilience of services within a fast-paced financial services environment. The successful candidate will lead the end-to-end incidentmanagement process, leveraging deep expertise to minimize service disruption and … uphold regulatory and operational standards. Key Responsibilities: IncidentManagement Lead the incidentmanagement process across global operations, ensuring all incidents are logged, tracked, and resolved promptly. Apply deep knowledge of incidentmanagement frameworks within the financial services sector to reduce downtime and maintain service continuity. Backlog Management Support the resolution of the existing More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom
Experis
Role Title: Global Incident Manager Duration: 6 Months Location: Sheffield 3 days a week on site Umbrella only £ 650 - £700 Would you like to join a global leader in consulting, technology services and digital transformation? Our client is at the forefront of innovation to address the entire breadth of opportunities in the evolving world of cloud, digital and platforms. … We are seeking an experienced and proactive Global Incident Manager to join our team in a shift-based role. This position is critical to ensuring the stability and resilience of services within a fast-paced financial services environment. The successful candidate will lead the end-to-end incidentmanagement process, leveraging deep expertise to minimize service disruption … and uphold regulatory and operational standards. Key Responsibilities: IncidentManagement Lead the incidentmanagement process across global operations, ensuring all incidents are logged, tracked, and resolved promptly. Apply deep knowledge of incidentmanagement frameworks within the financial services sector to reduce downtime and maintain service continuity. Backlog Management Support the resolution of the More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
passionate innovators. Our objective remains unwavering: to elevate business communication to new heights. With ambitious growth plans, we aim to expand our already impressive range of products. The Incident Manager plays a pivotal role in developing and maintaining robust incidentmanagement processes throughout our organisation. This position goes beyond merely implementing best practices; it encompasses taking the … across various platforms and regions, championing continuous improvement through in-depth Root Cause Analysis (RCA) and swift resolution of actions. The Role: Drawing on your previous experience in implementing incidentmanagement processes, you'll be responsible for shaping and enhancing our current framework, collaborating with stakeholders across the organisation to optimise our approach to live incident management. … delivery. Your mission is to empower the business to elevate its response, reaction, and resolution strategies for major incidents, leading to improvements in our software, customer experience, and communication. IncidentManagement Process Implementation: Create and implement a resilient incidentmanagement process to effectively tackle incidents across our various platforms. Incident Response, Resolution and Communication: Take More ❯
We are seeking an Incident Response Lead to lead on the University's cybersecurity incident response and operational resilience efforts. The postholder will be responsible for the development and adoption of a University wide standardised approach to Incident Response, advancing the University's capability to manage cyber incidents effectively, and thereby protecting our students, staff, research, and … contractual obligations. Based within the Information Security team, you will be the sole Incident Response Lead, providing direction and knowledge to navigate and effectively handle incidents. You will ensure effective incidentmanagement by overseeing the effective coordination and escalation across internal departments while engaging with external stakeholders, vendors, and UK authorities such as the National Cyber Security … Centre (NCSC) . This is an exciting opportunity for a proactive professional to shape and strengthen the University's approach to cyber incident management. Job Purpose The Incident Response Lead will lead on the University's/Information Security's response to operational resilience incidents (cyber) as well as the University' engagement with external stakeholders and vendor engagement. More ❯
Business Continuity Coordinator Job Function: Job Term: Job Region: Chorley TVS are recruiting an Information Security & Business Continuity (ISBC) Coordinator to develop and maintain an already established information security management system certified to ISO27001 and a business continuity management system certified to ISO22301 across several UK sites. The successful candidate will have a working knowledge of ISO standards … understand risk management and be able to communicate effectively at all levels. Job Responsibilities Support the maintenance, development and continual improvement of ISBC Management System Coordinate and assist in internal audits to maintain ISO 27001 and ISO 22301 compliance Track and follow up on corrective and preventive actions resulting from audits or incidents Maintain documentation, records, and registers … in accordance with ISO standards Assist in managing the risk assessment and treatment processes Monitor compliance with policies, procedures, and controls Support incidentmanagement and business continuity testing activities Organise and deliver awareness training and communication efforts related to compliance topics Contribute to and partake in external, regulatory and customer surveillance visits Help ensure that day-to-day More ❯
The Spotify Security team is looking to enhance our incident response capabilities with a hardworking and collaborative security engineer focused on incident management. If you thrive under pressure and enjoy working with partners across the company to improve our containment and response efforts, then apply now! Locations London Stockholm What You'll Do Drive continuous improvement of Spotify … s security incidentmanagement process, identifying areas for enhancement and implementing changes. Collaborate with compliance teams to ensure incident processes meet all regulatory requirements while remaining lean and adaptable. Utilize security technologies (e.g., SOAR, SIEM), communication platforms, and automation tools to accelerate response and train responders on their use. Develop automation and response capabilities to speed up … investigation and response, leveraging our defender's advantage. Coordinate scheduling for incident managers and responders to ensure coverage and readiness. Create and deliver training programs for incident responders to maintain high incident readiness. Participate in and lead responses to security incidents, ensuring swift action, process adherence, and documentation for improvement. Work closely with IT, infrastructure, legal, and More ❯