is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. To be successful in this role as an Infrastructure Engineer - Production Network Engineering, you should possess the following skillsets: Extensive experience as an individual contributor in the design …
derbyshire, midlands, united kingdom Hybrid / WFH Options
The Aspire Educational Trust
management and support of the ICT systems and services within the Trust's schools. · To manage and maintain the Trust's cyber security policies and procedures and manage the incident responses across the Trust and schools within the Trust. · To be an active member of the Central Team, to develop technological strategy and policy. · To advise on and support …
staffordshire, midlands, united kingdom Hybrid / WFH Options
The Aspire Educational Trust
management and support of the ICT systems and services within the Trust's schools. · To manage and maintain the Trust's cyber security policies and procedures and manage the incident responses across the Trust and schools within the Trust. · To be an active member of the Central Team, to develop technological strategy and policy. · To advise on and support …
wilmslow, north west england, united kingdom Hybrid / WFH Options
The Aspire Educational Trust
management and support of the ICT systems and services within the Trust's schools. · To manage and maintain the Trust's cyber security policies and procedures and manage the incident responses across the Trust and schools within the Trust. · To be an active member of the Central Team, to develop technological strategy and policy. · To advise on and support …
manchester, north west england, united kingdom Hybrid / WFH Options
The Aspire Educational Trust
management and support of the ICT systems and services within the Trust's schools. · To manage and maintain the Trust's cyber security policies and procedures and manage the incident responses across the Trust and schools within the Trust. · To be an active member of the Central Team, to develop technological strategy and policy. · To advise on and support …
As the IT Operations Centre Team Leader, you'll be at the core of digital operations, leading a skilled team of analysts responsible for system monitoring, availability, and rapid incident response across one of the UK's largest and most complex university infrastructures. Within your role you will: Lead, coach and inspire your team to deliver reliable, responsive … IT monitoring and support. Embed ITIL best practices and ensure standards are met for incident, problem and change management. Act as a calm, confident escalation point during critical events, ensuring clear communication and quick resolution. Collaborate with experts and partners to optimise monitoring tools, drive automation, and improve service resilience. Champion continuous improvement, building a culture that values learning … experience leading IT operations or service monitoring teams (preferably in a 24/7 or mission-critical environment). Strong understanding of ITIL frameworks and operational processes such as incident, change and problem management. Hands-on experience with monitoring tools (e.g. SolarWinds, Zabbix, Nagios). Familiarity with CMDB management and configuration best practices. As a leader, you'll balance accountability …
St. Albans, Hertfordshire, England, United Kingdom
Method Resourcing
you'll do Lead the design, build, deployment, and operation of critical software systems. Architect and deliver the shift to an event-driven microservices environment. Improve automation, monitoring, and incident response capability. Partner with Product and stakeholders to define and execute the roadmap. Mentor and develop engineers, driving a culture of quality and accountability. What you'll bring …
St. Albans, Hertfordshire, South East, United Kingdom
Method-Resourcing
you'll do Lead the design, build, deployment, and operation of critical software systems. Architect and deliver the shift to an event-driven microservices environment. Improve automation, monitoring, and incident response capability. Partner with Product and stakeholders to define and execute the roadmap. Mentor and develop engineers, driving a culture of quality and accountability. What you'll bring …
driving automation and supporting the development teams with robust CI/CD infrastructure in a hands-on leadership role. KEY RESPONSIBILITIES - Oversee day-to-day cloud operations, including monitoring, incident response and troubleshooting. - Leading and managing short- and long-term project planning. - Developing and implementing cloud governance, security and compliance. - Leading automation and IaC improvements. - Providing mentorship …
Milton Keynes, Buckinghamshire, England, United Kingdom
Big Red Recruitment
driving automation and supporting the development teams with robust CI/CD infrastructure in a hands-on leadership role. KEY RESPONSIBILITIES - Oversee day-to-day cloud operations, including monitoring, incident response and troubleshooting. - Leading and managing short- and long-term project planning. - Developing and implementing cloud governance, security and compliance. - Leading automation and IaC improvements. - Providing mentorship …
Manage and monitor the performance of internal tooling and fraud rules. Work cross-functionally across Operations, Engineering, Product, and Finance to mitigate areas of risk. Represent Trust & Safety during incident response and create mitigation processes. About You: 3+ years of experience in fraud or chargeback operations, investigations, or a related Trust & Safety vertical. Experience with user restriction systems …
Milton Keynes, Buckinghamshire, United Kingdom Hybrid / WFH Options
Rightmove PLC
metrics (CSAT, quality, speed, backlog health) to drive improvements. Analyse service data to identify trends, risks, and opportunities. Oversee resource planning and workload forecasting to maintain smooth operations. Manage incident response standards and escalation processes, reducing friction across CX teams. Leading Teams Lead, coach, and support Team Leaders to build confident, high-performing teams. Take accountability for team …
and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and privacy policies through proactive monitoring and enforcement. Partner with cyber and ISRM teams to integrate secure access into incident response, threat detection, and policy governance workflows. Support enforcement of access policies for third-party, BYOD, and contractor devices using secure, scalable methods. Collaboration & User Experience: Work closely …
teams to improve asset pipeline tooling and runtime systems. Work on system profiling, debugging, and performance tuning. Support CI/CD efforts including build capacity management, automated testing, and incident response. Act as a generalist engineer with the ability to operate across multiple technologies and domains. Requirements: Strong software development experience in C# and Unity. Experience with C++ …
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom Hybrid / WFH Options
The Bridge (IT Recruitment) Limited
SNOPs Lead to adapt the SNOPs roadmap priorities in line with shifts in industry, evolving threat landscape and regulatory requirements. Ensure effective 24/7 security operations (inc. security incident management). Collaborate closely with the Enterprise Resilience function (1st Line of Defence) to ensure integrated risk management and incident response. Promote stakeholder engagement and cross-functional collaboration to … a culture of security awareness and ownership across the organisation. Operational Oversight Ensure high availability, performance, and security of all technology systems and infrastructure. Monitor and improve service levels, incident resolution times, and system reliability metrics. Lead cross-functional coordination for escalations, major incidents, and service continuity planning. Team Leadership & Development Provide leadership and direction to platform tower leads … a complex, global environment. Deep understanding of IT infrastructure, cloud platforms (e.g., Azure), and enterprise collaboration tools (e.g., Microsoft 365). Strong grasp of ITIL-based service management, including incident, change, and problem management. Expertise in security and compliance frameworks, including DORA and Cyber Essentials Plus. Prior hands-on experience in delivering security solutions within enterprise environments. Knowledge of …
Agreements (SLAs) and safety standards. Key Responsibilities: Perform maintenance, repairs, and system switching with minimal disruption. Monitor infrastructure via site inspections and the Building Management System (BMS). Support incident response and execute Emergency Operating Procedures when required. Ensure site safety, compliance, and operational excellence. Requirements: Data Centre experience is preferred but not essential. Electrical or Mechanical trained …
ownership of their customer support operations. This is a fantastic opportunity for a hands-on, process-driven leader. Key Responsibilities: Oversee customer support operations and shift coverage. Manage SLAs, incident response, and escalations. Maintain separate support flows for two brands. Plan and resource staffing models and schedules. Implement automation and AI to drive ticket deflection. Own the knowledge …
Focus Manage Nvidia GPU clusters and related infrastructure. Implement failover, resilience, and resource optimization strategies. Oversee capacity planning and workload scheduling. Monitor performance using Nvidia and HPE tools. Manage incident response, node failures, and access/security controls. Required Skills & Experience Strong understanding of L1/L2 processes and troubleshooting workflows. Experience with cloud, APIs, and distributed systems …
monitoring the platform security and integrate security tools into the S-SDLC. Work with the local DevSecOps team to improve our S-SDLC and take part in our security incident response team. Your Experience & Skills At least 3 years of experience in software engineering. At least 2 years of experience in application security. In-depth knowledge of application …
Functional Alignment: Manage up, down, and across the business, driving tight coordination between sales, product, operations, and leadership to accelerate outcomes and resolve friction fast. Maintain Operational Excellence: Support incident response, troubleshoot issues, and uphold high service levels, ensuring a responsive, dependable experience for every partner touchpoint. What You Bring 5+ years managing high-stakes telecom or key …