Mulcture House, 11 Mulcture Hall Rd, Halifax, England Hybrid / WFH Options
Calderdale Metropolitan Borough Council
curious, analytical thinker with a passion for technology and problem-solving, we want to hear from you! Role Support cyber security risk assessments, cyber security audits and cyber security incident management. Identify security threats and hazards to a system, service or processes to inform risk assessments and design of security features. Liaise with colleagues across the Council to ensure More ❯
platforms. This is a hands-on leadership role - you won't just guide others, you'll be the go-to expert when systems are under pressure. You'll lead incident response, own root cause analysis, and solve performance issues like memory leaks, outages, and flaky services. You will take ownership of the site reliability and drive that as a … discipline. Your focus will include: Leading incidentmanagement, post-mortems, and blameless RCAs Building scalable, resilient microservices with the dev teams Uplifting observability Improving alerting, monitoring, and system-level metrics Driving better SLOs, SLIs, and overall uptime What you'll bring: Experience in high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background … in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front The stack includes Kubernetes, Terraform, AWS, Python, and modern CI/CD tools, and it's evolving. If you understand what a good SRE practice looks like, and want to leave systems in a better place than you found them More ❯
Camberley, Surrey, United Kingdom Hybrid / WFH Options
North SP Group Limited
in new and exciting ways. Responsibilities About the Role: This for a Client Services Project Manager is assigned to a specific campus and are responsible for the control and management of all client requirements within their Area of Responsibility (AOR). This role will be direct client facing and work in support of client requirements. Predominantly based between Corsham … Update Client Fit Out Schedule. Update Risks and Issues Log as required. Update Supplier Portals if applicable. Resolve any incidents and confirm to Head of Client Services when complete. IncidentManagement and Resolution. Project Closure On completion inspect installation with supplier. Create Snag List and confirm Timescales for resolution. Issue Practical Completion certificate (PCC) and arrange handover with … Skills & Experience Required: Strong technical knowledge of both: Data Centre Infrastructure (power, cooling, cabling, fitouts) Physical Security Systems (access control, CCTV, intrusion detection) Extensive experience in project or operations management within data centres, critical infrastructure, or physical security environments. Demonstrated success in building and leading high-performing teams. Experience with P&L ownership, budgeting, and financial reporting. Strong client More ❯
review process for ongoing service improvement Assess the most effective methods and technologies to approach tasks, with a focus on automating processes where possible Establish and implement a document management framework ensuring consistent production and updating of documents that support the incidentmanagement process APPLY NOW More ❯
review process for ongoing service improvement Assess the most effective methods and technologies to approach tasks, with a focus on automating processes where possible Establish and implement a document management framework ensuring consistent production and updating of documents that support the incidentmanagement process APPLY NOW More ❯
City of London, London, England, United Kingdom Hybrid / WFH Options
Next Employment
people, processes or facilities. THE ROLE · Assist in the development, execution and maintenance of business continuity documents including business impact analysis and office risk assessments. · Support business recovery, crisis management, and disaster-preparedness planning within the organization. · Disseminate information on business continuity processes, standards and initiatives. · Assist in coordinating business continuity exercises and testing of the mass notification system. … Provide support throughout live events that impact the business continuity plan. · Engage, deliver and track business continuity training to new members on local crisis management teams. IDEAL CANDIATE · The ideal candidate will have excellent verbal and written communication skills with the ability to coordinate multiple projects/assignments simultaneously and complete tasks accurately and on a timely basis. · Experience … notification system platforms (AlertFind, Send Word Now, Everbridge), business continuity software (Fusion, Cobalt, LDRPS, etc.) and Power BI. · Experience in a BCM or related role · Experience with crisis/incidentmanagement and IT DR · Good understanding of BCM standards · CBCP designation a plus · Excellent verbal, written and interpersonal skills · Proficient in Microsoft Office Suite, e.g. Excel, Word, PowerPoint More ❯
Deployment operating systems To train/coach specific Service Bench/Click users to ensure maximum output is achieved To own the Service Bench & Click risk/system failure incidentmanagement for Field/Central Operations. Review and adjust tactical Service Bench/Click settings to ensure the best Customer experience in the most efficient way To remain … data into tangible actions Knowledge of the Field Operation and the variables to deliver an efficient workforce. Desirable: Understanding of the IT infrastructure required for a deployment system. Stakeholder management experience. Operational experience of IFS and PSO More ❯
behind the curtain, ensuring our critical systems are always reliable, available, and performing like a dream . We're talking about implementing smart automation, sharp monitoring, and super-speedy incident response strategies to keep everything running smoothly. You'll be working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with … be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs . You'll also be leading the charge on incidentmanagement , getting to the bottom of any issues and making sure we learn from them. Monitoring & Alerting Maestro: Setting up and maintaining top-notch monitoring systems (like Dynatrace … craft alerting systems that give us a heads-up before problems even get a chance to impact our players, and you'll define key metrics to measure system health. Incident Response Ace: When things get a bit wobbly, you'll be on the front lines, resolving incidents fast to minimize downtime. After the dust settles, you'll lead the More ❯
leaders and 3,000 organisations across the world, with clients across the Fortune 500, FTSE 100, and OMX 30. In 2024 we received substantial backing from K1 Investment Management - the leading B2B Enterprise SaaS investors. We are at the beginning of significant growth, and we're looking for superb talent to join us on this journey. The team is … queries and exceptions to pre-empt future issues and suggest product or process improvements. Champion process improvements and documentation, identifying inefficiencies in support workflows and helping refine the overall incident handling lifecycle. We are looking for a motivated individual with a strong technical background and a passion for delivering exceptional customer service. Key technical and professional skills include: Ability … to investigate and resolve complex technical issues using tools like Datadog, Bugsnag, and JIRA, and interpret log data to identify root causes and trends. Familiarity with incidentmanagement workflows using tools like PagerDuty and Bugsnag, with the ability to prioritise, document, and escalate issues appropriately based on severity and impact. Knowledge of APIs, SSO, and web technologies to More ❯
including querying, regex, alerts, and dashboards setup (no need for Splunk admin knowledge). Application log analysis skills and prior experience troubleshooting issues in production environments. ITIL, ticket, and incidentmanagement skills with relevant work experience. Proactive approach to production issues, including notifying the team about ongoing and potential future issues. Ensuring client SLAs are met by managing … deliverables for critical applications and understanding IT/business SLAs. Collaborating with Development and Level 3 support teams on incident triages, release/change reviews, and application stabilization enhancements. Handling major incidents, engaging relevant teams, creating post-mortems, and ensuring incident closure. More ❯
data and security consistency across all People Tech platforms. As an Engineer, you will triage and respond to queries from across the business. Alongside this, collaborate with People Tech Management, Product and Principal Engineers and other workday functional areas on any data requirements focusing on Workday support with additional cross-functional support opportunities in other technologies. What you will … technologies that could be used to improve effectiveness, efficiency and user experience Support/provide input to develop the HR technology systems, service road map, change control processes and incidentmanagement frameworks Provide technical content to support the development of materials to effectively train end-users on how to use the HR system On-call Support as required … developer in implementing and managing Core HCM/HRIS (specifically Workday) platforms and other systems across the employee lifecycle - such as L&D, Talent Acquisition, Service Delivery, and compensation management platforms. 1+ years of proven experience in software development with a proven track record delivering high quality software Familiarity with compliance regulations and data privacy practices related to People More ❯
development and adherence to good practices. Ensure stakeholders consider ethical, technological, and commercial factors when designing or using new customer data tools. Provide expert guidance and challenge to senior management and employees on data protection obligations and best practices. Act as the primary contact for regulatory authorities, including the ICO. Collaborate with Technology, Legal, HR, and other units to … implement privacy-by-design in new projects and systems. Raise awareness and provide training on data protection principles across the organisation. Data Subject Rights & IncidentManagement: Oversee data subject requests such as SARs, right to erasure, and data portability. Manage data breach response plans and ensure timely reporting to regulators and individuals. Supervise records of processing activities (RoPA More ❯
cloud estate. Responsibilities • Collaborate with stakeholders to drive security initiatives and strategy • Implement a best practice IT Controls Framework • Act as the security SME across IT, overseeing security operations, incidentmanagement and threat detection • Ensure robust third-party security, including commercial agreements • Implement security policies and standards • Manage cybersecurity risks and response to incidents • Implement plans to meet … security certifications • Champion a security awareness culture through training and engagement initiatives • Work with auditors to demonstrate control compliance and for remediation activities Candidate requirements • Experience in IT Risk Management, Compliance, Internal Audit or External Audit roles - understanding IT security standards and frameworks • Previous work experience in a regulated Financial Services environment - ideally you will have knowledge of the More ❯
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incident response, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor selection. … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incidentmanagement, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows More ❯
and video services. Oversee live event execution, SLA compliance, service bookings, and customer support. Act as the senior point of escalation for complex incidents (Tier 3 support). Drive incident response, root cause analysis, and proactive monitoring/reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor selection. … a TOC, NOC, or MCR environment. Strong understanding of live broadcast workflows, encoding, transmission, and routing. Deep knowledge of TCP/IP networking (switching, routing, multicast). Excellent leadership, incidentmanagement, and performance development skills. Strong documentation and process optimisation experience. High-pressure decision-making and problem-solving capabilities. Proficiency with Excel/Google Sheets; adaptable across Windows More ❯
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incidentmanagement, architecture, and performance. The Role: Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at Rise Technical Recruitment. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incidentmanagement, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS *Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments *Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.) *Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click 'Apply Now' or contact Tommy Williams at Rise Technical Recruitment. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Apache Flink, Kafka, and Python.This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incidentmanagement, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch data pipelines using Flink, Kafka, Python, and AWS*Act as an escalation point for critical … of Apache Flink, Kafka, and Python in production environments*Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.)*Comfortable with monitoring tools, distributed systems debugging, and incident response Reference Number: BBBH259303 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at Rise Technical Recruitment. More ❯
and implement promising new technologies. Providing operational support for Cboe Europe's trading systems by participating in a production support rota, responding to incidents in line with Cboe's IncidentManagement and Response processes, and contributing to post-mortem analyses and follow-up actions. Participate in a global software development team The ideal candidate has: Solid Python knowledge … as Apache Kafka Preferred: Familiarity with data pipeline platforms such as Apache Airflow Preferred: Familiarity with Java Preferred: Experience in one or more relevant financial areas (market data, order management, algorithmic trading, financial systems integration, compliance, etc.) Benefits and Perks We value the total wellbeing of our people - including health, financial, personal and social wellness. We believe standard benefits More ❯
Core Platform Technical Manager to lead the design, implementation, and optimisation of enterprise platforms across Azure cloud, data platforms, and business automation. You will oversee end-to-end cloud management, working with Azure providers and internal engineering teams, and drive RPA integration. You will define and apply platform architecture best practices with the Cloud & Shared Platforms Architect, ensuring alignment … functional teams to deliver solutions supporting digital transformation. Present strategic roadmaps and solution designs to senior leadership. Lead platform modernisation, migration, and transformation initiatives within budget. Provide technical leadership, incidentmanagement, and 24/7 on-call support as required Requirements desirable: Microsoft Certified: Azure Solutions Architect Expert Microsoft Certified: Azure Security Engineer Associate Microsoft Certified: Azure DevOps More ❯
Defence's (MOD) military satellite communications system and ground stations. Day-to-day, you'll provide engineering support to the Network Operations Centre, ensuring smooth delivery of services and incident investigations. Provide engineering advice and knowledge to members of the Network Operations Centre. Assist in delivering and maintaining, new technologies and services into the Skynet Network. Lead major incident … Support Engineer Broad understanding of data communications. Significant experience in IP switching and routing. Previous experience or familiarity within satellite communications would be highly beneficial. Previously held responsibilities within IncidentManagement Qualifications for the Network Operations Support Engineer Ideally you will be qualified to CCNA level however, we value difference and don't have a fixed idea when … talk about flexible working - please ask about alternative patterns of work at interview. Closing Date: 17/07/25 Job Segment: CCNA, Cisco, Network, Engineer, CSR, Technology, Engineering, ManagementMore ❯
Wokingham, Berkshire, South East, United Kingdom Hybrid / WFH Options
M Group Energy
call rota with the occasional meetings in Wokingham. What will you be doing? Youll have the ability to act as the on-site technical expert Youll be operationally Monitoring & Management of Voice/Data network and associated systems. Youll be providing 2nd line support to resolve faults and carry out changes on the networks. Youll be providing technical and … fault management capabilities, 2nd/3rd line support to resolve and assist in resolving network issues. Youll be be a part of a 2nd/3rdLine Technical Support & Delivery team focused on complex technical delivery activities supporting specific areas of Customer Services. Other duties will be Complex delivery in line with customer SLAs Active participation in the customers governance … cycles Accurate resource planning and forecasting Complex incidentmanagement (Technology and Solution based). Root Cause analysis for major incidents and escalated faults. What youll bring Youll bring knowledge and experience of LAN/WAN and Wireless LAN concepts, equipment (Cisco), including TCP/IP, MPLS, QoS, VLANs, Cisco Call Manager. Knowledge of security best practices. Youll have More ❯
Coventry, West Midlands, United Kingdom Hybrid / WFH Options
M Group Energy
call rota with the occasional meetings in Warwickshire. What will you be doing? Youll have the ability to act as the on-site technical expert Youll be operationally Monitoring & Management of Voice/Data network and associated systems. Youll be providing 2nd line support to resolve faults and carry out changes on the networks. Youll be providing technical and … fault management capabilities, 2nd/3rd line support to resolve and assist in resolving network issues. Youll be be a part of a 2nd/3rdLine Technical Support & Delivery team focused on complex technical delivery activities supporting specific areas of Customer Services. Other duties will be Complex delivery in line with customer SLAs Active participation in the customers governance … cycles Accurate resource planning and forecasting Complex incidentmanagement (Technology and Solution based). Root Cause analysis for major incidents and escalated faults. What youll bring Youll bring knowledge and experience of LAN/WAN and Wireless LAN concepts, equipment (Cisco), including TCP/IP, MPLS, QoS, VLANs, Cisco Call Manager. Knowledge of security best practices. Youll have More ❯
feedback through Quality Control and our Customer Satisfaction Survey. To provide Service Desk support to clients via telephone and chat Logging and updating support tickets within the Company's IncidentManagement Application Resolving support calls in a quick and efficient manner whilst meeting SLA's To escalate calls where necessary to the Desktop Team and/or Infrastructure More ❯
feedback through Quality Control and our Customer Satisfaction Survey. To provide Service Desk support to clients via telephone and chat Logging and updating support tickets within the Company's IncidentManagement Application Resolving support calls in a quick and efficient manner whilst meeting SLA's To escalate calls where necessary to the Desktop Team and/or Infrastructure More ❯