Reading, Oxfordshire, United Kingdom Hybrid / WFH Options
Thames Water Utilities Limited
you will be responsible for maintaining SecOps solutions, controls, and processes across the organisation, while mentoring and leading the SOC team to ensure effective management of OT alerts and incidents. This position requires a deep understanding of SecOps concepts, technologies, and best practices, specifically across IT and OT environments. … You will be tasked with ensuring robust incident management, proactive threat detection, and continuous improvement of our security posture. Strong communication and collaboration skills are essential as you will work closely with cross-functional teams to mitigate risks and protect Thames Water's essential services. What you'll … activities such as threat hunting to uncover vulnerabilities and ensure continuous risk reduction. • Provide tangible metrics to demonstrate risk reduction and reduced technical debt. Incident Readiness & Response: • Lead the incident triage and response process, ensuring effective management and remediation of cyber security incidents. • Improve incident management …
role within the IT team, overseeing the helpdesk function that is the entry point for all issues, problems and requests. You will own major incident management, problem management, and change management processes. You will be the driving force behind continual service improvement, ensuring all services are … underpinned by robust ITIL practices and aligned to our strategic and operational goals. Key Responsibilities: Leadership and Team Management Lead and develop the IT Helpdesk team across all tiers (L1, L2, L3) ensuring seamless end-to-end support of the highest quality. Build a team to support the Institute … meet agreed SLAs, OLAs, and customer expectations with a white-glove approach. Champion service excellence and user satisfaction through proactive engagement and feedback mechanisms. Incident, Problem and Change Management Act as the owner for Major Incident Management, ensuring timely resolution and effective communication during incidents. Lead …
Science, or a related technical field 8+ years of experience in enterprise technology roles, with 3–5 years focused on platform operations or service management Hands-on experience with managing GenAI/ML platforms and LLM-based services (e.g., OpenAI, Anthropic, Azure OpenAI, Hugging Face) Proven track record in … implementing and scaling MLOps or LLMOps practices in a production environment Certifications in cloud platforms (e.g., Azure, AWS, GCP) and/or ITIL Service Management preferred Advanced coursework or certifications in AI/ML, MLOps, or LLMOps is a strong plus Ongoing learning and participation in GenAI or platform … tools (e.g., Prometheus, Grafana, Azure Monitor) Exceptional communication and stakeholder engagement skills to partner with business, technical, and governance teams Experience managing platform SLAs, incident management, and continuous improvement cycles in high-availability environments Ability to balance strategic thinking with hands-on execution in a fast-paced, evolving …
Reading, Oxfordshire, United Kingdom Hybrid / WFH Options
Mobile Broadband Network Limited
Incident Assurance Manager Job ID PERM002892ML Department Details The Operational Services directorate is accountable for ensuring the network sites are always accessible and available. It undertakes the operation, enablement, and management of the network infrastructure to enable EE/BT and Three to deliver their best customer experiences … at the lowest cost. Reporting to the Senior Incident Assurance Manager, this role will involve relentlessly managing the delivery of Incident Assurance services by the supplier ecosystem to achieve agreed business outcomes and performance targets set by EE/BT, Three and the MBNL AOP. This is a … minimum of two days per week in our Central Reading office. What you will do: Manage and proactively drive Service and Site Availability, Incident resolution and ticketing KPIs, and quality issues against contractual obligations and industry benchmarks. Identify and contribute efforts which will improve the methodologies, processes …
Aylesbury, Buckinghamshire, United Kingdom Hybrid / WFH Options
Esri Ireland
You will be a key point of contact with our customers, and building strong relationships is core to successful service delivery. Change and release management: Co-ordinate change approvals for planned upgrades and changes, and co-ordinate future releases to environments. Incident management: Lead on incident management for your customers, collaborating with colleagues and ensuring clear, regular communications with the customer. Support and trend analysis: Monitor incident trends and adopt a proactive approach to addressing service-related issues. Report on service performance: Provide customers with ongoing performance reporting, insight and recommendations to provide …
Incident Assurance Manager - RAN/Mobile telecoms 3-Month Contract Reading/Home Join a leading telecom service provider as an Incident Assurance Manager. You'll play a key role in ensuring service stability by overseeing critical incident management processes and coordination across multiple partners and … teams. Responsibilities of the Incident Assurance Manager include: Manage end-to-end incident processes, ensuring swift resolution and minimal business disruption. Coordinate major incident response across partners and tech teams. Lead post-incident reporting and ensure key stakeholders are informed. Enhance incident workflows through automation … and streamlined escalation paths. The successful Incident Assurance Manager will have: Proven experience in IT Operations or Service Management (ITIL environment). Strong stakeholder management, communication, and coordination skills. Ability to stay calm under pressure and solve problems proactively. Experience with ITSM tools (e.g., BMC Remedy). …
Science, Engineering, Data Science, or a related technical field 8+ years of experience in enterprise technology roles, with 3–5 years focused on platform operations or service management Hands-on experience with managing GenAI/ML platforms and LLM-based services (e.g., OpenAI, Anthropic, Azure OpenAI, Hugging Face) Proven track record in implementing and scaling … and platform telemetry tools (e.g., Prometheus, Grafana, Azure Monitor) Exceptional communication and stakeholder engagement skills to partner with business, technical, and governance teams Experience managing platform SLAs, incident management, and continuous improvement cycles in high-availability environments Ability to balance strategic thinking with hands-on execution in a fast-paced, evolving landscape Professional … tuning at scale. Ensure efficient management of AI models to maximize their effectiveness and business impact. Service Management & Support Establish and manage robust service management processes, including incident management, service-level agreements (SLAs), change management, and continuous service improvement for GenAI platforms. Drive excellence in service delivery through proactive support and management strategies. Governance & Compliance Alignment Ensure platform …
expertise is VIRTUS' greatest strength. Job Summary Reporting to Senior Director of Operations, the primary function of the role is to provide leadership, direction, management and oversight on the day-to-day operations of the VIRTUS Data Centres facilities operations. This includes ensuring the safety, reliability, efficiency, and operational … Continuous Improvement: Champion the implementation of innovative practices, processes, and technologies to drive improvements in service delivery, operational efficiency, and customer satisfaction. Strategic Financial Management: You will have comprehensive control over the operational and capital expenditure, managing a significant budget with a keen eye on cost optimisation without compromising … Drive initiatives aimed at enhancing operational capabilities, streamlining processes, and implementing state-of-the-art solutions to bolster efficiency and service quality. Performance & SLA Management: Ensure that all data centres facilities operations consistently meet or exceed their 100% uptime SLA and operations goals. Address performance and goals challenges promptly …
Milton Keynes, Buckinghamshire, United Kingdom Hybrid / WFH Options
The Open University UK
functionality Lead Testing Efforts: Design and execute test plans to validate software quality, ensuring functionality, performance, and security requirements are met Support Software Configuration Management: Manage software configuration and version control, ensuring that changes are tracked, documented, and easily retrievable Data Modelling and Database Design: Contribute to data modelling … testing to ensure systems meet performance, scalability, and security standards Support Application and System Operations: Provide support for application and system operations, assisting in incident management, including out-of-hours as required, and ensuring smooth functioning of services Availability and Capacity Planning: Contribute to managing system availability and … capacity, ensuring that services are reliable and scalable to meet current and future needs Problem Management: Participate in problem management processes, helping to identify and resolve underlying issues to improve system stability Skills and Experience Software Development Lifecycle: Solid understanding of the software development lifecycle, including design, development …
with the Head of IT, Applications Manager, Indirect Procurement and Legal team leads to size and implement future support requirements, including 24/7 incident management for global operations. * Manage D365 project budget * Manage, lead and support relevant meetings as appropriate. These must include project progress reports, executive … role implementing and growing D365 F&O - Essential * Experience managing and implementing Power Platform solutions - Desirable * Application specific knowledge (Desirable): o Warehousing and inventory management knowledge o Vendor managed inventory knowledge o Vendor management o Production control and costings o Product information management o PLM integration experience … o Project management and accounting * Experience working with Microsoft 365, Azure Entra ID and SQL - Desirable * Agile Foundation/Practitioner - Desirable * Demonstrable experience engaging with stakeholders on all levels of a business, internal and external - Essential * A team player used to learning new skills & taking on new challenges - Essential …
leadership and capabilities. We’re looking for a Level 3 SOC Analyst to join our client's team, offering expertise in security analysis and incident response to help drive the success of their Cyber Security Operations Center (CSOC). In this role, you will investigate and validate potential security … mentor and uplift analyst skills and act as a key escalation point. The role will involve collaborating with global security teams, including CERT and Incident Management, to enhance overall security capabilities. Key Responsibilities: Advanced Incident Response: Handle escalated security incidents that L1 and L2 analysts cannot resolve … Security Reporting and Advisories: Contribute to or lead the delivery of cyber security reports and advisories to key stakeholders. Residual Risk Assessment: Deliver post-incident analysis, technical lessons learned, and reporting to assess residual risk. Advanced SIEM Tuning: Refine and tune SIEM tools to reduce false positives and detect …
InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment
to communicate and collaborate at senior levels Be part of the on-call team on a rota system Excel Proficient Key Responsibilities: Leadership and Management Lead a team of Service Desk engineers to deliver a strong customer experience Conduct regular performance reviews Able and experienced in motivating and managing … manage service desk performance metrics Develop and implement processes and procedures to enhance team productivity Be the escalation point for the Service Desk for Incident and Major Incident management Lead the onboarding of new Managed Services customers. Service Desk Manager - £40,000 - £45,000 + Bonus …
Woodstock, Oxfordshire, South East, United Kingdom
Owen Mumford Ltd
issues to ensure a prompt closure. (This role will be dual sited across Chipping Norton & Witney) Key Responsibilities: Daily monitoring of the IT Service Management (ITSM) system, ensuring all aspects of the Incident Management process are followed. Constant monitoring of any unclaimed tickets and changes, in the … Ensure the Service Desk Team Lead is made aware of key support issues and trends identified in the ITSM. Active Directory for Users & Computers management in line with defined departmental processes. Assist with the ongoing management of the Hardware Asset Register. Participation in the out of hours support …
Buckinghamshire, South East England, United Kingdom
Fawkes & Reece
of COINS ERP, and the ability to manage multiple tasks simultaneously. Main Duties and Responsibilities for the COINS Support Analyst: End User Support and Incident Management: Provide COINS support, advice and guidance for internal users, resolving issues within agreed SLAs Log support issues and liaise with 3rd party … application vendors to ensure resolution of service incidents User Access Management: Handle the user lifecycle (starters, movers, leavers), ensuring appropriate access is granted and revoked. Manage delegation of tasks, approvals, and authorisations within the system System Maintenance: Maintain documentation for system configurations and customisations Reporting and Auditing: Conduct periodic … Provide training to end-users on system functions, workflows, and new features Document business processes and develop internal guidance notes and knowledge base Change Management: Participate in change management processes related to COINS, ensuring clear communication with stakeholders regarding system changes or downtime Skills and Experience Required for …
role that requires commitment to providing a high-quality service to meet customer demand. The role is required to deliver IMAC, iMACD, and Breakfix Incident Management Services. The duties range from small alterations such as patching and fault finding to performing large-scale equipment and device relocations including … mechanical and electrical provisioning - Technology component swaps. WTS Control functions Technology room inspections Alarm investigation assistance (CMS/BMS/Peregrine) Identification to DCO management of redundant technology infrastructure Escort and supervision of activities within technology rooms i.e. 3rd party and other bank group activities. Power circuit support (proprietary … Technology component swaps Third party supervision (escort and supervision) WTS Cabling Carry out cable installation services in accordance with STS cabling/patching schedules Management of patching within equipment cabinet Labelling of cabling in accordance with STS standards and requirements Supervision and QA for cabling installs. Recovery of unused …
Provide customer service to internal and external customers to ensure a consistent experience. Adopt a proactive approach towards all client activities. Day-to-day incident management and proactive monitoring of IT Security Systems and associated platforms and components Coordinate small teams delivering security related work packages in line … functions and applications in line with IT industry standards. Demonstrates awareness of health and safety at work. Firewall and network security configuration AV, Patch Management, Endpoint Protection and EDR technologies, CrowdStrike preferred. Competencies & Key Success Factors Proactively managing the security landscape for our customers both internally and externally …
the highest SLA uptimes Consolidation, improvement and expansion of our cloud infrastructure offering Increased visibility of cloud estate and health metrics Greater control and management of infrastructure for production Responsibilities: Effectively deploy, manage and monitor Cloud infrastructure in a LiveOps environment. Scoping, design and implementation of cloud architecture. Implement … maintain and consolidate cloud testing and automation tools. Identifying and deploying cybersecurity measures. Incident management and root cause analysis. Working with our code and build teams to ensure a streamlined workflow. Experienced with version control systems like Perforce and git. A knowledge of creating and maintaining logging, monitoring … and incident response technologies. Experienced with Infrastructure as Code technologies (AWS/Azure). Experienced containerising applications and maintaining containerised infrastructure (ECS, Docker Swarm, Kubernetes etc). Familiarity with CI/CD systems like Jenkins, GitLabCI, CircleCI etc. Experience with Pulumi is desirable. Knowledgeable in Microsoft PlayFab is an …
solving and will already be experienced in a similar role. Essential skills: Windows 7/10, MS Office 365/2013 Active Directory administration Incident Management Tools e.g. Cherwell, Remedy, Assyst Desktop/laptop hardware Desirable qualifications: Any of the following: ITIL Foundation, Microsoft certifications, CompTIA A+ What …
InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment
continuous. “People first” culture The Latest, cutting-edge technology You will join a 24/7 SOC team, involved in the highest level of Incident Response activities, proactive threat hunting and development of detection and use-case capabilities. You’ll be working alongside the engineering team to help find … or similar experience. Microsoft SC-200 Certs are desirable. Excellent communication skills A strong knowledge of sophisticated threat actor methodologies, along with experience in incident response and forensic investigations. This business has a clear picture of where they want to be, have the right individuals steering the ship and … exceptional tech talent leading the way. Impressive customers, flexible working and a company who truly care. Responsibilities: Leading escalated Cyber Incident Management, including Major Incidents and 2nd/3rd line analysis for ongoing investigations. Carrying out proactive threat hunts, RCAs, creation of detection capabilities Monitor/hunt security …