team builds innovative digital solutions rapidly and at scale to deliver the next generation of banking services for our customers around the world. Service Management's purpose is to protect the availability, integrity, and confidentiality of IT Services that underpin customer and colleagues' experience of the HSBC brands. It … is a multi-functional team comprising Change Management, IncidentManagement, Problem Management, Service Level Management, Outage Management, Service Recovery, and Service Insights and Reporting. We are seeking a senior technology leader to take on the dual role of Senior Recovery Lead and Global Head … of Service Reliability. This is a highly visible, high-impact position reporting to the Global Head of Service Management, with a mandate to transform how we recover from incidents and build long-term service resilience. This individual will lead a global team of technical experts who act as escalation More ❯
Bradford, West Yorkshire, Yorkshire, United Kingdom
Vanquis Bank Limited
will proactively identify, analyse, respond, and mitigate cyber threats that pose risks to Vanquis Banking Groups cybersecurity posture. This involves monitoring security events, conducting incident response activities, enhancing our threat detection capabilities, and ensuring compliance with policy, standards, and regulation. Your contributions will directly impact our ability to protect … you will: Actively participate the delivery of services provided by the Cyber Intelligence Centre including by not limited to Cyber Threat Intelligence, Security Posture Management, Cyber Security Incident Response, Threat Hunting, Penetration Testing & Red Team Testing, and Cyber Risk Mitigation. Incorporate threat intelligence into CIC activities. Collaborate and … assist with the investigation and resolution of complex security incidents. Support the delivery of retrospective improvements based on incident analysis, RCAs and PIRs. Engage with third-party security partners to enhance and mature services. Maintain centralised processes across all VBG product lines, promoting synergy and efficiency. Stay updated on More ❯
recovery, and continuous improvement of Lloyds Banking Group's Public Cloud services across Microsoft Azure and Google Cloud Platform. The role ensures that robust incidentmanagement, problem management and risk governance practices are embedded, with a clear focus on minimising customer impact, reducing service risks, and driving … at the heart of Public Cloud service operations and ensuring alignment to regulatory expectations. Key Responsibilities Proactively manage the end-to-end availability and incident recovery strategy for Public Cloud products and services, ensuring efficient execution of incident, problem, and risk management processes. Drive proactive problem management by leveraging incident analytics, service monitoring, and trend identification to mitigate risks before they impact service availability. You will ensure continuous visibility of service health through proactive monitoring and actionable MI, enabling early risk identification and preventative action. With strong communication skills, you will engage confidently with stakeholders More ❯
and procedures, we believe there is nothing we cannot improve - Assisting and managing relationships with external vendors and contractors - Liaising with internal teams and management groups - Creating and maintaining metrics on all aspects of our Data Centers and utilising those metrics to drive positive changes - Assisting in implementing service … methodologies including incidentmanagement, problem management, change management, capacity management, etc About the team About the team Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. … systems such as feeders, Transformers, Generators, Switch gear, UPS systems, ATS units, PDU units, chillers, pumps, Air Handling units - Proven track record of people management and developing teams and in particular ensuring staff are ready for any and conditions through skill and process development - Ability to solve problems at More ❯
of banking services for our customers around the world. We are seeking a strategic, execution-focused leader to serve as the Head of Service Management for the CTO organization, reporting to the Global Head of Service Management. This individual will be responsible for embedding global service management practices … owned platforms and services-ensuring high reliability, operational rigor, and alignment to enterprise service standards. This role is also a critical partner to Product Management teams, enabling fast-paced innovation by ensuring that platforms and developer services are production-ready, resilient, and continuously improving through operational insights. Job Requirements … Lead Service Management for the CTO Organization Own service management execution across the CTO portfolio, including infrastructure, developer and data platforms, CI/CD tooling, and enabling technologies. Drive adoption of global service management practices, ensuring CTO services are governed by enterprise standards for incident, problem More ❯
Job Title: Service Management Reliability Engineer Overview: The Service Management Reliability Engineer (SMRE) role is a key contributor within the operational organisation, demonstrating broad business knowledge, technical proficiency, and leadership across FCS A2AR Services. The position provides direction and guidance, ensuring the delivery of high-quality services to … availability, suitability, supportability and quality is delivered to our customers. The SMRE will be accountable, responsible, and consulted on the following: Operation Federated Problem Management • Accountable for the delivery of Problem Management for FCS A2AR Services, from seeking trends within IncidentManagement that may be problems … and delivery of the company's business and services in conjunction with the product development function. Design • Consulted on the design of the service management processes that the SMRE are accountable for. • Consulted on the design of business processes for existing and new products, ensuring alignment with MA process More ❯
to make a real impact by ensuring the integrity and resilience of the company’s IT environment against evolving cyber threats. Key Responsibilities: Support incidentmanagement and security response efforts, providing expertise to address and resolve security incidents quickly and effectively. Perform regular security checks, including daily, weekly … Security solutions and network security operations. Understanding of security testing principles, including vulnerability scanning, risk identification, and mitigation. Knowledge of security auditing and security incident response processes. Experience with event and log analysis to monitor and assess security risks. Solid understanding of Disaster Recovery (DR) and Business Continuity principles. … global organisation, apply now. Keywords: Information Security Consultant, IT Security Consultant, Cybersecurity Specialist, Microsoft O365 Security, Enterprise Security Jobs, Information Security Leeds, IT Risk Management, Security Incident Response, Vulnerability Management, ISO 27001, GDPR Compliance, Security Awareness, Disaster Recovery and Business Continuity. More ❯
Middlesbrough, Yorkshire, United Kingdom Hybrid / WFH Options
Causeway Technologies
Knowledge of OWASP vulnerabilities and security testing ISTQB certification Experience with source control tools like Git or Bitbucket Strong problem-solving, communication, and time management skills Minimum of 5 years in a Software Tester role Desirable Experience with Azure, SonarCloud, and CI/CD pipelines Knowledge of TDD practices … and identifying vulnerabilities Development: Designing, coding, testing, and reviewing complex programs Design: Improving product design and data structures Documentation: Creating clear and accurate documentation IncidentManagement: Handling incidents and change requests effectively Database Management: Managing queries and assisting in database design Business Impact: Contributing to project value More ❯
/Senior Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk,... we are here waiting to invest in your career if you are dedicated and have More ❯
/Senior Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk,... we are here waiting to invest in your career if you are dedicated and have More ❯
Middlesbrough, North Yorkshire, North East, United Kingdom
In Technology Group Limited
technical problems while maintaining excellent customer service standards. Key Responsibilities Technical Support: Provide 2nd line support to clients, troubleshooting hardware, software, and network issues. IncidentManagement: Resolve escalated incidents efficiently, ensuring minimal disruption to business operations. Systems Administration: Manage and maintain Windows Server environments, Active Directory, and Microsoft … 365. Network Management: Monitor and troubleshoot network infrastructure, including routers, switches, and firewalls. User Support: Assist end-users with technical queries, offering remote and on-site support as needed. Key Skills & Experience Minimum of 2 years' experience in a 2nd Line Engineer or similar IT support role. Strong knowledge More ❯
/IT Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk, we are here waiting to invest in your career if you are dedicated and have a More ❯
Leeds, WF17, Batley, West Yorkshire, United Kingdom
Pro-Connexions
/IT Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk,... we are here waiting to invest in your career if you are dedicated and have More ❯
Employment Type: Permanent
Salary: £32000 - £35000/annum £32- £35k + £3k in Overtime+ Skill d
/Senior Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk, we are here waiting to invest in your career if you are dedicated and have a More ❯
/Senior Support/Cloud Support/Infrastructure Support must have strong documentation skills, able to adhere to processes and procedures as well as incidentmanagement/escalations from the Service Desk,... we are here waiting to invest in your career if you are dedicated and have More ❯
Employment Type: Permanent
Salary: £45000 - £50000/annum £45- £50k + £3k in Overtime+ Skill d
the highest levels of continuity and performance from their Snowflake implementation. Ideally, you have worked in a 24x7 environment, handled technical case escalations and incidentmanagement, worked in technical support for an RDBMS, been on-call during weekends, and are familiar with database release management. AS A SENIOR More ❯
managing a large team of people on a 24/7 rotating shift across multiple Data Centers. You should be familiar with Problem Change & IncidentManagement, creating & tracking expenditure, continuously design and implement new mechanisms. The role includes mentoring, developing and growing people in all levels, supporting Infrastructure … projects and proposing technical solutions that are scalable. Key job responsibilities Team management of experienced Data Center Managers leading Data Center Technicians. This includes recruitment/people development through coaching and mentoring/performance review/rotation management. Define the strategy to improve the efficiency and productivity of day … Operations. Challenge the quality and quantity of services being provided by the Data Center Operations team and continuously strive to improve our Customer Experience. Management of large-scale events in the Datacenters (crisis situations: thermal or power events, networking issues, etc.) Manage Vendor performance. Ensure Data Centers are compliant More ❯
automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Managing incident response and post-mortem processes to improve system resilience Implementing high-availability … cloud architectural decisions Continuously improving infrastructure reliability and operational efficiency WHAT ARE WE LOOKING FOR IN A CANDIDATE? Experience with SRE principles, such as incidentmanagement, error budgets, and service-level objectives (SLOs) Experience designing and implementing robust observability, monitoring and logging solutions Strong proficiency with observability and … cloud-native applications in production environments Proficiency in capacity planning and performance optimization Experience in managing and improving CI/CD pipelines Knowledge of incident response best practices and on-call operations WHAT IS LIFE LIKE AT SMARTSEARCH? We are a multi-award winning Tech company with an aspirational More ❯
automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Managing incident response and post-mortem processes to improve system resilience Implementing high-availability … cloud architectural decisions Continuously improving infrastructure reliability and operational efficiency WHAT ARE WE LOOKING FOR IN A CANDIDATE? Experience with SRE principles, such as incidentmanagement, error budgets, and service-level objectives (SLOs) Experience designing and implementing robust observability, monitoring and logging solutions Strong proficiency with observability and … cloud-native applications in production environments Proficiency in capacity planning and performance optimization Experience in managing and improving CI/CD pipelines Knowledge of incident response best practices and on-call operations WHAT IS LIFE LIKE AT SMARTSEARCH? We are a multi-award winning Tech company with an aspirational More ❯
APET Incident Imprv. Mgr., EU Trust & Safety - Amazon Partner Escalations Team Job ID: Amazon UK Services Ltd. Your role and responsibilities: • Investigate claims with different levels of severity; • Deep Dive on data from various internal systems; propose remedial action based on investigation findings; production of completed investigation documentation; • Manage … assigned casework and close investigations by meeting the expected quality and SLAs; • Consult and collaborate with business partners across (DSP Management, Corporate ER teams, Legal teams, PR and 3rd Party Vendors, Audit and Compliance). About the team The Amazon Logistics EU Deliver Service Partners (DSP), Amazon Partner Escalations … and long-term strategies to drive improvement and delight Drivers and DSP stakeholders, while protecting customer experience. The Investigations Manager will carry out critical incident investigations and establish, expand and standardize our team's internal investigation processes. The Amazon Partner Escalations Awareness process is to ensure fair and respectful More ❯
Doncaster, South Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
BluePool Computer Services
the needs of different customers and prioritised based on severity. Maintains an extremely positive attitude and demonstrates a high attention to detail. Problem/Incidentmanagement, and the ability to get to the root cause of issues. Immediately flag major incidents and escalate accordingly. Ability to logically troubleshoot More ❯
Information Security Consultant you will work with the wider IT Security Team to support and maintain enterprise wide solutions. The Responsibilities Assist with security incidentmanagement and response activities General day-to-day support on managing and responding to security alerts from systems and end users Perform daily … including Cyber Essentials, ISO 27001, 27002 etc. Data Protection Act and the General Data Protection Regulation Microsoft O365 Security solutions; Networking; Security operations; Vulnerability Management Security Auditing Good understanding of security testing principles, including experience of vulnerability scanning, identifying, resolving, and reporting risks Experience of formal document creation, such … as the creation of reports or procedures Threat Intelligence analysis and best practice Security Incident Response processes, procedures, and best practices Disaster Recovery and Business Continuity principles Event and log analysis If you are looking for an exciting new challenge to join a leading global service provider, please apply More ❯
CI and release pipelines and development environments to facilitate frequent delivery of new product features. In production, SREs perform Tier 1 on-call and incidentmanagement functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment More ❯
microservices, promoting good operational principles during research, design, and development. Engage with current and future technology stacks both in the UK and internationally. Utilize IncidentManagement processes, automating and integrating with Mastercard's ticketing systems. Communicate effectively with internal stakeholders, clients, and technical teams. All About You You More ❯
key responsibilities and accountabilities will include: Accurately record and document incidents in Optums CRM, following ITIL best practices. Diagnose and resolve incidents through the incidentmanagement process. Participate in ongoing projects, providing support and feedback during the project lifecycle. Identify and escalate trends, proposing appropriate solutions. Provide a More ❯