integration, automation, and a mobile-first strategy across a global footprint. Key Responsibilities Platform Ownership & Strategy Act as the primary owner for the Azure cloud environment, including governance, cost management, architecture, and operations. Develop and maintain a scalable, secure, and resilient cloud platform aligned with the firm’s digital transformation goals. Drive and implement the roadmap for infrastructure upgrades … regardless of time of day is essential. Lead by example, setting clear expectations and holding individuals and teams accountable for high performance and ethical conduct. Provide direct line management of individuals where appropriate, ensuring clear direction, regular development conversations, and alignment with team and business goals. Operational Management Take full ownership of day-to-day BAU operations … ensuring the cloud platform’s performance, reliability, security, and compliance. This includes patch management across all infrastructure services, as well as ensuring regular reporting is carried out for patch compliance and vulnerability management. Beyond operational stability, the role demands a continuous improvement mindset - proactively evaluating the platform and driving enhancements to optimise efficiency, resilience, and user experience. Manage incident …
role within their shared service IT function, responsible for leveraging the ServiceNow platform and its ServiceNow Performance Analytics and ServiceNow Reporting modules to analyze the company's IT service management data and design meaningful reporting to support IT service reviews and continuous service improvement. The ideal candidate should have a background in IT Business Analysis, Service Reviews, Reporting, and … reviews with internal and external stakeholders using live data from ServiceNow. Build and maintain service performance dashboards and scorecards within ServiceNow and other tools. Analyze incident, request, change, and problem management trends to recommend service improvements. Work with ITSM teams to ensure accurate SLA tracking and OLA performance reporting within ServiceNow. Support the definition and implementation of corrective … and consistency in ServiceNow reporting modules. Gather business and technical requirements for IT reporting needs and translate them into ServiceNow reporting solutions. Present key findings and insights to senior management with clarity and confidence. Serve as a bridge between service owners, process managers, and technical teams. Contribute to the optimization of ITIL-aligned processes, in particular Incident, Request, Problem …
objectives, meet service level agreements (SLAs), and provide a seamless user experience. Key Responsibilities Oversee and improve the entire lifecycle of key ITIL practices and be proficient in Incident, Problem, Change, Asset, Transition and Service Request management, ensuring timely resolution and fulfilment in alignment with service level agreements (SLAs). Manage End to End Service Provision by acting … ensuring seamless service delivery by adhering to established systems, processes, and methodologies. Manage Service Communications by providing regular incident, maintenance and downtime updates as well as reports to Senior Management on all aspects of service performance, ensuring transparency and timely communication on any issues. Manage SLAs/SLOs and develop service excellence by regularly attending internal and … Board and manage the overall process. Monitor and report on service KPIs and performance. Develop and adapt reporting templates and/or metrics to suit ongoing business requirements. Identify problem areas, root causes or general support trends for further review or actioning. Work closely with the Service Desk and initiate periodic quality checks across triaging, ticket queues and general …
and applications, with strong expertise in Azure, Active Directory, VMware, Microsoft, Linux and O365. ITIL certification and a track record of successful change and incident management. Experience in Incident Management, Problem Management, Change Management and Asset Management. Strong communication skills, with the ability to influence senior leaders and manage competing priorities. Strong team leadership skills, with …
City of London, London, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
Production Application Support Engineer (Linux SQL ITIL) London to £95k Do you have experience of supporting systems and users on Capital Markets systems? You could be joining the Investment Management arm of a global bank. As a Production Application Support Engineer you will join a talented, global team responsible for providing first class support to Fixed Income Clients and applications … across Commodities and Futures for Trading, Pricing and Risk management systems. You'll manage production systems, acting as an escalation point for operational teams on a 24x7 basis, taking ownership of Major Incident Management, incident response, problem management and root cause analysis. Employing a data-driven approach you will drive production stability via metrics and reporting … partnering closely with Application Development teams and external vendors, ensuring accurate logging of support incidents, conducting Incident and Problem Review Meetings (PRMs) and tracking action items through to closure. Location/WFH: You'll be based in the London office and have flexibility to work from home once a week. About you: You have achieved a 2.1 or above …
City of London, London, United Kingdom Hybrid / WFH Options
Tate Recruitment
business processes and applications to deliver effective support. Implement code fixes and minor enhancements to existing systems. Act as a subject matter expert (SME) during code reviews and release management processes. Maintain a knowledge base by documenting issues and resolutions to agreed standards. Generate ad-hoc reports for business users, including audit extracts and management information. Identify opportunities … office or customer-facing application support roles. Strong SQL skills, including the ability to write and optimize complex JOIN queries across platforms such as SQL Server and MySQL. Excellent problem-solving skills with a structured approach to root cause analysis. Experience in incident and problem management, including ownership from identification to resolution. Ability to manage multiple priorities …
City of London, London, United Kingdom Hybrid / WFH Options
CLS Group
environment, implementing and maintaining security measures in compliance with relevant policies and regulations. Manage the environment strategies for EPM and ERP environments and administer access to those environments. Incident & Problem Management: Lead technical investigations into complex problems, working with other teams to identify root causes and implement permanent solutions. Take the lead during critical incidents and outages, managing …
positive impact. What are we looking for? 7+ years of SAP Support leadership roles with a minimum of 2 full-lifecycle S/4HANA implementations. Strong knowledge of SAP Transport Management. Proven experience in ITIL-based Change, Incident and Problem Management. Solid understanding of end-to-end SAP modules/taxonomy and how they interact. Experience with transition … team. Lead a small team and all Service Support activities across the global S/4HANA Landscape and collaborate/work with existing BAU Team. Manage Release and Transport Management Change and Configuration Management Tools and Governance Management Service transition ensuring smooth handover from implementation with a focus on readiness, documentation and training Governance, Reporting and continuous … improvement on system health, change volumes, incident trends and release performance. Manage cross-team and stakeholder relationships to drive collaboration and meet shared goals. Apply problem solving and critical thinking to enable the identification of Technology and Risks associated. Work effectively in a diverse team within an inclusive team culture where people are recognised for their contribution. Stay updated on …
and resolve issues with a focus on minimizing business impact. Collaborate with regional teams (London and Aberdeen) to deliver stable and secure platform operations. Engage in Incident, Change, and Problem Management using ITIL processes. Implement continuous improvements in real-time support and operational efficiency. Align operations with regulatory frameworks, including EMIR and REMIT. Provide insight and support across …
Production Support Engineer to join our dynamic Front Office FX technology team. This is a high-impact role supporting the bank’s electronic FX trading platforms. Key Responsibilities Incident Management: Respond rapidly to production incidents with data-driven decision making, minimizing downtime and financial impact. Lead root cause analysis and conduct blameless post-mortems. Monitoring & Automation: Enhance application health … and scalable operations. Collaboration: Partner with development teams to design and deploy fault-tolerant, scalable solutions aligned with business goals. Governance & Compliance: Enforce and adhere to change, incident, and problem management policies, as well as bank-specific non-financial risk frameworks. Mentorship: Support the growth of junior team members and promote a culture of engineering excellence and continuous …
City of London, Greater London, UK Hybrid / WFH Options
Cruinn Consulting
cloud services. Manage provisioning, configuration, and lifecycle of cloud resources. Monitoring & Automation Proactively monitor systems and resolve issues. Implement automation tools and scripts for deployment, scaling, and maintenance. Incident & Problem Management Investigate and resolve incidents and root causes. Drive improvements to cloud reliability and service quality. Governance & Documentation Maintain up-to-date procedures and compliance with security standards.
troubleshoot real-time issues in a live airport environment under pressure. Experience working in multi-vendor or airline operational setups involving airport and third-party stakeholders. Knowledge of incident management and SLAs, preferably aligned with ITIL standards. Strong documentation and communication skills for logging incidents, updates, and resolutions. Willingness to work in shift patterns, including early mornings, late evenings … airline schedule disruptions and system impacts during IROPs (irregular ops). Experience supporting international airport environments or multi-airline terminals. Ability to perform root cause analysis and contribute to problem management. Basic scripting or automation (e.g. PowerShell, batch scripts) for system checks/log extraction. Awareness of aviation security protocols and operational compliance at airports. Rewards & Benefits: TCS is …
Ensure effective communication with relevant stakeholders within the business and customers as appropriate. Monitor and report on key performance metrics, response times, resolution times, ticket volumes, customer satisfaction. Incident & Problem Management: To maintain ongoing ownership of all incidents and facilitate the communication of incident updates to customers provided by the support teams. Ensure the prompt resolution of incidents … disruption and downtime within the data centre. Proven experience in managing IT service desk operations, preferably in a data centre or mission-critical environment. Strong knowledge of IT service management frameworks, such as ITIL, and experience implementing ITIL processes. Experience managing a 24x7 service desk or support team in a high-pressure environment. Excellent leadership and team management …
help resolve critical incidents and service issues. Lead a small cross-functional product team focused on customer and retailer-facing technologies. Own and evolve the roadmap for CRM, case management, and contact centre tools. Collaborate with stakeholders to gather requirements and maintain a prioritised feature backlog. Work with internal and external tech teams to deliver features into production. Support … incident and problem resolution to ensure system stability and reliability. Monitor key performance metrics and drive improvements based on data insights. Your Profile Proven experience as a Product Owner or in a similar role within product or service management. Background in leading or managing a technical or cross-functional team. Hands-on experience with call centre, telephony, or CRM … OpenText). Familiarity with GDPR and secure handling of personal data (PII). Strong understanding of software development life cycles and agile practices. Experience with ITIL frameworks, incident/problem management, and second-line support.
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
development, QA and multiple production trading systems including some belonging to third party clients. Collaborate with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance … of procedures such as change management, patch management and security and audit processes. Assist in the maintenance of these procedures. Support regular security audits and penetration tests, addressing findings and overseeing any remediation work. Improve system monitoring, alerting, documentation, operating procedures and incident response processes. Manage, mentor, plan and coordinate the activities of both teams. Required Skills/… Experience Ideally 7+ years Linux system administration experience with at least 3 years in a managerial or team lead role. Strong expertise with RHEL-based systems, including installation, ongoing management, monitoring, performance tuning, system security hardening, etc. Proven track record of managing geographically distributed teams, including senior engineers and tier-1/2 support staff including on-call and …
Management. Maintain accurate and timely updates within incident and change tickets, including clear communication to internal stakeholders and external customers during outages or service degradations. Participate in the change management process, including peer review and execution of MOPs to support routine maintenance and network upgrades. Collaborate closely with Australian-based Engineers during shift handovers, ensuring full context and documentation … including Cisco, Ciena, Nokia, and Fortinet (routers, switches, firewalls). Skilled in network performance troubleshooting and event triage within complex global network environments. Familiarity with Data Centre environments including rack management, power, and thermal considerations. Understanding of security compliance and physical access control processes within telecommunications facilities. Hands-on experience with service management systems (Jira Service Management or … similar) across incident, change, and problem management workflows. Working knowledge of ITIL v3 or v4 principles including Service Operations and Service Transition. Competent in executing and reviewing Method of Procedures (MOPs) for planned work and changes. Experience with monitoring, alerting, and observability platforms such as Observium, MCP by Ciena, and New Relic. Clear and calm communicator with the …
stakeholders to identify gaps, define enhancements, and contribute to strategic product selection. Process & Workflow: Promote best practices, highlight impacts, and ensure demand prioritization aligns with business objectives. Incident/Problem Management: Manage and resolve incidents, issues, and risks via Jira, identifying root causes and developing creative solutions. Embedding & Adoption: Partner with users to ensure effective embedding and scalable … Assist in defining new business requirements, designing processes, and maintaining strong relationships with investment teams. Required Experience & Skills: Mandatory Aladdin SME expertise. Experience as a change leader within Asset Management or a similar organization. Strong knowledge of end-to-end investment processes and a broad range of investment products. Proven ability to define business requirements, negotiate solutions, and operate …
City of London, London, United Kingdom Hybrid / WFH Options
OneAPPS Consulting
systems performance and reduce cost. 4. Track and analyze performance and resource utilization of CICS systems. 5. Recommend changes, upgrades, and enhancements based on the technical analysis. 6. Perform problem diagnostics and resolution. 7. Participate in incident and problem management activities related to applications and CICS. 8. Encourage and ensure collaboration across functional users and … IPCS. Experience in cloning and sunsetting CICS regions. Basic understanding of CICS security with RACF. Exposure to working with SMF 110 records and their practical application in problem determination. Strong problem determination skills in both system and application areas. Proficient in working with high-availability environments in alignment with process (ITIL). Experience working with and …