South West London, London, England, United Kingdom
Michael Page Technology
an escalation point for complex issues that junior technicians are unable to resolve. Incident Management: Take the lead on managing critical incidents, ensuring timely resolution and communication with stakeholders. RootCauseAnalysis: Perform rootcauseanalysis for recurring issues and recommend long-term solutions. Process Improvement: Identify areas for process improvement within the service More ❯
South West London, London, England, United Kingdom
Michael Page Technology
an escalation point for complex issues that junior technicians are unable to resolve. Incident Management: Take the lead on managing critical incidents, ensuring timely resolution and communication with stakeholders. RootCauseAnalysis: Perform rootcauseanalysis for recurring issues and recommend long-term solutions. Process Improvement: Identify areas for process improvement within the service More ❯
own and oversee the resolution of service-related issues from start to finish. This role offers an exciting chance to work in a dynamic environment with complex systems, making rootcause identification challenging. You will collaborate closely with your team and other Service Operations teams, including ITIL functions, Technology Office teams, our Service Providers, and our customers becoming … outcomes to the problems in hand. Lead problem resolution efforts to ensure timely and effective outcomes. Report and escalate when necessary, ensuring full transparency throughout Be accountable for the RootCauseAnalysis lifecycle and ensure accuracy and quality is obtained collaboratively with our 3rd party suppliers. Assess and manage risks associated with services and recurring problems. Work … stakeholders to improve existing processes and performance and feeding back successes and any concerns in implementation of these improvements. Conduct Trend Analyses to identify and eliminate common factors that cause incidents. Then escalate and communicate onwards as necessary with suggested solutions and mitigation. Ensure awareness of workarounds are communicated to wider business areas. For example, and not limited to More ❯
Systems) job: Maintain and improve the Business Management System in line with ISO9001 and TL9000 standards. Lead internal and external audit programmes, ensuring timely closure of findings and robust rootcause analysis. Conduct risk and opportunity assessments across all business activities, supporting mitigation planning. Define, write, and maintain quality procedures and process documentation with input from stakeholders. Coordinate … years of experience in a similar role, preferably in a manufacturing or engineering environment. Certified ISO9001 Lead Auditor. Lean Six Sigma Black Belt or equivalent. Strong background in rootcauseanalysis, internal audits, and quality system improvement. Experience working with multidisciplinary teams and communicating with internal and external stakeholders. Proficient in MS Office, SAP, JIRA, and other More ❯
site specific improvements, region wide improvement programmes, and oversight and reporting throughout. The Engineer plays a key role during incident and follow-up, including taking part in incident calls, rootcauseanalysis exercises, and supporting sites throughout resolution of problem tasks. The Engineer may play a consultative role in the review of high-risk changes and incident … discipline of Electrical engineering to the level of subject matter expert. Working Knowledge of Plan Do Check Act (Deming cycle) highly desirable. Experience of taking part in and conducting rootcause analyses, including awareness of the relevant frameworks. Experience working with computerised maintenance management systems. Experience required in management of external resources. Can prepare & present relevant information to More ❯
site specific improvements, region wide improvement programmes, and oversight and reporting throughout. The Engineer plays a key role during incident and follow-up, including taking part in incident calls, rootcauseanalysis exercises, and supporting sites throughout resolution of problem tasks. The Engineer may play a consultative role in the review of high-risk changes and incident … discipline of Electrical engineering to the level of subject matter expert. Working Knowledge of Plan Do Check Act (Deming cycle) highly desirable. Experience of taking part in and conducting rootcause analyses, including awareness of the relevant frameworks. Experience working with computerised maintenance management systems. Experience required in management of external resources. Can prepare & present relevant information to More ❯
expert teams to support application-level integration, reporting, and analytics. Cross-Functional Collaboration : Work with enterprise teams on UX design, security best practices, cloud strategies, and platform engineering. Technical Analysis : Lead technical analysis and estimation efforts for custom-built applications. Best Practices : Drive the adoption of release management and automation best practices. Incident Management : Ensure thorough rootcauseanalysis and prompt remediation during any incidents or outages. YOU'RE GOOD AT You bring solid development and program leadership experience to drive technical governance, innovation, integrations, and cloud strategies using emerging technologies like Gen AI. You thrive in environments that demand independent problem-solving, analytical thinking, and clear communication. In this role, you will: Demonstrate More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
application, API, and infrastructure issues across multiple environments (mainly on Azure). Collaborate with development, DevOps, and product teams to resolve complex technical issues. Manage incident response and provide rootcauseanalysis (RCA) for platform outages. Automate repetitive support tasks using scripting (Python, Bash, PowerShell). Maintain documentation of processes, troubleshooting steps, and known issues. Ensure adherence More ❯
premises, hybrid, and cloud environments (e.g., AWS, Azure, GCP) for M&A activities. Participate and lead from a Nuveen International perspective, troubleshooting efforts for complex infrastructure issues and ensure rootcauseanalysis and documentation working with TIAA infrastructure teams Develop and maintain architecture diagrams, runbooks, standards, and documentation for infrastructure systems. Participate in and lead infrastructure assessments … or CompTIA Security+. Experience in regulated industries (e.g., healthcare, finance, government) Familiarity with containerization and orchestration e.g., Docker, Kubernetes) Related Skills Accountability, Adaptability, Business Acumen, Cloud Platforms, IT Business Analysis, IT Disaster Recovery, IT Infrastructure, Network Infrastructure, Risk Mitigation Company Overview Nuveen is a global investment leader, managing public and private assets for clients around the world and on More ❯
our personalized learning opportunities - just to name a few! Job Description Your Career You will work firsthand with our valued customers to address their complex post–sales concerns where analysis of situations or data requires an in–depth evaluation of many factors. You're a critical thinker in understanding the methods, techniques, and evaluation criteria for obtaining results. You … issues via ticketing systems, phone, and remote sessions Troubleshoot complex problems at both the application and operating system levels using deep technical knowledge and collaboration with internal teams Identify root causes (code, configuration, or environment), and work with engineering and product teams to deliver permanent solutions Share insights from customer interactions to improve our product and support experience Document … troubleshooting steps and resolutions clearly for both internal and customer use Lead rootcauseanalysis and coordinate corrective actions to prevent recurrence Qualifications Your Experience Mandatory Requirements 🔒 Due to the nature of this role and the customers we support, candidates must either: Have lived in the UK for the last 5 consecutive years, or Hold British Citizenship More ❯
Collaborate with project teams to support cloud migrations and service transitions. Monitor and maintain customer Azure environments using tools such as Azure Monitor, Log Analytics, and Sentinel. Contribute to rootcauseanalysis and implement service improvements. Assist in developing and enforcing operational best practices and documentation. Participate in technical reviews, knowledge sharing, and internal training sessions. Support More ❯
hands-on deep dive when required. Leading the teams during Major Incidents and provide recommendations on fastest path to the major incident recovery or supporting technical delivery teams with rootcauseanalysis for Major Incidents Experience of working on both SAFE/AGILE project delivery Your Profile Essential skills/knowledge/experience: MS Azure solution architect More ❯
hands-on deep dive when required. Leading the teams during Major Incidents and provide recommendations on fastest path to the major incident recovery or supporting technical delivery teams with rootcauseanalysis for Major Incidents Experience of working on both SAFE/AGILE project delivery Your Profile Essential skills/knowledge/experience: MS Azure solution architect More ❯
hands-on deep dive when required. Leading the teams during Major Incidents and provide recommendations on fastest path to the major incident recovery or supporting technical delivery teams with rootcauseanalysis for Major Incidents Experience of working on both SAFE/AGILE project delivery Your Profile Essential skills/knowledge/experience: MS Azure solution architect More ❯
london (city of london), south east england, united kingdom
Tata Consultancy Services
hands-on deep dive when required. Leading the teams during Major Incidents and provide recommendations on fastest path to the major incident recovery or supporting technical delivery teams with rootcauseanalysis for Major Incidents Experience of working on both SAFE/AGILE project delivery Your Profile Essential skills/knowledge/experience: MS Azure solution architect More ❯
ensure successful automation as part of our standard build process. Occasional manual testing when automation is not an option. Report, reproduce, and help development resolve defects, emphasis on troubleshooting, rootcauseanalysis, and prevention of similar issues in the future. Freely debate ideas and rally behind decisions. Pushing for continual improvement in everything we do. Apply technology … databases, including writing queries for validation and verifying data integrity. Experience testing applications running in Kubernetes environments. Familiarity with using monitoring and observability tools like Grafana to support test analysis and validation. Experience troubleshooting and supporting customers with product features, including investigating issues and providing technical guidance. Bias for action and problem solving - eagerness to take initiative and make More ❯
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯
london (city of london), south east england, united kingdom
BGC Group
and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging-related incidents, including rootcauseanalysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low More ❯
City of London, London, United Kingdom Hybrid / WFH Options
The Curve Group
supporting critical business applications across a modern and complex technology stack. In this role, you'll be responsible for: Investigating and resolving technical incidents, ensuring minimal downtime and effective rootcauseanalysis Proactively maintaining and optimising applications, performing upgrades and configuration changes Monitoring system performance, defining service-level objectives, and addressing bottlenecks before they impact users Collaborating More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Computappoint
fast query performance? Secure by Design : Implement bulletproof security frameworks and ensure rock-solid compliance across all data touchpoints? Problem Solver : Dive deep into complex technical challenges, providing expert rootcauseanalysis and innovative solutions? Knowledge Sharer : Develop comprehensive documentation and mentor teams on advanced data engineering best practices What Makes You Perfect for This Role Essential More ❯
teams to uplift knowledge and performance standards. Partner with Senior Data Centre Managers to proactively manage and resolve incidents, ensuring lessons learned are captured and acted upon. Lead detailed rootcauseanalysis of incidents and oversee the implementation of preventative measures. Coordinate service planning and maintenance activities, ensuring minimal customer impact and alignment with business priorities. Collaborate More ❯
teams to uplift knowledge and performance standards. Partner with Senior Data Centre Managers to proactively manage and resolve incidents, ensuring lessons learned are captured and acted upon. Lead detailed rootcauseanalysis of incidents and oversee the implementation of preventative measures. Coordinate service planning and maintenance activities, ensuring minimal customer impact and alignment with business priorities. Collaborate More ❯
teams to uplift knowledge and performance standards. Partner with Senior Data Centre Managers to proactively manage and resolve incidents, ensuring lessons learned are captured and acted upon. Lead detailed rootcauseanalysis of incidents and oversee the implementation of preventative measures. Coordinate service planning and maintenance activities, ensuring minimal customer impact and alignment with business priorities. Collaborate More ❯