Dashboards. Contribute to process and technical capabilities (e.g., Data Modeling, Data Visualizations, Artificial Intelligence (AI), Machine Learning (ML)) to enhance identification of service improvement opportunities. Complex data mining, trend analysis, metric and report production will be required. Identify and review service improvement opportunities with stakeholders based on TR enterprise-wide performance metrics. Proactive collaboration with stakeholders to create and … improvement initiatives. Be responsive to internal stakeholder needs and engage with stakeholders across multiple functions. Typical daily work may include but is not limited to complex data mining, trend analysis, metric and report production, process flow charting, and iterative service improvement activities (e.g. daily standups, data quality checks, change reviews, tool enhancement design and review). Contribute to proactive … enhance service reliability and availability. Support for Service Management activities to ensure a consistent standard of incident, problem, change and other practice areas for enhanced accuracy of data quality, root cause analysis and identification of preventative measures. Support the recurring service performance reporting cycle (e.g., weekly, monthly, quarterly). About You: Experience in enterprise problem management, application …
base articles. Monitor application health using tools and custom dashboards. Support integration and communication between cloud platforms (Azure, Entra ID, Microsoft 365). Contribute to service improvement initiatives, including root cause analysis and automation opportunities. Participate in on-call rotations or after-hours incidents during peak retail periods. Work within established security frameworks and governance. Hybrid working …
hands-on role supporting high-availability systems, rapid deployments, and production incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes … Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with Splunk - Strong problem-solving and analytical thinking …
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process through Root Cause Analysis. Responding to and resolving security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies Understanding of Incident Response, Cyber Kill Chain, ATT&CK · Knowledge & experience of common programming languages, e.g., Python, C++, PowerShell, JavaScript Being able to perform Root Cause Analysis Experience with vulnerability assessments Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi …
City of London, London, United Kingdom Hybrid / WFH Options
Infinigate UK & Ireland
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process through Root Cause Analysis. Responding to and resolving security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies Understanding of Incident Response, Cyber Kill Chain, ATT&CK · Knowledge & experience of common programming languages, e.g., Python, C++, PowerShell, JavaScript Being able to perform Root Cause Analysis Experience with vulnerability assessments Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi …
South East London, England, United Kingdom Hybrid / WFH Options
Infinigate UK & Ireland
configuring, updating, and monitoring security tools and software, such as antivirus, encryption, authentication, SIEM etc. Evaluate, research and manage emerging cyber security threats. Support the incident management process through Root Cause Analysis. Responding to and resolving security incidents and events, such as malware infections, phishing attempts, denial-of-service attacks, data breaches, etc. Liaise with stakeholders in relation … Exposure to security monitoring technologies Understanding of Incident Response, Cyber Kill Chain, ATT&CK · Knowledge & experience of common programming languages, e.g., Python, C++, PowerShell, JavaScript Being able to perform Root Cause Analysis Experience with vulnerability assessments Ability to discover, design and document security implementations. Strong networking skills. Good understanding of securing Cloud technologies through native and multi …
checks to identify process defects Reporting Support the creation of routine reporting packs and dashboards for internal stakeholders, utilising and defining performance metrics - Service Level Agreements (SLAs) etc Conduct Analysis utilising tools such as Excel or PowerBI, to identify trends and opportunities for both system optimisation and improvement in operational performance Continuous Improvement - Operations process optimisation Proactively identify opportunities … generating and maintaining a knowledgeable Problem Solving Critically assess and collaboratively work alongside the function's operations team, managed service vendors and enterprise IT team to identify/support root cause analysis and remediation of issues, incidents and escalation. Bridge the gap by translating business requirements to the Tech team and vice versa Vendor Management Maintain a …
City of London, London, United Kingdom Hybrid / WFH Options
REC SOLUTIONS LIMITED
with development, networks, ops and product teams on strategic IT initiatives. Assist with planning, management and resource allocation of inter-departmental projects alongside the PM team. Oversee incident management, root cause analysis, and rapid resolution of system outages or performance degradation. Ensure compliance of procedures such as change management, patch management and security and audit processes. Assist … understanding of cybersecurity principles and experience implementing security measures in a regulated environment. Ability to coach, mentor, and upskill staff; develop career paths and ensure team resilience. Experience undertaking root cause analysis including prevention-orientated solution reporting. Working experience with deployment tools (e.g. GitLab pipelines) and rollback strategies. Proficiency in managing bare-metal servers, virtualization platforms such …
and performance management. You'll develop and manage insightful dashboards and reports, monitor key performance indicators, and present actionable insights to stakeholders. Your role will also involve predictive analytics, root-cause analysis, risk management collaboration, and ensuring compliance with industry standards. Proficiency in tools like Power BI, SQL, and Python, along with familiarity with PRISM or Primavera … risk performance. Monitor key project performance indicators and identify trends, deviations, and improvement opportunities. Present clear and actionable insights to stakeholders to enable effective project control decisions. Data Modelling & Analysis: Develop predictive analytics models to assess potential project outcomes based on current data and trends. Conduct root-cause analyses of project variances and propose mitigation strategies to … reporting. Project Controls Knowledge: Familiarity with project controls disciplines such as cost control, scheduling, and risk management. Understanding of Earned Value Management (EVM) and techniques like cost forecasting, variance analysis, and benchmarking. About You Essential Bachelor's degree in data science, Engineering, Project Controls, Finance, or a related discipline. Professional certifications (e.g., AACE, PMI-PMP) are a plus. At …
Oversee technology issues management and risk acceptance processes. Lead on the 2LoD review of material Technology Incidents and Risk Events, ensuring that actual/potential losses, fix details and root cause analysis are reported in a timely and accurate manner within risk governance. Strategic challenge of 1LoD identification and evaluation of risks associated with technology regulatory change … of mitigation strategies. Escalate material technology risks and issues within the Chief Risk Office and to wider risk governance and recommend appropriate mitigation. Provide insightful, data-driven technology risk analysis to support risk-based decision-making. Report emerging technology risks within risk governance as part of integrated risk reporting. Provide subject matter expertise on emerging technology risks, including cloud security … as ITIL, COBIT, NIST, ISO. Demonstrable extensive relevant experience of technology and change/operational risk in either a 1LoD or 2LoD capacity (2LoD preferable). Experience in scenario analysis and resilience impact assessments would be advantageous. Core skills and competencies A strong working knowledge of Microsoft products including Excel and Word, strong analytical skills and ability to provide …
and production process; develop documentation (Knowledge Acquisitions Process document) for all deliverables before moving to production to support ongoing maintenance. Create datasets that combine multiple data sources for further analysis by others. Consult with IT to identify data sources and/or changes to underlying reporting systems and application enhancements to support reporting requirements. Conduct root cause analysis of production defects under general supervision. Participate in the development of project plans and carry out assigned tasks. Understand overall data models, data relationships, mapping lineage, business and data transformation rules, and participate in creating data models (e.g., entity design and population). Expand knowledge of Business Intelligence products and tools. Perform other duties as assigned. What … QlikView. Understanding of insurance products/concepts. Knowledge of agile methodologies and Rally application. Experience with Entity Relationship Diagrams, Data flow diagrams, and Dimensional modelling. Knowledge of Exploratory data analysis techniques. Experience with Quality Center. Understanding of basic principles and terminology to solve simple problems. Intermediate analytic, diagnostic, interpersonal, and communication skills. Ability to work independently and in a …
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, root cause analysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting …
City of London, London, United Kingdom Hybrid / WFH Options
Tate Recruitment
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, root cause analysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting …
South East London, England, United Kingdom Hybrid / WFH Options
Tate Recruitment
storage, backups, and Linux systems using tools such as Ansible, Terraform, and GitHub. Collaborate with cross-functional teams to align infrastructure delivery with DevOps best practices. Lead incident response, root cause analysis, and ongoing support for critical infrastructure services. Define and implement infrastructure administration standards and procedures. Champion Infrastructure as Code and continuous improvement across the hosting …
Be proficient in Linux server and system administration (e.g., package management, kernel updates, filesystems, volume management) Have experience managing containerized workloads using Docker or Kubernetes Be an expert in Root Cause Analysis Have a strong desire to learn new skills and technologies, with proven research capabilities and adaptability Possess at least two years of experience training and …
and follow-up actions. Work closely with cross-functional teams on data-related projects and continuous improvement initiatives. Identify and investigate data quality issues, contributing to the development of root cause analyses and solutions. Stay up-to-date with evolving data technologies, tools, and industry trends. Support the definition of data quality methodologies and standards across the business. …
City of London, London, United Kingdom Hybrid / WFH Options
dnevo Partners
and follow-up actions. Work closely with cross-functional teams on data-related projects and continuous improvement initiatives. Identify and investigate data quality issues, contributing to the development of root cause analyses and solutions. Stay up-to-date with evolving data technologies, tools, and industry trends. Support the definition of data quality methodologies and standards across the business. …
Holiday Days + your local bank holidays 1 Birthday day - it only happens once a year! 3 So Giving Days - spend these days giving back to your chosen cause Religious Celebrations Leave Mental Healthcare - Sessions with Unmind Enhanced Family Leave Values-driven culture - we're really proud of our culture. So Energy Who we are So Energy was created in … monthly/quarterly reports for senior leadership, including trend analyses, month-over-month comparisons, and variance explanations. • Support strategic initiatives and lead or participate in special projects, providing data analysis, model validation, and performance tracking. • Conduct "deep dives" into specific issues and coordinate cross-departmental follow-up actions. KPI Monitoring & Analysis • Help define, monitor, and report on key … performance indicators (KPIs) related to payments. • Perform root-cause analysis for KPI deviations, pinpointing underlying issues such as bottlenecks in the collections workflow. • Recommend corrective actions to improve KPIs, working closely with Collections Operations and Finance teams. Trend Identification & Forecasting • Analyse historical payment and collections trends to forecast cash flow, projected delinquency levels, and potential bad-debt …
and scale Kubernetes clusters hosting critical microservices Design and enhance observability, alerting, and incident response processes Collaborate closely with engineers to ensure systems are reliable, secure, and performant Lead root cause analysis for production incidents and help prevent recurrence Build tooling to automate repetitive tasks and improve deployment pipelines (CI/CD) Participate in on-call rotation …
City of London, London, United Kingdom Hybrid / WFH Options
Explore Group
and scale Kubernetes clusters hosting critical microservices Design and enhance observability, alerting, and incident response processes Collaborate closely with engineers to ensure systems are reliable, secure, and performant Lead root cause analysis for production incidents and help prevent recurrence Build tooling to automate repetitive tasks and improve deployment pipelines (CI/CD) Participate in on-call rotation …
South East London, England, United Kingdom Hybrid / WFH Options
Explore Group
and scale Kubernetes clusters hosting critical microservices Design and enhance observability, alerting, and incident response processes Collaborate closely with engineers to ensure systems are reliable, secure, and performant Lead root cause analysis for production incidents and help prevent recurrence Build tooling to automate repetitive tasks and improve deployment pipelines (CI/CD) Participate in on-call rotation …
London, England, United Kingdom Hybrid / WFH Options
Focus on SAP
troubleshoot issues across UI, SCADA, data modeling, and system integration layers. Collaborate with customers to resolve day-to-day operational challenges, adhering to change management and compliance practices. Lead root cause analysis efforts and proactively implement preventative solutions. Contribute to technical documentation, knowledge base articles, and case records to support internal teams and clients. Guide and mentor …
South East London, England, United Kingdom Hybrid / WFH Options
Focus on SAP
troubleshoot issues across UI, SCADA, data modeling, and system integration layers. Collaborate with customers to resolve day-to-day operational challenges, adhering to change management and compliance practices. Lead root cause analysis efforts and proactively implement preventative solutions. Contribute to technical documentation, knowledge base articles, and case records to support internal teams and clients. Guide and mentor …