South East London, England, United Kingdom Hybrid / WFH Options
Explore Group
and scale Kubernetes clusters hosting critical microservices Design and enhance observability, alerting, and incident response processes Collaborate closely with engineers to ensure systems are reliable, secure, and performant Lead rootcauseanalysis for production incidents and help prevent recurrence Build tooling to automate repetitive tasks and improve deployment pipelines (CI/CD) Participate in on-call rotation More ❯
prem environments. What You’ll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
prem environments. What You’ll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
on-prem environments. What Youll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
prem environments. What You’ll Be Doing: Managing and supporting Solace PubSub+ appliances and software brokers across cloud and on-prem platforms Responding to production incidents and working on rootcauseanalysis and long-term fixes Monitoring system health and performance with Prometheus, Grafana, and custom dashboards Optimising Solace across WAN environments for secure, low-latency message More ❯
who is passionate about discovering and solving complicated problems, learning complex systems, working with numbers, and organizing and communicating data and reports. The Research Analyst will perform Big data analysis to identify patterns, train model to generate product to product relationship and product to brand & model relationship. The Research Analyst is also expected to continuously improve the ML/… Research Analyst should be able to write clear and detailed functional specifications based on business requirements. Key job responsibilities • Scoping, driving and delivering complex projects across multiple teams. • Performs rootcauseanalysis by understanding the data need, get data/pull the data and analyze it to form the hypothesis and validate it using data. • Build programs … team(s)/partners to meet program and/or product goals. • Coordinates design effort between internal team and External team to develop optimal solutions. • Performs supporting research, conduct analysis of the bigger part of the projects and effectively interpret reports to identify opportunities, optimize processes, and implement changes. • Ability to convince and interact with stakeholders at all level More ❯
fixed income data model to deliver a consistent, cross-platform client experience. Collaborate with Product, Engineering, Ontologists, and Fixed Income SMEs to co-design an interconnected data model supporting analysis across multiple datasets. Translate business and product requirements into clear, maintainable data modelling artifacts. Define and document metadata standards, entity relationships, and model schemas to support semantic alignment and … and maintain metadata inventories. Communicate data modelling requirements to stakeholders, and drive alignment across metadata/modelling functions to ensure practices are well understood & followed. Perform data profiling and rootcauseanalysis to guide objective, data-driven modelling decisions. Promote FAIR data principles across the modelling lifecycle. You'll need to have: Please note we use years More ❯
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
City of London, Greater London, UK Hybrid / WFH Options
Halian
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
Central London, London, United Kingdom Hybrid / WFH Options
Halian Technology Limited
endpoints are properly configured and updated. 2nd Line Support: Respond to and resolve escalated 2nd line support tickets, ensuring timely resolution of technical issues. Provide expert-level troubleshooting and rootcauseanalysis for more complex issues. Work closely with end-users, understanding their requirements and delivering technical solutions. Escalate issues to senior engineers as needed while keeping More ❯
operational performance, and security compliance. Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct rootcauseanalysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience as a Digital Operations Manager, IT Manager More ❯
operational performance, and security compliance. Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct rootcauseanalysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience as a Digital Operations Manager, IT Manager More ❯
Storage (Block, Object, SQL, NOSQL) • Authentication, Authorisation, Identity Platforms • Information Security, Privacy and Regulatory Compliance • Performance Tuning, Hardening and Troubleshooting • Problem Solving Skills to Methodically Find Faults and perform RootCauseAnalysis • Able to evaluate multiple courses of action, achieving goals by non-standard means if necessary • System Regression • Protocol Analysis • Load Testing • Availability and Resilience More ❯
impacting issues. Note: Our teams work three days a week in our London office. Responsibilities: Act as the escalation and investigation lead for complex or persistent product issues Perform rootcauseanalysis and identify trends across customer contacts Raise and prioritize product issues using data-backed insights Support incident management efforts with timely triage, coordination, and communication More ❯
team. Contribute towards the maintenance and upkeep of the ED&F Man knowledge base for documented process, routine tasks and incident resolutions. Support the problem management function in the rootcauseanalysis of incidents and problems. Skills and Experience Experience working in an IT Service Desk or similar support function. Excellent communication skills at both a non More ❯
Job Responsibilities The position will work closely with Central ACES teams to develop platforms and tools and analyze data to support PD initiatives, dig deep into data to determine rootcause of defects, build and maintain decision support processes to track defects and correct them in real time. Monitoring health of DWP jobs, infrastructure and platform - maintaining … availability and scaling. Performing rootcauseanalysis and resolving Trouble Tickets. Enhancements to the existing solutions independently (with minimal guidance from BIE team). Gradual ownership of medium complexity solutions from the roadmap initiatives. Providing data insights to central ACES team to drive business critical actions. BASIC QUALIFICATIONS 1+ years of data analytics or automation experience. 1+ … years of capacity planning, operations planning, business analysis or similar experience. Bachelor's degree. Knowledge of data pipelining and extraction using SQL. Knowledge of SQL and Excel at a moderate or advanced level. Experience with data mining tools like SQL, SAS, SPSS, or similar. PREFERRED QUALIFICATIONS Knowledge of SQL/Python/R, scripting, MS Excel, table joins, and More ❯
maintain project plans, schedules, and budgets. Manage & control the project costs & financial performance, including approval of Timesheet. Facilitate stakeholder meetings to align project goals and address concerns proactively. Conduct rootcauseanalysis for issues and propose corrective actions. Oversee project scope, risks, and changes, ensuring alignment with project objectives. Prepare and present status reports to clients and … feedback constructively. Required Skills Strong leadership and problem-solving abilities. Excellent communication and interpersonal skills. Proficiency in project management tools and techniques. Ability to work independently with some oversight. Rootcauseanalysis and continuous improvement mindset. Preferred Skills A recognised project management certification, such as CAPM, Prince2, or APM, is preferred. Demonstrable understanding of both Agile and More ❯
maintain project plans, schedules, and budgets. Manage & control the project costs & financial performance, including approval of Timesheet. Facilitate stakeholder meetings to align project goals and address concerns proactively. Conduct rootcauseanalysis for issues and propose corrective actions. Oversee project scope, risks, and changes, ensuring alignment with project objectives. Prepare and present status reports to clients and … feedback constructively. Required Skills Strong leadership and problem-solving abilities. Excellent communication and interpersonal skills. Proficiency in project management tools and techniques. Ability to work independently with some oversight. Rootcauseanalysis and continuous improvement mindset. Preferred Skills A recognised project management certification, such as CAPM, Prince2, or APM, is preferred. Demonstrable understanding of both Agile and More ❯
improvement opportunities around preventative maintenance, system optimizations, reliability metrics, and overall equipment effectiveness. Utilizing this data, you will engage customers to understand and document business requirements, drive problems to rootcause, and manage implementation programs for corrective actions. You will apply your expertise in robotics, mechatronics, reliability engineering and system lifecycle management to build scalable solutions that ensure … optimal performance and availability of thousands of workcells across Amazon's global fulfillment center network. Key job responsibilities Primary responsibilities: - Utilize data analytics, rootcauseanalysis, design of experiments and six sigma methods to improve reliability and availability of critical automation equipment and robotics & mechatronics systems. - - Develop and deliver impactful data-driven analyses, based on equipment failure … and leaders within RME to deliver structured problem solving. - Design, pilot and implement new preventative maintenance standards, interval adjustments, part replacement cycles, and work instruction streamlining based on granular analysis of reliability (MTBF, MTTR, OEE, etc.) trends across assets. - Develop automated reporting through advanced SQL and Quicksight to provide RME leadership with actionable insights, trends and analysis around More ❯
ad-hoc and strategic data analyses Build various data visualizations to tell the story and let the data speak for itself. Recognize and adopt best practices in reporting and analysis: data integrity, test design, analysis, validation, and documentation. Build automation to reduce dependencies on manual data pulls etc. A day in the life A day in the life … of Business Intelligence Engineer will include working closely with Product Managers and Software Developers. Working on building dashboards, performing rootcauseanalysis and sharing actionable insights with the stakeholders to enable data-informed decision making. BASIC QUALIFICATIONS - 2+ years of analyzing and interpreting data with Redshift, Oracle, NoSQL etc. experience - Experience with one or more industry analytics … data visualization using Tableau, Quicksight, or similar tools - Experience with scripting language (e.g., Python, Java, or R) PREFERRED QUALIFICATIONS - Bachelor's degree, or Advanced technical degree - Experience with statistical analysis, co-relation analysis - Knowledge of data modeling and data pipeline design Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a More ❯
to support the growth of our FC operations. Responsibilities The role involves collaborating with Central ACES teams to develop platforms and tools, analyzing data to support PD initiatives, identifying root causes of defects, and maintaining decision support processes for real-time defect correction. Monitoring health of DWP jobs, infrastructure, and platform to ensure 100% availability and scalability. Conducting rootcauseanalysis and resolving Trouble Tickets. Enhancing existing solutions independently with minimal guidance. Owning medium-complexity solutions from the roadmap initiatives gradually. Providing data insights to the central ACES team to drive critical business actions. Key Job Responsibilities Fulfilling data requirements of the central ACES team and utilizing analytical tools to solve business problems. Creating data models More ❯
a hands-on leadership role - you won’t just guide others, you’ll be the go-to expert when systems are under pressure. You'll lead incident response, own rootcauseanalysis, and solve performance issues like memory leaks, outages, and flaky services. Your focus will include : Leading incident management, post-mortems, and blameless RCAs Building scalable More ❯
a hands-on leadership role - you won’t just guide others, you’ll be the go-to expert when systems are under pressure. You'll lead incident response, own rootcauseanalysis, and solve performance issues like memory leaks, outages, and flaky services. Your focus will include : Leading incident management, post-mortems, and blameless RCAs Building scalable More ❯
a hands-on leadership role - you won’t just guide others, you’ll be the go-to expert when systems are under pressure. You'll lead incident response, own rootcauseanalysis, and solve performance issues like memory leaks, outages, and flaky services. Your focus will include : Leading incident management, post-mortems, and blameless RCAs Building scalable More ❯
queries from end users with respect to the systems supported, including but not limited to incorrect data issues, ensuring feeds into the system in a timely manner Ensuring that rootcauseanalysis has been found and solutions implemented to prevent repeat occurrence Change management: plan, test and release new versions of the system Write documentation on system More ❯