visualization, logging, and tracing Understanding of automation, SDLC, CI/CD pipelines, deployment automation, and blue/green or canary releases Proven experience with incident management, disaster recovery planning, rootcauseanalysis, and post-incident reviews Hybrid working - 1+ days a week in the London office Wellbeing initiatives including Sanctus Coaching, Virtual fitness sessions, Wellbeing webinars, and More ❯
vulnerabilities are exploited or identified in real-time. Work with relevant teams to contain and mitigate security breaches, ensuring minimal impact on the business. Develop post-incident reports, including rootcauseanalysis and remediation strategies. Security Strategy & Improvement: Stay up-to-date on the latest security trends, tools, techniques, and frameworks. Continuously evaluate and improve the organisation More ❯
queries. Take ownership of issues from start to finish. Proactively drive issue investigation to resolution, involving other parties and escalating where needed. Liaise with clients to interpret queries for rootcauseanalysis and propose appropriate workarounds and solutions. Qualifications Excellent analytical and problem-solving skills Programming experience (C++ and Linux/Unix preferred) Ability to communicate confidently More ❯
queries. • Take ownership of issues from start to finish. • Proactively drive issue investigation to resolution, involving other parties and escalating where needed. • Liaise with clients to interpret queries for rootcauseanalysis and propose appropriate workarounds and solutions. Essential 2:1 Bachelor's degree (or higher) in a numerate subject (e.g. Computer Science, Engineering, Physics, Mathematics, etc. More ❯
architecture will help you. What you'll be doing Diagnose and resolve complex application and infrastructure issues Participate in our 24x7 on-call rotation, SCRUM, and deployment planning Perform RootCauseAnalysis (RCA) and provide recommendations for application teams Improve availability and reduce customer impact using Industry best observability tools Ensure best-practice and security-minded architecture More ❯
architecture will help you. What you'll be doing Diagnose and resolve complex application and infrastructure issues Participate in our 24x7 on-call rotation, SCRUM, and deployment planning Perform RootCauseAnalysis (RCA) and provide recommendations for application teams Improve availability and reduce customer impact using Industry best observability tools Ensure best-practice and security-minded architecture More ❯
quality and accurate reports for a wide range of stakeholders. Collaborate with Security Engineers and cross-functional teams to investigate and remediate large scale security incidents. Support security incident rootcauseanalysis, identify control gaps, and recommend mitigation strategies. Collaborate with cross-functional teams to drive improvements to security tools, policies and processes. Improve the effectiveness and More ❯
primarily focused on Search systems, with contributions to Recommendations and SLU, applying Deep Learning, GenAI, and other ML techniques to improve Fire TV user experience - Perform hands-on data analysis and modeling with large data sets to develop insights that increase device usage and customer experience - Design and run A/B experiments, evaluate the impact of your optimizations … stakeholders - Work closely with product managers and software engineers to design experiments and implement end-to-end solutions - Setup and monitor alarms to detect anomalous data patterns and perform rootcause analyses to explain and address them - Be a member of the Amazon-wide Machine Learning Community, participating in internal and external MeetUps, Hackathons and Conferences - Help attract More ❯
and resolution of incidents while participating in an on-call rotation to maintain readiness Develop and automate threat detection and response capabilities, creating and refining alerting use cases Conduct rootcauseanalysis, produce detailed incident reports, and implement mitigation strategies to prevent future incidents Design and implement automation initiatives to enhance the efficiency of threat detection, response More ❯
independently managing technical projects with minimal supervision. Excellent executive-level communication skills (written and verbal) for translating technical concepts into business outcomes. Strong troubleshooting and problem-solving abilities, including rootcauseanalysis and resolution of complex technical issues. Ability to build and maintain relationships with both technical and non-technical stakeholders. Experience in working cross-functionally with More ❯
improvement and development of support processes. Day-to-day Diagnose and resolve high-priority, complex technical issues reported by customers, ensuring timely resolution and high customer satisfaction Conduct thorough rootcauseanalysis of recurring issues to identify and implement preventive measures Ensure high ticket productivity while maintaining a minimal backlog Manage and prioritize incidents and service requests More ❯
visibility into data quality metrics and trends. Collaborating closely with data engineering teams, you will design and implement automated data quality checks and monitoring processes. Additionally, you will perform rootcauseanalysis for data quality issues, identifying underlying causes and proposing effective solutions. You will also assist product teams in developing and executing test plans, cases, and More ❯
and result logging. Understanding of testing life cycle. Collaborate with development, business, and QA teams to understand requirements and ensure test coverage. Understanding of defect life cycle. Experience of rootcause analysis. Good stakeholder management skills Familiarity with Agile and DevOps methodologies. Strong Interpersonal Skills - Mandatory More ❯
automation strategy across global processes. Delivering insights by analysing large, complex datasets to evaluate control design, identify risks, and recommend improvements. Applying automation and analytics to test controls, perform rootcause analyses, and support SOx assurance activities. Collaborating with cross-functional stakeholders—including Data & Analytics, IT, Finance, and Audit—to drive a culture of risk awareness and control More ❯
automation strategy across global processes. Delivering insights by analysing large, complex datasets to evaluate control design, identify risks, and recommend improvements. Applying automation and analytics to test controls, perform rootcause analyses, and support SOx assurance activities. Collaborating with cross-functional stakeholders—including Data & Analytics, IT, Finance, and Audit—to drive a culture of risk awareness and control More ❯
tools across diverse hardware and software environments. Create and maintain feature pipelines, feature stores, and model stores. Develop frameworks to enable scalable, reproducible research. Proactively troubleshoot performance bottlenecks, conduct root-cause analyses, and implement solutions to optimize GPU or CPU resource usage. Qualifications: Bachelor's or advanced degree in Computer Science, Machine Learning, Artificial Intelligence, or a related More ❯
as Order Gateway, Execution Algo, Market Data, Risk, and Middle Office. Help monitor and improve test coverage across services. Assist in bug reproduction, investigation, and work with developers on rootcause analysis. Participate in regular test cycle planning and release validation processes. Requirements Must-Have Skills Basic understanding of crypto trading workflows, including exchange matching, order lifecycle, and More ❯
to support investigations and facilitate recovery efforts when necessary. Stay updated on emerging fraud trends, industry best practices, and regulatory requirements to enhance fraud prevention and detection measures. Perform rootcause analyses, document control breakdowns, and work with the strategy and engineering teams to improve fraud controls. Document procedures and proactively identify opportunities for operational process improvement. Work More ❯
and encryption mechanisms. Collaborate with internal stakeholders to gather business requirements and translate them into technical specifications. Monitor and troubleshoot file transfer processes, identify and resolve issues, and perform rootcause analysis. Implement best practices for data integrity, security, and compliance, ensuring adherence to relevant regulations and standards. Develop and maintain documentation, including system architecture, configuration details, and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Tata Consultancy Services
and encryption mechanisms. Collaborate with internal stakeholders to gather business requirements and translate them into technical specifications. Monitor and troubleshoot file transfer processes, identify and resolve issues, and perform rootcause analysis. Implement best practices for data integrity, security, and compliance, ensuring adherence to relevant regulations and standards. Develop and maintain documentation, including system architecture, configuration details, and More ❯
understand project requirements and deliver high-quality solutions. Identify and fix performance bottlenecks in both front-end and back-end systems. Debug issues across the entire stack and perform rootcause analysis. Continuously monitor application performance and make improvements to ensure high availability and responsiveness. Your Profile Essential skills/knowledge/experience: Experience with Azure PaaS Services More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Tata Consultancy Services
understand project requirements and deliver high-quality solutions. Identify and fix performance bottlenecks in both front-end and back-end systems. Debug issues across the entire stack and perform rootcause analysis. Continuously monitor application performance and make improvements to ensure high availability and responsiveness. Your Profile Essential skills/knowledge/experience: Experience with Azure PaaS Services More ❯
build the technical acumen on the team. - Collaborate and learn from world-class leaders to meet world-class challenges, every day. - Hire, train, and grow new AWS engineers. - Drive rootcause analyses, in collaboration with software development teams, as well as influencing local development to improve operational performance. - Report on the health of these services at an executive More ❯
support our expanding Seller Compliance space. This role involves collaborating with engineering teams to provide operational support across multiple products and platforms, including development support (continuous deployment, operational readiness, root-causeanalysis, code fixes, testing, metrics, and dashboards), customer support tools, and data-driven decision support (data mining, reporting, scripting). The team comprises world-class engineers … Develop robust deployment processes with metrics and rollback strategies. Enable continuous deployment pipelines with comprehensive testing. Enhance monitoring and logging capabilities. Manage software updates and address technical debt. Perform rootcauseanalysis for technical challenges. Software Development and Maintenance Create operational tools and automation using Ruby, Rails, Java, Python, Shell scripting. Develop unit and integration tests with More ❯
tech and non-tech teams experience Use statistical models with large, multidimensional datasets to uncover trends, patterns, and opportunities Identify and recommend opportunities to automate processes Perform ad hoc analysis to quickly solve time-sensitive operational issues Clearly communicate discrepancies and findings, including rootcauseanalysis and resolution steps to a broad user base A day … analyze trends in security data, or develop dashboards for tracking key performance indicators. You'll prototype solutions for technical review by peers, implement ETL pipelines, and work independently to rootcause data anomalies. You'll be expected to understand business implications and recommend courses of action through crisp documentation for senior leaders. About the team Our team is … data with Redshift, Oracle, NoSQL etc. experience - Experience with data visualization using Tableau, Quicksight, or similar tools - Experience with data modeling, warehousing and building ETL pipelines - Experience in Statistical Analysis packages such as R, SAS and Matlab - Experience using SQL to pull data from a database or data warehouse and scripting experience (Python) to process data for modeling PREFERRED More ❯