City of London, England, United Kingdom Hybrid / WFH Options
Owen Thomas | Pending B Corp™
and efficiency. Automate configuration, provisioning, and deployment to reduce manual effort and streamline operations. Implement and uphold security standards, including encryption, access control, and compliance. Lead incident response and rootcauseanalysis, applying preventive measures to avoid recurrence. Collaborate across teams (QA, DevOps, IT) to troubleshoot and enhance system performance. Maintain clear documentation for configurations, procedures, and … with a focus on Python. Skilled in TDD and BDD, primarily using Python. Deep understanding of distributed systems, networking, storage, and compute management. Strong troubleshooting skills, with experience in rootcauseanalysis and timely resolution. Knowledge of security standards (ISO27001, NIST, GDPR) and infrastructure security best practices. Experienced with monitoring/logging tools like Splunk, Grafana, and More ❯
independently drive the performance improvement projects. In this role you will be focused on improving the experience and satisfaction of Amazon customers (vendors/vendor managers/end customer), rootcauseanalysis of issues and opportunities affecting the business. Key job responsibilities 1. Develop strategies for continuous improvement in process and customer quality. 2. Strengthen the existing … drive Kaizen events and work on new automation/solution building projects 4. Able to drill into large amounts of data and extract meaningful business metrics. 5. Perform data analysis on trends observed and recommend solutions to the product and Business teams 6. Collaborate with partner teams and stakeholders across the globe to deliver on key business goals and … objectives by driving consensus and building trust 7. Demonstrates the ability to dive deep into a problem, perform rootcause and corrective actions to avoid defect reoccurrence. 8. Establishes key reports for functional area 9. Able to write strong, well-structured and detail-oriented documents in a clear, concise and audience-specific format The ideal candidate is/ More ❯
West End of London, England, United Kingdom Hybrid / WFH Options
Owen Thomas | Pending B Corp™
and efficiency. Automate configuration, provisioning, and deployment to reduce manual effort and streamline operations. Implement and uphold security standards, including encryption, access control, and compliance. Lead incident response and rootcauseanalysis, applying preventive measures to avoid recurrence. Collaborate across teams (QA, DevOps, IT) to troubleshoot and enhance system performance. Maintain clear documentation for configurations, procedures, and … with a focus on Python. Skilled in TDD and BDD, primarily using Python. Deep understanding of distributed systems, networking, storage, and compute management. Strong troubleshooting skills, with experience in rootcauseanalysis and timely resolution. Knowledge of security standards (ISO27001, NIST, GDPR) and infrastructure security best practices. Experienced with monitoring/logging tools like Splunk, Grafana, and More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Owen Thomas | Pending B Corp™
and efficiency. Automate configuration, provisioning, and deployment to reduce manual effort and streamline operations. Implement and uphold security standards, including encryption, access control, and compliance. Lead incident response and rootcauseanalysis, applying preventive measures to avoid recurrence. Collaborate across teams (QA, DevOps, IT) to troubleshoot and enhance system performance. Maintain clear documentation for configurations, procedures, and … with a focus on Python. Skilled in TDD and BDD, primarily using Python. Deep understanding of distributed systems, networking, storage, and compute management. Strong troubleshooting skills, with experience in rootcauseanalysis and timely resolution. Knowledge of security standards (ISO27001, NIST, GDPR) and infrastructure security best practices. Experienced with monitoring/logging tools like Splunk, Grafana, and More ❯
activities include network engineering and troubleshooting, project management, mentorship of Technical Support Technicians, data cabling, systems administration in a variety of software and hardware environments, telecom/PBX administration, rootcauseanalysis in problem solving, and assistance in managing the daily activities of the department. A willingness to relocate as career advancement opportunities throughout the company become … activities include network engineering and troubleshooting, project management, mentorship of Technical Support Technicians, data cabling, systems administration in a variety of software and hardware environments, telecom/PBX administration, rootcauseanalysis in problem solving, and assistance in managing the daily activities of the department. A willingness to relocate as career advancement opportunities throughout the company become More ❯
considerations. Monitor and review control evidence and attestations, ensuring completeness, accuracy, and timely escalation of control failures or near misses. Act as lead incident response liaison, coordinating initial triage, rootcauseanalysis, and business remediation with tech and ops leads. Draft and socialize business continuity and operational resilience playbooks with relevant stakeholders; lead tabletop exercises for scenario … RCSA methodology, risk governance principles, and control lifecycle management. Working knowledge of trade processing workflows and operational risks associated with trading technology and infrastructure. Demonstrated experience in incident management, rootcauseanalysis, and driving remediation. Familiarity with regulatory and industry frameworks (e.g., MIFID II, DORA, NIST) related to operational resilience and technology risk in EU/UK More ❯
Responsibilities Understand the intricacies of Data Centers within your designated region. Conduct audits for power and mechanical capacity and oversee upgrades. Collaborate with internal teams to troubleshoot and perform RootCauseAnalysis (RCA) and Corrective Action (CA) for design-related issues. Liaise with local colocation partners to comprehend and synchronize site utility requisites. Offer after-hours support … Manage inventory processes for data center equipment. Collaborate with project teams and colocation partners to validate the functionality of electrical and mechanical systems. Extend operational support encompassing failure mode analysis, rootcause identification, maintenance assistance, best practices, procedural reviews, and more. Curate and maintain comprehensive technical documentation concerning corporate data centers and operational procedures. Collaborate with regional More ❯
objectives. Drive quality outcomes tied to delivery milestones such as UTP2, client onboarding, and infrastructure transformation. Implement continuous improvement processes informed by testing outcomes and operational metrics. SLA Management &RootCauseAnalysis Monitor service performance against defined SLAs, ensuring proactive tracking and reporting to senior stakeholders. Lead structured RootCauseAnalysis (RCA) processes following More ❯
London, England, United Kingdom Hybrid / WFH Options
Smartedge Solutions
project scope is delivered to the required timelines and quality. Principal Accountabilities: Collate and conduct thorough data cleansing and validation of various data sets (IT, HR, Payroll records) Execute root-causeanalysis to Identify and rectify data inconsistencies, duplicates, and errors. Develop and maintain data models, dashboards, and reports to support the data cleanse project. Collaborate with … as a Data Analyst, preferably with experience in managing large volume data sets within a project team (HR/IT) environment. Experience working with Oracle EBS Proficient in data analysis tools and software (SQL, EXCEL, Tableau, Power BI etc) Excellent communication, presentation & influencing skills, with the ability to translate complex data into actionable insights for non-technical audiences. Experience More ❯
develop a high-performing DevOps Engineers-recruit and manage a multidisciplinary team responsible for automation, observability/monitoring, security & compliance automation, CI/CD pipelines, reliability/resilience, FinOps, rootcause/incident response, dashboarding/reporting, and 24/7 runbook & on-call coordination. Drive platform automation and operational excellence-own and evolve automation strategies, tooling, and … deployment velocity, reliability, and consistency across centralised and federated engineering teams. Champion reliability engineering and incident response-embed best practices in site reliability engineering, including proactive monitoring, incident detection, rootcauseanalysis, and continuous improvement to minimize downtime and user impact. Enhance observability and operational visibility-oversee the design, implementation, and evolution of monitoring, alerting, dashboarding, and More ❯
virtualised environments (VMware ESXi, IBM Power-VM). Monitor and optimise system performance, capacity, and availability. Troubleshoot and resolve 2nd/3rd line infrastructure issues, with a focus on rootcauseanalysis and service restoration. Collaborate on the design and deployment of high availability solutions, including SAP HANA platforms. Contribute to automation, monitoring, and compliance across the More ❯
to grow the vendor's supply chain efficiency, traffic, customer conversion, even revenue on Amazon. Also, you will look into strategic and operational aspects of their business with Amazon, rootcauseanalysis of issues and opportunities affecting the vendor's business. Your focus will be on the strategic and operational aspects of managing the customer relationships with … metrics for the vendors and Amazon; • Work with internal Amazon teams/vendors to improve operational aspects of their business in providing a great consumer experience; • Conduct deep dive analysis on the handled issues for the vendors and develop recommendations and action plans based on data to improve vendor experience; • Provide thought leadership around planning, roadmaps and execution; • Establish … communication skills in Japanese; good written skills in English; - Strong presentation skills, ability to express thoughts logically and succinctly. Demonstrated active listening skills, highly consultative and solutions-oriented; - Data analysis skills - ability to collect, clean, interpret, challenge and questions data and provide solutions; demonstrated ability to manage the business; - Influencing skills - metric influencing: ability to understand a metric and More ❯
to grow the vendor's supply chain efficiency, traffic, customer conversion, even revenue on Amazon. Also, you will look into strategic and operational aspects of their business with Amazon, rootcauseanalysis of issues and opportunities affecting the vendor's business. Your focus will be on the strategic and operational aspects of managing the customer relationships with … metrics for the vendors and Amazon; • Work with internal Amazon teams/vendors to improve operational aspects of their business in providing a great consumer experience; • Conduct deep dive analysis on the handled issues for the vendors and develop recommendations and action plans based on data to improve vendor experience; • Provide thought leadership around planning, roadmaps and execution; • Establish … communication skills in Japanese; good written skills in English; - Strong presentation skills, ability to express thoughts logically and succinctly. Demonstrated active listening skills, highly consultative and solutions-oriented; - Data analysis skills - ability to collect, clean, interpret, challenge and questions data and provide solutions; demonstrated ability to manage the business; - Influencing skills - metric influencing: ability to understand a metric and More ❯
out technical performance and enhancements. Key Responsibilities : Lead incident management and escalations within a fast-paced NOC environment. Provide technical mentorship to team members, fostering growth and upskilling. Conduct rootcauseanalysis and drive continuous improvement initiatives. Maintain and improve network documentation and operational processes. Engage in vendor management and coordinate with multiple network solution providers. Play More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stott and May
review, design, and implement infrastructure decisions. Maintain documentation for platforms, services, and pipelines. Audit activities to ensure compliance with security policies (including PCI DSS, GDPR, and PII). Perform root‐causeanalysis and implement improvements to prevent incidents and optimize performance. Maintain and evolve monitoring platforms, including synthetic and application monitoring, responding to alerts and identifying bottlenecks. More ❯
review, design, and implement infrastructure decisions. Maintain documentation for platforms, services, and pipelines. Audit activities to ensure compliance with security policies (including PCI DSS, GDPR, and PII). Perform root‐causeanalysis and implement improvements to prevent incidents and optimize performance. Maintain and evolve monitoring platforms, including synthetic and application monitoring, responding to alerts and identifying bottlenecks. More ❯
set clear goals and priorities Identify opportunities to automate repetitive tasks and promote reuse of components Engage with technical communities to share knowledge and advance shared capabilities Lead incident root-causeanalysis and promote active application custodianship to drive continuous improvements Invest in your ongoing development of technical and Agile skills Collaborate closely with architecture, security, and More ❯
of GitOps workflows Strong background in observability: metrics, visualization, logging, tracing Understanding of automation, CI/CD pipelines, deployment automation, and release strategies Experience with incident management, disaster recovery, rootcauseanalysis, and post-incident reviews Additional Benefits: Hybrid working: 1+ days a week in London office Wellbeing initiatives: coaching, fitness sessions, webinars, Wellbeing day Subsidised gym More ❯
to recruit a seasoned professional with at least 5 to 10 years hands-on expertise at 2 nd and 3 rd level support. 1st and 2nd Line Troubleshooting/Root-CauseAnalysis, Documentation write ups for Knowledge Base Articles Be an integral part of the Support team in supporting and delivering all aspect of services and full More ❯
for end-users (in collaboration with our Learning & Skill development team), create detailed documentation, and maintain up-to-date knowledge bases. Manage incident resolution and problem-solving processes, conduct rootcauseanalysis, and implement preventive measures. Your profile: Apprenticeship as IT specialist or bachelor's degree in information systems, computer science, IT, or related field. Extensive experience More ❯
benefits of our cloud-based and AI-enhanced tools. Collaborative Troubleshooting: Work closely with cross-functional teams to resolve issues and enhance our products. From reviewing logs to conducting rootcauseanalysis, you'll play a key role in ensuring technical issues are swiftly addressed. Content Creation & Knowledge Sharing: Share your findings with a wider audience by More ❯
at home (or pro rata for part-time employees). This policy may be changed at the Company's discretion. What Will You Do? Identify opportunities and problems through rootcauseanalysis across ongoing processes. Assist in identifying trends and gathering results to identify process improvement opportunities and may lead low complexity improvements or medium complexity with More ❯
Produce and distribute clear, actionable reports and dashboards on: Planned vs Actual manned hours Shift fulfilment and attendance rates SLA performance across all contracts Provided vs Invoiced hours , with rootcauseanalysis for variances Deliver reports daily (tactical), weekly (trend), and monthly (strategic/board level). Collaborate with the Data Manager and Invoicing Manager to ensure More ❯
coordinate SOX testing and remediation efforts. Testing & Documentation:Oversee or perform walkthroughs, test the operating effectiveness of controls, and ensure accurate, comprehensive documentation. Remediation & Continuous Improvement:Track issues, lead rootcauseanalysis, and follow up on remediation and preventive actions for SOX deficiencies. Awareness & Training:Educate and train IT and business teams on SOX requirements and best More ❯
unit, integration, etc.) and quality gates into the CI/CD pipelines, upholding company standards for software testing. Troubleshooting & Optimisation : Proactively troubleshoot and resolve infrastructure or deployment issues, performing rootcauseanalysis and implementing improvements to prevent recurrence and improve system reliability. Legacy Pipeline Migration: Lead the initiative to migrate existing legacy build/deployment pipelines to More ❯