best practices to ensure continuous integration and code quality. Identifying and enforcing the use of processes, procedures, and tooling for application monitoring, ongoing maintenance, and incident response to ensure highavailability and performance. Devising and leading the implementation of data management processes and standards for cloud resources, covering areas like naming, tagging, metadata management, backups, and documentation for More ❯
management systems. Strong technical aptitude-able to work closely with engineering to define APIs, system behaviour, and integration logic. Proven track record in leading cross-functional initiatives and delivering high-availability financial products at scale. Exceptional communication skills-able to translate complex technical requirements into commercial outcomes. Demonstrated ability to track key performance metrics and drive data-informed More ❯
experiences. Contribute to the definition and delivery of Engineering roadmaps, anticipating technical challenges and managing interdependencies. Oversee the entire software development lifecycle, from conception to deployment and maintenance, ensuring high-quality outcomes across all phases. Implement best practices in coding, testing, and maintenance to enhance system scalability and performance, particularly for high-traffic events. Ensure compliance with international … financial laws and security standards, integrating appropriate solutions for global payments. Set a high bar for software engineering excellence, emphasizing efficiency, performance optimization, and high availability. Being a mentor to tech leads and managers across the Engineering organisation. Skills Proven experience in leading and managing large teams or projects in an eCommerce environment with Gross Merchandise Value above More ❯
an exciting opportunity to serve as a Subject Matter Expert (SME) in Windows-based enterprise infrastructures. You'll play a central role in the design, deployment, and support of high-security systems that underpin key defence and aerospace operations. Your Impact Take ownership of complex technical issues through to resolution, collaborating with users and other IT teams. Provide expert … and maintain infrastructure security measures such as patching, access controls, and hardening to align with compliance and internal governance. Use monitoring tools to proactively optimize system performance and ensure high availability. Create and maintain detailed technical documentation and contribute to internal knowledge sharing and mentoring. Participate in IT projects, ensuring infrastructure design meets both technical and business objectives. Collaborate … with network, security, and application teams to deliver reliable, secure IT services. What you'll bring: Proficiency in Ivanti Application & Device Control or similar lockdown tools. Experience working with high-security MOD systems and secure design principles. Familiarity with security and compliance documentation (e.g. RMADS, SyOPs, NIST frameworks). Core Technologies Microsoft: Windows Server 2019/2022/ More ❯
is at the heart of everything we do. If this sounds exciting to you, please read on. We are seeking an experienced Senior Software Development Manager to lead our Availability Engineering team within Prime Video. This team is responsible for developing and maintaining our observability platform, incident management systems, and resiliency programs. Key job responsibilities - Manage a high-performing team of software engineers, program managers, data scientists, and incident responders focused on improving the availability and resilience of Prime Video - Oversee the development and evolution of our observability platform, which enables analysis of logs, traces, and other telemetry at scale to rapidly triage and resolve issues - Implement observability and incident management solutions, including the use of … escalation paths, and post-incident review - Drive initiatives to improve the overall resilience and fault-tolerance of the Prime Video platform - Partner closely with other engineering leaders to ensure availability and reliability goals are met - Hire, develop, and retain top technical talent for the Availability Engineering team A day in the life 1. Team Management: - Hold 1-on More ❯
My client, a leading international firm is seeking a skilled SQL Database Administrator to support and enhance enterprise-grade systems. The role involves managing database performance, security, and availability, supporting cross-functional teams, and contributing to strategic technical initiatives under senior guidance. Key Responsibilities * Maintain high-availability SQL Server environments and optimise performance * Monitor systems proactively to … Essential Skills & Experience * Advanced SQL Server 2019/2022 administration in Windows 2022/2025 * Expertise in tuning, monitoring, SSIS, SSAS, T-SQL, and PowerBI * Strong experience with SQL Availability Groups, encryption, and security features * Ability to build and maintain SSIS packages via Visual Studio * Proven external data integration via SFTP * Familiarity with Redgate tools and other RDBMS platforms More ❯
capabilities of groundbreaking AI technologies to benefit humanity in a safe and reliable way. Responsibilities: Develop appropriate Service Level Objectives for large language model serving and training systems, balancing availability/latency with development velocity Design and implement monitoring systems including availability, latency and other salient metrics Assist in the design and implementation of high-availability language model serving infrastructure capable of handling the needs of millions of external customers and high-traffic internal workloads Develop and manage automated failover and recovery systems for model serving deployments across multiple regions and cloud providers Lead incident response for critical AI services, ensuring rapid recovery and systematic improvements from each incident Build and maintain cost optimization … model serving, batch inference, and training pipelines Have proven experience implementing and maintaining SLO/SLA frameworks for business-critical services Are comfortable working with both traditional metrics (latency, availability) and AI-specific metrics (model performance, training convergence) Have experience with chaos engineering and systematic resilience testing Can effectively bridge the gap between ML engineers and infrastructure teams Have More ❯
a diverse team comprising software, hardware, network engineers, supply chain specialists, security experts, operations managers, and more. Your role involves collaborating across AWS to uphold safety standards and deliver high capacity at low costs, fostering an inclusive culture that welcomes innovation and ownership. Role Overview As a Data Center Engineering Operations Engineer (HV/MV), you will troubleshoot and … operate electrical and mechanical equipment, oversee facility operations, and ensure safety procedures are followed. Your work will help maintain highavailability and operational integrity of the data center infrastructure. Key Responsibilities Safety: Maintain and promote safety standards, address safety risks, and lead safety initiatives. Engineering & Facility Operations: Support equipment such as UPS, switchgear, circuit breakers, generators, chillers, HVAC More ❯
Type: Permanent Salary: £65,000 - £75,000 An experience Network and Security engineer is required for a globally recognised organisation in the energy and trading sector, known for its high-performance culture and cutting-edge technology environment. This is a rare opportunity to join a well-funded, international business with a strong London presence, where technology plays a central … Server About You: Hands-on experience in network and infrastructure support Strong knowledge of Cisco and Checkpoint technologies Proficient working within a VMware environment Comfortable working in a regulated, high-availability environment Excellent troubleshooting skills and a sleeves-rolled-up attitude Certifications - minimum of CCNA. CCNP is desirable Why this opportunity? Join a prestigious global business with a More ❯
Conduct regular performance reviews and provide ongoing feedback to foster growth and development within the team. Operational Oversight: Oversee the daily operations of digital systems, applications, and infrastructure. Ensure highavailability and performance of all IT services and applications. Implement and maintain monitoring systems to proactively identify and resolve issues. Application Support: Manage application support activities to ensure … digital products and services. Work with the End-to-End QA Specialist to develop comprehensive test plans and ensure thorough testing. Ensure quality assurance processes are followed to maintain high standards. Strategic Planning and Improvement: Develop and implement strategies to enhance digital operations and IT service delivery. Identify opportunities for process optimisation and efficiency gains. Stakeholder Communication: Act as … senior management on the status of projects, operational performance, and security compliance. Facilitate effective communication between IT teams and business units. Problem Solving and Incident Management: Manage and resolve high-priority incidents and critical issues. Conduct root cause analysis and implement corrective actions to prevent recurrence. Develop and maintain incident response plans and procedures. Requirements: Proven experience as a More ❯
analytical and systematic approach to project management, ensuring the smooth and professional handling of all tasks and interactions. The role demands excellent communication skills and the ability to deliver high-quality results while maintaining strong client relationships. If you're ready to take on a key role within an innovative company and work remotely with a supportive team, apply … Kardex is a leading global partner for intralogistics solutions in an attractive and growing market. The Group offers premium automated products, standardized systems, and life cycle services that guarantee highavailability and low total cost of ownership. Kardex provides an intelligent entry into automation with its dynamic storage and retrieval systems, offers integrated material handling systems, small parts … storage systems, and automated high-bay warehouses, and acts as a global AutoStore partner, offering flexible and modular storage and order fulfillment solutions. The Group employs around 2,500 people in over 30 countries. Kardex Holding AG has been listed on the SIX Swiss Exchange since 1989. Position: Project Manager Job Type: Full Time Location: Remote About the Role More ❯
designers, and other engineers to define, design and implement new capabilities in the domain including bringing new ideas and designing solutions from start to finish Design, develop, and maintain high-performance, scalable, and reliable interfaces in the team domain using Typescript, React, Canvas API, Java and related technologies Contribute to the overall architecture and technical direction of the team … and maintainable code, adhering to best practices and coding standards Participate in code reviews, technical discussions, and knowledge sharing sessions in the stream Troubleshoot and resolve production issues, ensuring highavailability and performance of our services Contribute to improving our development processes and tools What you'll need Excellent communication skills, both written and verbal 5+ years of … experience as a Software Engineer, with a proven track record of leading initiatives within complex, large-scale environments (more focus on FE) Expert knowledge of creating high-performance interfaces using fundamental rendering technologies such as WebGL, Canvas API or related 2D graphics libraries (e.g. PixiJS, Skia etc) Good knowledge of Java and related technologies (e.g. Maven, Spring Boot etc More ❯
requirement – technical capability is the priority. What we’re looking for: Over 3 years’ experience as a Java Developer Comfortable working across cloud-based systems Able to contribute to high-availability, customer-facing applications A proactive team player with good communication skills Ideally available on 1 month’s notice or less Interview process: 1st stage: Teams call with More ❯
globe. Use and enhance the tools you and the team develop. Using metrics, scalability in design, and continuous improvement, you will automate repetitive processes to eliminate outages and enable highavailability of the ground infrastructure. Key job responsibilities - Continuously strive for operational excellence by seeking to automate away manual processes - Ensure that all systems are designed to be … to report on the health of ground infrastructure. - Integrate with other existing systems such as trouble ticketing, dashboards, and metrics tools and services - Measure and improve the performance and availability of the Kuiper Ground Infrastructure - Provide critical operations support and incident response for the service, while taking part in an on-call rotation - Design and implement scalable backend services More ❯
North West London, London, United Kingdom Hybrid / WFH Options
McGregor Boyall Associates Limited
the technology organisation to ensure alignment and co-deliver overall business workflows. Requirements The candidate should have the following skills and qualities: Experience leading the design and development of high-availability, distributed platforms in the Front Office Pricing domain. Experience managing and directing small teams of technical experts. Experience of providing technical expertise and guidance to developers and More ❯
on the planet, using our technology and engineering muscle as our biggest advantage. We aim to leverage cutting edge technologies in big data, machine learning, optimization techniques, and operate high volume, low latency, and highavailability services. BASIC QUALIFICATIONS • Min 2 years of experience working on identity verification/fraud detection processes • Prior experience in manufacturing, transportation More ❯
working with stakeholders to achieve business goals and embracing learning opportunities. You'll play an important role in: Maintaining, supporting, and transforming existing networks into new infrastructure designed for highavailability, security, and ease of maintenance. Implementing the appropriate cybersecurity measures on network solutions at production sites, meeting country-specific and Group requirements. Managing network activities such as More ❯
global stakeholders to achieve our business goals and embrace learning opportunities. You'll play an important role in: maintaining, supporting, and transforming existing networks into new infrastructure designed for highavailability, security, and easy maintenance ensuring that the right cyber security level is implemented on the network solutions on production sites, fulfilling both country-specific and Group requirements More ❯
experiences. Contribute to the definition and delivery of Engineering roadmaps, anticipating technical challenges and managing interdependencies. Oversee the entire software development lifecycle, from conception to deployment and maintenance, ensuring high-quality outcomes across all phases. Implement best practices in coding, testing, and maintenance to enhance system scalability and performance particularly for high-traffic events. Ensure compliance with health … and wellness industry standards and regulations, integrating third-party wellness tracking technology solutions with a seamless user experience. Set a high-bar for software engineering excellence, emphasizing efficiency, performance optimization, and high availability. Being a mentor to tech leads and managers across the Engineering organisation. Skills Proven experience leading and managing teams or projects in a user-centric More ❯
The Manager, Remote Hands leads the on-site Customer Services Technician team, ensuring consistent and high-quality Remote Hands support. This role is responsible for daily service operations, team leadership, customer experience, and compliance. The role also contributes to the continuous development and standardisation of customer service processes across the campus. What youll do Team Management and Leadership: Lead … Customer Services team on-site. Manage daily operations, prioritising service requests and allocating resources effectively. Collaborate with HR on recruitment, training, and performance management. Drive continuous improvement and maintain high service standards across all shifts. Process Management and Compliance: Oversee service fulfilment, material usage, and internal/external reporting. Ensure compliance with legal, safety, and quality standards. Maintain up … Support customer onboarding by understanding technical requirements and coordinating delivery. Escalation & Resource Management Participate in the 24x7 escalation rota as part of the 3rd-line support structure. Ensure the availability of tools, consumables, and other essentials for smooth operations. Act as a role model, fostering a culture of accountability, performance, and alignment with company values. What youll need Master More ❯
framework. Evidence of on-Going professional development . Able to work on projects to tight deadlines and budgets. Able to work in a team that delivers a range of highavailability services. Ability to work with a diverse range of customers with widely varying requirements and priorities. Keywords: IT support, support engineer, 1st line, 2nd line, AD, MS More ❯
extensive network systems with a focus onrouters, switches, firewalls, and advanceddata networking solutions. The candidate will also play a strategic role incloud connectivity,hybrid solutions, andSD-WAN deployments, ensuring highavailability, reliability, and performance across global network infrastructure. Key Responsibilities: 1. Network Infrastructure Management: Manage, configure, and troubleshoot routers, switches, and firewalls to ensure secure and reliable connectivity. … hybrid and multi-cloud integrations. Ensure network reliability and scalability through proactive planning and implementation. 4. Network Monitoring and Management: Administer over 6K+ network devices across 550+ circuits, maintaining highavailability and performance. Implement automation and monitoring tools to reduce manual intervention and optimize network efficiency. Perform capacity planning, monitoring, and regular audits of network infrastructure. 5. Incident More ❯
framework. Evidence of on-Going professional development . Able to work on projects to tight deadlines and budgets. Able to work in a team that delivers a range of highavailability services. Ability to work with a diverse range of customers with widely varying requirements and priorities. Must have a Drivers' Licence and a car Keywords: IT support More ❯
Join an International Leading Bank High growth team About Our Client Business Area Our Global Markets e-Trading Technology teams deliver front-office technology solutions to support e-Trading desks and clients worldwide. The Rates/FX Pricing Platform team specialises in developing software systems that provide Pricing functionality for e-Trading workflows across major global hubs in London … to key stakeholders . Developing relationships across technology teams to align workflows and co-deliver overall business workflows . The Successful Applicant Experience leading the design and development of high-availability, distributed platforms in the Front Office Pricing domain . Proven ability to manage and direct small teams of technical experts. Strong expertise in providing technical guidance to More ❯
more startup in nature. Experience with web-based applications and/or web services-based applications is also helpful. Key job responsibilities • You create and execute test plans with high quality test cases for straightforward scenarios involving one or more product components and features. You clearly document your work. • You perform initial defect triage and root cause analysis. You … ability to come up with creative solutions to hard-to-solve problems. - Proven experience defining and pioneering an effective quality assurance process and driving QA initiatives. - Experience with testing high scale highavailability web services. - Experience with Selenium or other Web UI automation frameworks - Experience testing applications for mobile platforms (iOS, Android) Our inclusive culture empowers Amazonians More ❯