refining and executing the IT vision, from market data aggregation and distribution, market making execution & hedging, client negotiation and order management. Hands-on experience in ultra low latency and highavailability applications is required. Summary of Key Purposes of the Role: We are rebuilding our low latency Foreign ExchangeCash platform covering client connectivity, negotiation, price construction and streaming … Lead the architectural design of Forex trading applications Collaborate with stakeholders to define and refine the vision for the Forex trading chain Oversee the implementation of low latency and highavailability solutions Build & lead a local feature team on the build out of components of the new platform Provide technical leadership and guidance to the development team ensuring … skills Very strong knowledge of Forex products and workflows (market making, price distribution, algo execution & hedging, order management) Proven software development experience with expertise in ultra low latency/highavailability platforms Proficiency in .net core Proven ability to develop clean, testable code Experience with Unix & Azure cloud hosting, postGre & oracle databases, gRPC, kafka, FIX, SBE middlewares, elk More ❯
is responsible for driving the strategy, execution, and optimization of BCG's global network infrastructure across on-premises, cloud, and hybrid environments. This role ensures end-to-end automation, highavailability, security, and scalability of network services while integrating SD-WAN, cloud networking, and AI-driven automation to support global business operations. The leader will oversee next-generation … network architecture, operations, and transformation, ensuring a seamless and high-performance connectivity experience. Key Responsibilities: Strategic Leadership & Transformation: Define and execute a modern network platform strategy, integrating cloud networking, software-defined networking (SDN), and AI-driven automation. Ensure end-to-end network automation to improve operational efficiency, agility, and reliability. Drive zero-trust network security principles, ensuring compliance and … efficiency. IT Service Management & Operational Excellence: Establish network reliability objectives, including SLOs, SLIs, and error budgets. Implement real-time incident detection and response using AI-driven network analytics. Ensure highavailability, network resilience, and 24x7 operational support. Develop a follow-the-sun support model, ensuring global network performance optimization. Implement network observability and predictive analytics to proactively prevent More ❯
Chester, Cheshire, North West, United Kingdom Hybrid / WFH Options
Marstep Resourcing Solutions
cloud environments. Design and implement scalable, resilient, and secure microservices to support critical business functions. Support and maintain existing Laravel applications and microservices during the migration phase. Monitor system availability, performance, and reliability, with a focus on infrastructure supporting Laravel-based apps. Manage databases and caching layers, including MySQL and Redis , to support Laravel app performance. Support and maintain … and orchestration tools such as Kubernetes (EKS preferred) . Strong understanding of AWS cloud services (ECS/EKS, RDS, S3, etc.). Experience with API design, distributed systems, and high-availability architectures. Familiarity with CI/CD tools and DevOps practices. Proficient in database administration for MySQL and Redis . Desirable Familiarity with Infrastructure-as-Code (e.g., Terraform … Kubectl). Experience with scaling infrastructure for high-availability systems or large databases. Knowledge of distributed systems and multi-cloud environments. Important: Strong Laravel expertise (large monoliths) Ideally have deep experience working with large, complex Laravel codebases Proven experience working with microservices You will be helping splitting up the monolith into a microservice architecture, working with microservices (PHP More ❯
experience We communicate early, clearly, and often. Whether it's a project update, an idea, or a concern-nothing gets left unsaid. Open, transparent conversations build trust and fuel high performance because when communication flows, everything runs smoother. We challenge ideas, not people. We tackle tough topics, welcome honest feedback, and speak our truth-even when it's uncomfortable … help drive performance across the business through world-class development In this role, you'll work as part of a collaborative, fast-moving team to build scalable, reliable, and high-performance software that powers real-time user experiences in the i-gaming space. We're on a mission to create extraordinary experiences for our customers and your passion, drive … inc. software designs) from product requirements Produce well structured, testable code Apply best practices (industry and internal standards) Solution must be performant and observable in the context of a high complexity, high throughput systems via metrics and monitoring Balance technical solutions with commercial constraints Provide technical mentorship to less experienced members of the team Scope and estimate work More ❯
management, deployment, and monitoring. Implement infrastructure as code (IaC) practices using tools such as Terraform and Ansible. Monitoring and Alerting: Implement monitoring solutions to track the health, performance, and availability of infrastructure components and applications. Configure alerting mechanisms to notify teams of potential issues and proactively address them before they impact users. Incident Response and Root Cause Analysis: Participate … enhancements to ensure optimal performance and resource utilization. Security and Compliance: Implement security controls, and respond to security incidents in accordance with established policies and procedures. Disaster Recovery and HighAvailability: Design and implement disaster recovery (DR) and highavailability (HA) solutions to ensure business continuity and minimize downtime. Develop and test DR plans, implement failover More ❯
platforms and processes, ensuring reliable outcomes. SME Influence - provide technical direction, best practices, and support for key automation platforms. Operational Focus - deliver resilient, well-documented solutions that improve system availability and reduce manual effort. What you'll be doing Act as the technical lead for automation tooling, owning the design, delivery and continuous improvement of deployment solutions using Puppet … Ansible, SaltStack, and OLAM. Coach, mentor and develop the skills of a small team of engineers, fostering technical growth and knowledge sharing. Ensure highavailability, performance and security across the Puppet infrastructure in Test/Dev, Pre-Prod, and Live environments. Develop infrastructure-as-code solutions to support automated software deployments and configuration management at scale. Implement and … repeatable automation using Ansible and SaltStack in production environments. Working with OLAM or equivalent orchestration tools to improve deployment efficiency. Troubleshooting complex automation issues in hybrid OS environments. Producing high-quality technical documentation for operational teams. Driving platform upgrades, patch cycles, and compliance initiatives. Coaching and mentoring technical team members to enhance capability. Nice to have Working within cloud More ❯
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom
Anson Mccade
You must have the permanent right to work in the UK. What You'll Be Doing Manage and maintain SQL Server environments (on-premise and Azure) Support and optimise high-availability configurations and backup strategies Collaborate with developers and technical teams to improve schema design, stored procedures, and overall SQL performance Implement and monitor SQL Server agent jobs … a genuine desire to learn and solve problems Strong appreciation for robust and well-documented systems A collaborative, solutions-oriented approach Nice-to-Haves Powershell scripting (dbatools, SqlServer modules) HighAvailability (Always-On, Basic Availability Groups) SQL Server configuration and partitioning SQL Snapshots, Change Data Capture Understanding of SAN arrays (Nimble preferred, PURE or others also suitable … services experience is a plus but not required Why Join Us? Make a real impact - have ownership from day one Work alongside top-tier engineering talent in a collaborative, high-performance team Solve real-world business problems closely aligned with market dynamics Engage directly with users - see the results of your work in production Learn and grow - deepen your More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
and scalability for microservices. Performance Optimization & Tuning: Proactively analyze and optimize complex queries, implement efficient indexing strategies, and manage partitioning/sharding to ensure peak database performance and handle high throughput. Reliability & Disaster Recovery: Design, implement, and maintain robust backup, disaster recovery, and high-availability solutions, including replication (master-slave/multi-master) and failover configurations, to … processes or architectural improvements to enhance team efficiency, code quality, and timely delivery. Contribute to software architecture discussions, translate system-level designs and architectural blueprints into robust, maintainable, and high-quality code, applying the latest best practices in software engineering. Required Skills/Experience: 6+ years in roles directly responsible for the availability, performance, and security of critical … . Cloud Database Operations: Hands-on experience deploying, managing, and optimizing databases within AWS infrastructure (RDS, Aurora) and virtual machine infrastructure (SQL Server). Database Reliability & Security: Experience with highavailability (replication, failover), backup automation, PITR, and data security (encryption, SQL injection prevention). Automation & DevOps: Proficiency in integrating database changes into CI/CD pipelines using schema More ❯
Technology, you will be responsible for designing, implementing, and maintaining robust database solutions that support enterprise infrastructure services. This role involves working with cloud and on-premises databases, ensuring highavailability, security, and performance while supporting database migrations, automation, and modernization efforts. You will collaborate with cross-functional teams to optimize database architectures and contribute to the continuous … improvement of infrastructure services. Key Responsibilities: Lead and manage database infrastructure services, including deployment, migration, and administration of databases (SQL, Oracle, PostgreSQL, etc.). Ensure highavailability, security, and optimal performance of database environments. Support the development and execution of database infrastructure strategies and modernization projects. Monitor and maintain database health, backups, disaster recovery, and performance tuning. Collaborate More ❯
with a wide array of asset issuers. As a well-established market maker, our distinctive expertise led us to expand rapidly. Today, our services span market making, options trading, high-frequency trading, OTC, and DeFi trading desks. But we're more than a service provider. We're an initiator. We're pioneers in adopting the Rust Development language for … highly scalable and resilient cloud infrastructure to support our trading operations. The ideal candidate will have a strong background in cloud computing, automation, and CI/CD pipelines, ensuring highavailability and performance for mission-critical systems. Key Responsibilities Cloud Infrastructure Management: Design, deploy, and maintain scalable and resilient infrastructure on AWS using Infrastructure-as-Code (IaC). … Kubernetes Administration: Manage and optimize Kubernetes clusters for containerized applications, ensuring highavailability and security. Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications. Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, ELK stack, or similar tools to improve system reliability. Security & Compliance: Apply best practices for More ❯
fast, reliable, and resilient CI/CD pipelines, empowering engineers to self-service their infrastructure needs. Support the development, testing, and maintenance of disaster recovery scenarios to ensure system availability and business continuity. Develop and deploy automated tools that enhance the developer experience, simplifying infrastructure management and deployment processes. Monitor performance, capacity, and availability of systems and infrastructure … BS degree in Data Science, Computer Science, Engineering, Math, Statistics, Physics, or similar formal training or equivalent Desirable Qualifications Technical: Proven experience with looking after data recovery and database highavailability and database tuning Proven experience with FinOps and being able to optimise spend for CE impact Experience with working with IoT and Edge interaction Remote Type Hybrid … Agile Methodology, Amazon Web Services (AWS), Automation Cloud, Cloud Infrastructure, Continuous Integration and Continuous Delivery Methodologies, Cost Optimization, Cross-Functional Work, Curiosity, Data Analysis, Digital Literacy, Disaster recovery and highavailability, Docker (Software), Emotional Intelligence, Empathy, GitHub, Infrastructure As Code (IaC), Initiative, Kubernetes, Lambda, Linux, NoSQL, Problem Solving + 8 more We are an Equal Opportunity Employer, and More ❯
and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities: Lead the design, implementation, and maintenance of Writer, Inc.'s cloud infrastructure to ensure highavailability and performance. Design and … reliability practices. Is this you? Proven expertise in Site Reliability Engineering with at least 7 years of hands-on experience. Deep understanding of system architecture and infrastructure design for highavailability and performance. Bachelor's degree in Computer Science, Engineering, or a related field. Strong proficiency in programming languages such as Python, Java, or Go for automation and More ❯
Technical Team Manager (Level 3) to lead our senior infrastructure team in supporting and enhancing a mission-critical, on-premises enterprise infrastructure environment. This role is responsible for the availability, scalability, performance, and integrity of enterprise data centre operations, virtualisation platforms, storage systems, and server infrastructure. The successful candidate will combine technical depth with strategic leadership to ensure operational … excellence in a large-scale, high-availability setting. Key Responsibilities Leadership & Strategic Oversight Lead and develop a team of Level 3 Infrastructure Engineers responsible for enterprise infrastructure services. Define team objectives, enforce operational standards, and ensure adherence to enterprise architecture and governance policies. Promote a culture of accountability, documentation, process improvement, and service excellence. Act as the primary … server environments Virtualisation platforms (eg, VMware vSphere, Hyper-V) Enterprise storage systems (SAN, backup/restore, archival) Identity and access systems (eg, Active Directory, ADFS, Group Policy) Ensure continuous availability and performance of systems through robust monitoring, patching, and automation. Lead technical resolution of high-impact incidents, performing root cause analysis and implementing long-term fixes. Maintain relationships More ❯
Understanding of Peer-to-Peer technologies. Quick Learner: Ability to quickly adapt and learn new technologies. Security Awareness: Strong understanding and experience implementing best security practices. Nice to have High-Availability Concepts: Experience with high-availability concepts and best practices for ensuring system reliability. Additional Programming Skills: Knowledge of Go, Rust, or C++ is advantageous. Familiarity More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom Hybrid / WFH Options
Noir
/CD, Containerisation, Kubernetes, Terraform, Automation, Monitoring, Agile, Sports Betting, Cloud Engineer) Our client is a fast-paced and tech-driven sports betting company with a passion for delivering high-performance digital platforms to millions of users worldwide. Known for their innovation and reliability in the online gaming and betting industry, they are expanding their cloud operations team to … the forefront of architecting, deploying, and maintaining scalable and secure cloud infrastructure across Azure and/or AWS, enabling the rapid delivery of new features and services across their high-traffic platforms. The ideal Cloud Engineer candidate will have deep experience in cloud platforms (Azure and/or AWS), infrastructure-as-code (Terraform/ARM), CI/CD pipelines … and Kubernetes. You'll also have a keen eye for automation, performance monitoring, and system reliability in a 24/7 environment. Experience in the online betting, gaming or high-availability transactional industries is a strong plus . All Cloud Engineer positions come with the following benefits: Competitive salary based on experience (£45,000 - £65,000 range). More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom Hybrid / WFH Options
Noir
/CD, Containerisation, Kubernetes, Terraform, Automation, Monitoring, Agile, Sports Betting, Cloud Engineer) Our client is a fast-paced and tech-driven sports betting company with a passion for delivering high-performance digital platforms to millions of users worldwide. Known for their innovation and reliability in the online gaming and betting industry, they are expanding their cloud operations team to … the forefront of architecting, deploying, and maintaining scalable and secure cloud infrastructure across Azure and/or AWS, enabling the rapid delivery of new features and services across their high-traffic platforms. The ideal Cloud Engineer candidate will have deep experience in cloud platforms (Azure and/or AWS), infrastructure-as-code (Terraform/ARM), CI/CD pipelines … and Kubernetes. You'll also have a keen eye for automation, performance monitoring, and system reliability in a 24/7 environment. Experience in the online betting, gaming or high-availability transactional industries is a strong plus . All Cloud Engineer positions come with the following benefits: Competitive salary based on experience (£45,000 - £65,000 range). More ❯
with a primary focus on MongoDB. Your mission is to lead database administration efforts, define the MongoDB roadmap, and collaborate with IT Operations and other stakeholders to ensure the availability, performance, and security of our database systems. We are looking for a candidate who is passionate about database technologies and values collaboration, innovation, and continuous learning. You should have … you to: Lead the design, implementation, and maintenance of MongoDB database systems. Develop and enforce database security measures, policies, and best practices. Monitor and optimize database performance to ensure highavailability, scalability, and efficient resource utilization. Collaborate with development teams on database-related activities, including schema changes, data migrations, and performance tuning. Troubleshoot and resolve complex database issues … maintain robust backup and recovery strategies to ensure data integrity and recoverability. Plan and execute database upgrades, patches, and migrations. Implement and maintain database replication and clustering technologies for highavailability and disaster recovery. Document database configurations, procedures, and troubleshooting steps. Stay current with the latest database technologies, industry trends, and best practices. Mentor and provide guidance to More ❯
Senior Director - IT Network is responsible fordriving the strategy, execution, and optimization of BCG's global network infrastructureacrosson-premises, cloud, and hybrid environments. This role ensuresend-to-end automation, highavailability, security, and scalabilityof network services while integratingSD-WAN, cloud networking, and AI-driven automationto supportglobal business operations. The leader will overseenext-generation network architecture, operations, and transformation … ensuring a seamless and high-performance connectivity experience. Key Responsibilities: Strategic Leadership & Transformation: Define and execute amodern network platform strategy, integratingcloud networking, software-defined networking (SDN), and AI-driven automation. Ensureend-to-end network automationto improve operational efficiency, agility, and reliability. Drivezero-trust network securityprinciples, ensuring compliance and proactive threat mitigation. Establish aglobal observability and telemetry frameworkforreal-time network … ensuring agility and operational efficiency. IT Service Management & Operational Excellence: Establishnetwork reliability objectives, includingSLOs, SLIs, and error budgets. Implementreal-time incident detection and responseusing AI-driven network analytics. Ensurehigh availability, network resilience, and 24x7 operational support. Develop afollow-the-sun support model, ensuringglobal network performance optimization. Implementnetwork observability and predictive analyticstoproactively prevent outages. Security, Compliance & Risk Management: Drivezero-trust More ❯
Director - Network Platform Lead is responsible fordriving the strategy, execution, and optimization of BCG's global network infrastructureacrosson-premises, cloud, and hybrid environments. This role ensuresend-to-end automation, highavailability, security, and scalabilityof network services while integratingSD-WAN, cloud networking, and AI-driven automationto supportglobal business operations. The leader will overseenext-generation network architecture, operations, and transformation … ensuring a seamless and high-performance connectivity experience. Key Responsibilities: Strategic Leadership & Transformation: Define and execute amodern network platform strategy, integratingcloud networking, software-defined networking (SDN), and AI-driven automation. Ensureend-to-end network automationto improve operational efficiency, agility, and reliability. Drivezero-trust network securityprinciples, ensuring compliance and proactive threat mitigation. Establish aglobal observability and telemetry frameworkforreal-time network … ensuring agility and operational efficiency. IT Service Management & Operational Excellence: Establishnetwork reliability objectives, includingSLOs, SLIs, and error budgets. Implementreal-time incident detection and responseusing AI-driven network analytics. Ensurehigh availability, network resilience, and 24x7 operational support. Develop afollow-the-sun support model, ensuringglobal network performance optimization. Implementnetwork observability and predictive analyticstoproactively prevent outages. Security, Compliance & Risk Management: Drivezero-trust More ❯
Huddersfield, West Yorkshire, United Kingdom Hybrid / WFH Options
Applause IT Recruitment Ltd
Benefits Hours: Monday-Friday, 08:30-17:00 (Flexibility required) Are you a seasoned Lead PHP Developer ready to take a hands-on leadership role in a fast-paced, high-availability ecommerce environment? This is an opportunity for a PHP/Laravel expert to step into a strategic role leading development teams and technical delivery. You'll work … cloud-based deployment strategies (AWS, Azure, etc.) Maintain and optimise database architecture using MySQL and SQL Server What You'll Need: Strong PHP and Laravel development background Experience in high-availability and ecommerce applications Solid database knowledge: MySQL and SQL Server Familiar with Git, CI/CD, and agile practices Team leadership: stand-ups, sprints, mentoring, code reviews More ❯
Day You'll Be: Infrastructure & Operations: Participate in designing, implementing, and maintaining our infrastructure for reliability, scalability, and security. Support, monitor, and enhance live infrastructure and platform solutions for highavailability and performance. Assist in integrating our current infrastructure into TransUnion's group-wide cloud platform with minimal disruptions. Participate in migrating infrastructure from AWS to GCP, ensuring … troubleshooting skills. Knowledge of programming languages such as JavaScript, Node.js, PHP. Experience with version control systems like Git. Web server configuration (Apache, Nginx). Database management (MySQL, MongoDB), including highavailability and backups. Experience managing cloud providers, especially AWS and GCP. Familiarity with GCP services like Compute Engine, GKE, Cloud Storage, BigQuery, IAM. Configuration management and IT automation More ❯
Woking, Surrey, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
operations, and commercial offices. This role is pivotal in ensuring IT operations are resilient, secure, and aligned with the dynamic needs of the maritime and port services industry, delivering high performance across a complex operational landscape. Client Details The employer is a well-established organisation in the leisure, travel, and tourism industry. Description The Head of IT Operations will … operations, and commercial offices. This role is pivotal in ensuring IT operations are resilient, secure, and aligned with the dynamic needs of the maritime and port services industry, delivering high performance across a complex, 24/7 operational landscape. Duties and Responsibilities: Infrastructure & Cloud Management * Lead the architecture and lifecycle management of hybrid infrastructure supporting all operations, both onshore … scalability, performance, and disaster resilience across geographically dispersed operations. * Implement automation, Infrastructure as Code (IaC), and DevOps practices to modernize deployments and reduce downtime. Network & Telephony * Ensure secure and high-availability networks across port terminals, remote logistics sites, and central offices-including LAN/WAN, fibre, Wi-Fi, SD-WAN, and VPN connectivity. * Manage operational and technical delivery More ❯
management and operations expertise. As part of the AWS Managed Operations team, you will play a pivotal role in building and leading operations and development teams dedicated to delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers. For more information on ESC please check out our blog: Your responsibilities will encompass … of AWS services and technology. A typical day in this role involves collaborating with technology leaders, contributing to the enhancement of day-to-day operations, and ensuring improvements in availability, reliability, latency, performance, and efficiency of the ESC. As a leader, you will manage teams in both systems operations and development, emphasizing automation and engineering efficiency. The overarching goal … is to deliver scalable services and ensure a high-availability experience for EU customers. If you are an experienced professional ready for a challenging and impactful opportunity, we invite you to join our efforts in building a best-in-class development engineering and operations team that aligns with AWS' commitment to customer satisfaction and continual innovation. European Sovereign More ❯
in automation and operations. As part of the AWS Managed Operations team, you will play a pivotal role in building and leading operations and development teams dedicated to delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers. For more information on ESC please check out our blog: Utility Computing (UC) AWS … operating services in AWS Experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) • Experience scripting operating system tasks in Bash, Python, etc. • Proficient in operating 24x7 high-availability, distributed software applications Desire to dive deep into, and find opportunities to improve, the reliability, availability, and performance of distributed software systems Amazon is an equal More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
KO2 Embedded Recruitment Solutions LTD
and secure mission-critical systems. In this role the successful Senior Linux Engineer will be responsible for designing, deploying, and troubleshooting robust Linux-based environments-often in complex or high-availability configurations, whilst also providing consultancy-level support and expertise to a wide range of organisations As a key part of a highly skilled technical team, you'll … PostgreSQL, PHP, Python Networking: Static/dynamic routing, DNS, VPNs, and firewalls Containers & automation: Docker, Kubernetes, and CI/CD pipelines Cloud platforms: AWS, Azure, and Google Cloud Infrastructure: High-availability clusters, Pacemaker, filesystem replication, hybrid cloud environments, remote desktops Internal tools: Request Tracker (RT), MediaWiki, Icinga, and other custom utilities Skills and Experience Required: Proven experience in More ❯