76 to 100 of 289 High Availability Jobs

Senior Infrastructure Engineer

Hiring Organisation
Context Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£75000 - £80000/annum
Plan and execute migrations, including tenant-to-tenant Microsoft 365 and Active Directory consolidations Collaborate with managed service providers and internal teams to maintain high availability, security and compliance Support business continuity through proactive monitoring, incident management and disaster recovery planning Mentor and provide technical guidance to team ...

IBM Sterling Specialist

Hiring Organisation
Genesis10
Location
Charlotte, North Carolina, United States
Employment Type
Permanent
Salary
USD 7,194 Hourly
contract opportunity with a hybrid schedule requirement of a minimum of 3 days per week onsite. This role will analyze, strengthen, and work on high-volume, high-availability, and resilient applications. The IT Security Specialist will focus on identifying potential threats and risks and providing recommendations. This ...

Production Support Engineer (Full Stack Engineer)

Hiring Organisation
Solugenix Corp
Location
Colorado, United States
Employment Type
Permanent
Salary
USD 100,000 Annual
pipelines, version control systems (Git), and Agile environments. Ability to debug issues across application layers under time-sensitive conditions. Preferred Qualifications: Experience supporting high-traffic, production-grade applications. Exposure to cloud platforms (AWS, Azure, or GCP). Experience with DevOps practices, containerization, or infrastructure troubleshooting. Prior experience in financial … solving skills with a sense of ownership and accountability. Able to balance production support responsibilities with ongoing engineering work. Thrives in a fast-paced, high-impact environment with minimal supervision. Strong communication skills to work effectively with cross-functional teams. Responsibilities: Provide end-to-end production support for full ...

Application Coordinator

Hiring Organisation
EMBL-EBI
Location
Saffron Walden, Essex, South East, United Kingdom
Employment Type
Permanent, Work From Home
EBIs training mission. The role requires strong technical leadership, stakeholder management, and a holistic view of platform operations to ensure efficiency, cost control, and high service quality across all environments. In addition, the Platform Coordinator will lead the Solutions Architecture function, driving the standardisation of platform and solution architectures … plans for the platform team. Oversee the lifecycle of technical projects, ensuring timely delivery and adherence to best practices. Manage operational performance and ensure high availability of both on-premises and cloud platforms. Maintain and develop strong collaborative relationships with IT teams and EMBL-EBI service stakeholders. Lead ...

Observability Platform Engineer - Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
London, UK
core Production Engineering team, who are accountable for the reliability, operability and performance of the firm’s trading-critical systems, in an environment where availability, correctness and latency directly impact outcomes. The successful hire will own the reliability of business-critical systems, from observability design through to incident resolution … systemic improvement; lead high-severity incident management; and reduce toil with software engineering, primarily in Python but also with Golang, TypeScript, SQL and/or PowerShell. The role requires someone who can evidence outstanding problem-solving and the ability to solve problems on their own initiative. Requirements Debugging distributed ...

Observability SRE / Platform Engineer – Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
City of London, London, United Kingdom
core Production Engineering team, who are accountable for the reliability, operability and performance of the firm’s trading-critical systems, in an environment where availability, correctness and latency directly impact outcomes. The successful hire will own the reliability of business-critical systems, from observability design through to incident resolution … systemic improvement; lead high-severity incident management; and reduce toil with software engineering, primarily in Python but also with Golang, TypeScript, SQL and/or PowerShell. The role requires someone who can evidence outstanding problem-solving and the ability to solve problems on their own initiative. Requirements Debugging distributed ...

Streaming Platform Engineer

Hiring Organisation
Genesis10
Location
Plano, Texas, United States
Employment Type
Permanent
Salary
USD 150,000 Annual
able to work onsite up to 3 days per week. Description: We are seeking a Streaming Platform Engineer to design, implement, and operate high-throughput, low-latency event streaming systems using modern distributed messaging platforms (e.g., Apache Kafka, Apache Pulsar, Azure Event Hubs, Amazon Kinesis, Google Pub/… replication, and geo-redundancy. Implement schema management using Schema Registry, Pulsar Schema, Protobuf/Avro/JSON Schema, or cloud-native schema services. Ensure high availability, disaster recovery, and multi-region failover. Producer & Consumer Development Write scalable, fault-tolerant producers and consumers in Java, Scala, Python, or Go. ...

Data Ops Engineer

Hiring Organisation
Stackstudio Digital Ltd
Location
Norwich, Norfolk, East Anglia, United Kingdom
Employment Type
Contract
Contract Rate
From £400 to £450 per day
optimizing data pipelines and ensuring smooth data flow across the organization. The role bridges data engineering, operations, and DevOps practices to deliver reliable, high quality, and timely data for analytics, reporting, and business applications. Your Responsibilities 1. Data Pipeline Development & Maintenance Design, build, and maintain automated, scalable data pipelines … pipelines meet SLAs, quality standards, and security guidelines 2. Data Platform Operations Manage and monitor data platform operations using DataOps/DevOps practices Ensure high availability and reliability of data platforms (cloud or on prem) Troubleshoot pipeline failures and perform root cause analysis (RCA) 3. Automation & CI/ ...

Senior Online Specialist

Hiring Organisation
Skillsearch
Location
London Area, United Kingdom
design, development, and maintenance of online game services such as authentication, matchmaking, real time gameplay systems, and player data synchronization. Ensure services maintain high availability and low latency for players across multiple regions. • Cross Functional Collaboration Work closely with global game development teams to refine and implement technical … Mandarin Chinese is beneficial but not essential. • Proficiency in at least one backend programming language, with strong preference for C++ or Go. Experience building high performance, highly concurrent, and highly available systems is important, including work with threading or coroutine based architectures. • Solid understanding of distributed systems and backend ...

Internal Cloud Architect

Hiring Organisation
MBDA UK
Location
Bristol, Filton, Gloucestershire, United Kingdom
Employment Type
Permanent
Salary
£74000/annum
will have deep expertise in cloud infrastructure (OpenStack) and container orchestration (Red Hat OpenShift/Kubernetes), with a strong focus on scalability, automation, and high availability. You will play a key role in defining cloud strategy, leading platform architecture, and guiding engineering teams in delivering robust, secure, and performant … Define and implement cloud-native architecture patterns Integrate OpenStack with container platforms (OpenShift/Kubernetes) Drive automation using Infrastructure as Code (IaC) tools Ensure high availability, scalability, and disaster recovery strategies Establish security best practices across infrastructure and platforms Collaborate with DevOps, security, and application teams Provide technical ...

AWS Cloud Platform Manager

Hiring Organisation
Ciphr
Location
Reading, England, United Kingdom
identify and resolve issues, implement automation and monitoring solutions, and drive continuous improvement across our infrastructure. Your leadership will be instrumental in fostering a high-performing, innovative engineering culture, optimising system performance, and minimising downtime. You will also be responsible for managing cloud costs effectively, ensuring spend remains within … optimise cloud infrastructure usage to ensure cost efficiency, maintaining spend within agreed budgets. Ensure the optimal performance of cloud services , delivering a seamless and high-quality experience for end users. Take accountability of the on-call rota , ensuring appropriate coverage, managing participation, and leading rapid incident response to maintain ...

Linux Engineer

Hiring Organisation
VIQU IT
Location
London, Tachbrook, United Kingdom
Employment Type
Permanent
Salary
£65000 - £70000/annum
using Bash, Python, or Perl scripts. • Design, deploy, and manage Linux workloads in Azure. • Implement and manage virtual networks, storage, and Azure resources. • Ensure high availability and disaster recovery of critical systems. • Use Azure DevOps for CI/CD pipeline automation and infrastructure as code (IaC). • Collaborate ...

DevOps Systems Engineer

Hiring Organisation
Everest Computers Inc
Location
Roswell, Georgia, United States
Employment Type
Any
Salary
USD 130,000 Annual
identify and remediate vulnerabilities across the SDLC, ensuring compliance and reducing security risks. Designed and implemented automated resiliency/disaster recovery frameworks to ensure high availability and fault tolerance in cloud applications. Experience with Cloud application container Security and Access Control via OAuth2.0/OIDC/SAML/ ...

PowerShell DSC Engineer ArcGIS Enterprise Automation

Hiring Organisation
Stackstudio Digital Ltd
Location
Wokingham, Berkshire, South East, United Kingdom
Employment Type
Contract
Contract Rate
From £350 to £380 per day
event responses, ensuring ArcGIS Enterprise components remain fully functional and consistent across all VMSS nodes. Optimize VMSS deployment images and custom script extensions for high availability GIS workloads. Cloud & Infrastructure Support Support automated deployments in Azure cloud environments. Manage infrastructure provisioning scripts using Terraform. Optimize system performance ...

IT Network Engineer

Hiring Organisation
US Tech Solutions, Inc
Location
Charlotte, North Carolina, United States
Employment Type
Permanent
Salary
USD Annual
technologies to ensure resilient and efficient network performance. Wireless Network Engineering Architect and manage wireless infrastructure with a focus on MIST-based solutions, ensuring high availability, performance, and seamless user experience. Network Automation & Scripting Develop and maintain automation workflows using Ansible, Terraform and other tools. Leverage APIs ...

Senior Machine Learning Engineer (KServe + Kubernetes, building Kubernetes Clusters, PyTorch, TensorFlow)

Hiring Organisation
Capital One
Location
New York, United States
Employment Type
Permanent
Salary
USD Annual
applications using existing and emerging technology platforms. You'll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and performance of our machine learning applications. You'll have the opportunity to continuously learn and apply the latest innovations and best ...

Senior Machine Learning Engineer (KServe + Kubernetes, building Kubernetes Clusters, PyTorch, TensorFlow)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
applications using existing and emerging technology platforms. You'll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and performance of our machine learning applications. You'll have the opportunity to continuously learn and apply the latest innovations and best ...

Storage Senior SAN SME/Arch

Hiring Organisation
Square One Resources
Location
Birmingham, United Kingdom
Employment Type
Contract
Contract Rate
GBP 600 Daily
multi-cloud platforms. You will act as the subject matter expert for block storage architecture, with a strong focus on Pure Storage technologies, ensuring high-performance, scalable, and secure storage solutions for containerised workloads. Job Responsibilities/Objectives Define and lead storage architecture strategy for OpenShift clusters and workloads … storage with compute and networking layers. Ensure seamless support for containerised workloads and persistent storage requirements. Conduct performance assessments and drive optimisation initiatives. Implement high availability, redundancy, and disaster recovery strategies. Ensure compliance with banking security, regulatory, and data protection standards. Produce and maintain detailed technical documentation. Partner ...

DevSecOps Engineer

Hiring Organisation
Space Executive
Location
City of London, London, United Kingdom
with cutting-edge technologies, including GCP, Kubernetes, Terraform/OpenTofu, and modern CI/CD pipelines, while helping define best practices for a regulated, high-impact platform. What You’ll Do: Infrastructure & Automation: Build and maintain cloud infrastructure, CI/CD pipelines, and automated deployment workflows. Security & Compliance: Implement … securely. Incident Response: Contribute to operational and security incident handling to minimize downtime and risk. What You Bring: Strong experience designing, managing, and securing high-availability cloud infrastructure (GCP preferred). Expertise with Kubernetes (GKE), container runtime security, and cluster hardening. Solid skills in Infrastructure as Code (Terraform ...

Card Solutions Architects - Contract role - Sheffield - Immediate

Hiring Organisation
StrategicsIQ Ltd
Location
Sheffield, South Yorkshire, Yorkshire, United Kingdom
Employment Type
Contract
strong focus on banking and payments systems Deep domain knowledge of card processing systems (e.g., authorization, clearing, settlement, dispute management) Proven experience designing high-throughput, low-latency transactional systems Experience operating within large-scale, matrixed enterprise environments Strong background in cloud-native architecture (AWS, Azure, or GCP), including relevant … Data: Distributed data systems, RDBMS/NoSQL, data modelling, data consistency patterns Security: Secure-by-design principles, IAM, encryption, threat modelling, regulatory compliance Resilience: High availability, fault tolerance, disaster recovery, observability (logging, monitoring, tracing) Channels: Experience designing across web, mobile, API gateways, and backend orchestration layers Legacy Integration ...

AWS DevOps Engineer - Blackburn/ Hybrid

Hiring Organisation
Oscar Associates (UK) Limited
Location
Blackburn, Lancashire, North West, United Kingdom
Employment Type
Permanent
Actions, GitLab CI, or Jenkins) to move code from commit to production with zero friction. Kubernetes & Containers: Manage and scale EKS clusters, focusing on high availability, service mesh, and cost optimization. Serverless & Scaling: Architecting solutions utilizing Lambda, API Gateway, and DynamoDB to handle fluctuating global traffic. Security & Observability … Docker and orchestration (Kubernetes/EKS is a must). The MSP/Consultancy Edge: (Optional but preferred) Experience managing diverse client environments or high-traffic SaaS platforms. Problem Solver: The ability to troubleshoot complex distributed systems and perform root-cause analysis on production incidents. AWS DevOps Engineer - Blackburn ...

AWS Architect

Hiring Organisation
Ascendion
Location
London, United Kingdom
Employment Type
Permanent
secure AWS architectures for market data and pricing platforms Architect and govern API layers (REST/event-driven) for internal and external consumers Ensure high availability, performance, and resilience across critical financial systems Solutioning & DevOps Lead solution design across multiple delivery teams Define and embed DevOps and Infrastructure ...

AWS DATA ARCHITECT

Hiring Organisation
Aroha Technologies
Location
Torrance, California, United States
Employment Type
Permanent
Salary
USD Annual
Functions • Design data ingestion pipelines from structured and unstructured sources • Establish data governance, security, and compliance standards (IAM, encryption, data classification, access control) • Implement high availability, disaster recovery, and cost optimization strategies • Collaborate with Data Engineers on ETL/ELT pipeline design and performance tuning • Support advanced analytics ...

Senior Infrastructure Engineer

Hiring Organisation
Reed Technology
Location
Abingdon, Oxfordshire, South East, United Kingdom
Employment Type
Permanent
Salary
£45,000
including Virtual Machines, Networking, Storage, Backup, and Monitoring. Maintain and secure Active Directory, Group Policy, DNS, DHCP, and Azure AD (Entra ID) synchronisation. Ensure high availability and resilience across all servers, networks, and databases supporting business-critical applications. Deliver advanced troubleshooting and root cause analysis for complex infrastructure ...

Senior Azure Cloud Engineer (Exchange / O365)

Hiring Organisation
Trigyn Technologies Inc
Location
Philadelphia, Pennsylvania, United States
Employment Type
Permanent
Salary
USD Annual
while driving improvements, supporting new projects, and making sound architectural decisions. Responsibilities: • Operate and maintain the existing Azure landing zone and subscription structure • Ensure high availability, performance, and reliability of Azure workloads • Manage core services including networking, compute, storage, identity, and monitoring • Serve as the primary Azure subject ...