51 to 75 of 224 Permanent High Availability Jobs

Senior Full Stack Developer

Hiring Organisation
StackStudio.digital
Location
London, United Kingdom
Employment Type
Permanent
Python, React, TypeScript and DevOps tooling. You will work on cutting-edge projects, develop scalable applications and collaborate with cross-functional teams to deliver high-quality solutions. Senior Full Stack Developer Key Responsibilities: Own the end-to-end architecture across front-end, back-end, APIs, data and system integrations. … data access strategies. Architect cloud-native solutions on AWS, Azure or GCP, incorporating CI/CD pipelines and Infrastructure as Code. Build for high availability, disaster recovery, cost optimisation and embedded observability. Apply secure coding practices and design robust authentication, authorisation and data-protection models. Ensure compliance with ...

DevOps / Cloud Engineer

Hiring Organisation
Lynx Recruitment Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
Additional Desirable Skills Experience with tools such as Ansible, Maven, or Gradle Knowledge of AWS networking and database services Experience with performance tuning and high-availability systems ...

Senior Infrastructure Engineer

Hiring Organisation
Context Recruitment
Location
London, United Kingdom
Employment Type
Permanent
Salary
£75000 - £80000/annum
Plan and execute migrations, including tenant-to-tenant Microsoft 365 and Active Directory consolidations Collaborate with managed service providers and internal teams to maintain high availability, security and compliance Support business continuity through proactive monitoring, incident management and disaster recovery planning Mentor and provide technical guidance to team ...

IBM Sterling Specialist

Hiring Organisation
Genesis10
Location
Charlotte, North Carolina, United States
Employment Type
Permanent
Salary
USD 7,194 Hourly
contract opportunity with a hybrid schedule requirement of a minimum of 3 days per week onsite. This role will analyze, strengthen, and work on high-volume, high-availability, and resilient applications. The IT Security Specialist will focus on identifying potential threats and risks and providing recommendations. This ...

Production Support Engineer (Full Stack Engineer)

Hiring Organisation
Solugenix Corp
Location
Colorado, United States
Employment Type
Permanent
Salary
USD 100,000 Annual
pipelines, version control systems (Git), and Agile environments. Ability to debug issues across application layers under time-sensitive conditions. Preferred Qualifications: Experience supporting high-traffic, production-grade applications. Exposure to cloud platforms (AWS, Azure, or GCP). Experience with DevOps practices, containerization, or infrastructure troubleshooting. Prior experience in financial … solving skills with a sense of ownership and accountability. Able to balance production support responsibilities with ongoing engineering work. Thrives in a fast-paced, high-impact environment with minimal supervision. Strong communication skills to work effectively with cross-functional teams. Responsibilities: Provide end-to-end production support for full ...

Application Coordinator

Hiring Organisation
EMBL-EBI
Location
Saffron Walden, Essex, South East, United Kingdom
Employment Type
Permanent, Work From Home
EBIs training mission. The role requires strong technical leadership, stakeholder management, and a holistic view of platform operations to ensure efficiency, cost control, and high service quality across all environments. In addition, the Platform Coordinator will lead the Solutions Architecture function, driving the standardisation of platform and solution architectures … plans for the platform team. Oversee the lifecycle of technical projects, ensuring timely delivery and adherence to best practices. Manage operational performance and ensure high availability of both on-premises and cloud platforms. Maintain and develop strong collaborative relationships with IT teams and EMBL-EBI service stakeholders. Lead ...

Observability Platform Engineer - Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
London, UK
core Production Engineering team, who are accountable for the reliability, operability and performance of the firm’s trading-critical systems, in an environment where availability, correctness and latency directly impact outcomes. The successful hire will own the reliability of business-critical systems, from observability design through to incident resolution … systemic improvement; lead high-severity incident management; and reduce toil with software engineering, primarily in Python but also with Golang, TypeScript, SQL and/or PowerShell. The role requires someone who can evidence outstanding problem-solving and the ability to solve problems on their own initiative. Requirements Debugging distributed ...

Observability SRE / Platform Engineer – Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
City of London, London, United Kingdom
core Production Engineering team, who are accountable for the reliability, operability and performance of the firm’s trading-critical systems, in an environment where availability, correctness and latency directly impact outcomes. The successful hire will own the reliability of business-critical systems, from observability design through to incident resolution … systemic improvement; lead high-severity incident management; and reduce toil with software engineering, primarily in Python but also with Golang, TypeScript, SQL and/or PowerShell. The role requires someone who can evidence outstanding problem-solving and the ability to solve problems on their own initiative. Requirements Debugging distributed ...

Streaming Platform Engineer

Hiring Organisation
Genesis10
Location
Plano, Texas, United States
Employment Type
Permanent
Salary
USD 150,000 Annual
able to work onsite up to 3 days per week. Description: We are seeking a Streaming Platform Engineer to design, implement, and operate high-throughput, low-latency event streaming systems using modern distributed messaging platforms (e.g., Apache Kafka, Apache Pulsar, Azure Event Hubs, Amazon Kinesis, Google Pub/… replication, and geo-redundancy. Implement schema management using Schema Registry, Pulsar Schema, Protobuf/Avro/JSON Schema, or cloud-native schema services. Ensure high availability, disaster recovery, and multi-region failover. Producer & Consumer Development Write scalable, fault-tolerant producers and consumers in Java, Scala, Python, or Go. ...

Senior Online Specialist

Hiring Organisation
Skillsearch
Location
London Area, United Kingdom
design, development, and maintenance of online game services such as authentication, matchmaking, real time gameplay systems, and player data synchronization. Ensure services maintain high availability and low latency for players across multiple regions. • Cross Functional Collaboration Work closely with global game development teams to refine and implement technical … Mandarin Chinese is beneficial but not essential. • Proficiency in at least one backend programming language, with strong preference for C++ or Go. Experience building high performance, highly concurrent, and highly available systems is important, including work with threading or coroutine based architectures. • Solid understanding of distributed systems and backend ...

Internal Cloud Architect

Hiring Organisation
MBDA UK
Location
Bristol, Filton, Gloucestershire, United Kingdom
Employment Type
Permanent
Salary
£74000/annum
will have deep expertise in cloud infrastructure (OpenStack) and container orchestration (Red Hat OpenShift/Kubernetes), with a strong focus on scalability, automation, and high availability. You will play a key role in defining cloud strategy, leading platform architecture, and guiding engineering teams in delivering robust, secure, and performant … Define and implement cloud-native architecture patterns Integrate OpenStack with container platforms (OpenShift/Kubernetes) Drive automation using Infrastructure as Code (IaC) tools Ensure high availability, scalability, and disaster recovery strategies Establish security best practices across infrastructure and platforms Collaborate with DevOps, security, and application teams Provide technical ...

AWS Cloud Platform Manager

Hiring Organisation
Ciphr
Location
Reading, England, United Kingdom
identify and resolve issues, implement automation and monitoring solutions, and drive continuous improvement across our infrastructure. Your leadership will be instrumental in fostering a high-performing, innovative engineering culture, optimising system performance, and minimising downtime. You will also be responsible for managing cloud costs effectively, ensuring spend remains within … optimise cloud infrastructure usage to ensure cost efficiency, maintaining spend within agreed budgets. Ensure the optimal performance of cloud services , delivering a seamless and high-quality experience for end users. Take accountability of the on-call rota , ensuring appropriate coverage, managing participation, and leading rapid incident response to maintain ...

Linux Engineer

Hiring Organisation
VIQU IT
Location
London, Tachbrook, United Kingdom
Employment Type
Permanent
Salary
£65000 - £70000/annum
using Bash, Python, or Perl scripts. • Design, deploy, and manage Linux workloads in Azure. • Implement and manage virtual networks, storage, and Azure resources. • Ensure high availability and disaster recovery of critical systems. • Use Azure DevOps for CI/CD pipeline automation and infrastructure as code (IaC). • Collaborate ...

DevOps Systems Engineer

Hiring Organisation
Everest Computers Inc
Location
Roswell, Georgia, United States
Employment Type
Any
Salary
USD 130,000 Annual
identify and remediate vulnerabilities across the SDLC, ensuring compliance and reducing security risks. Designed and implemented automated resiliency/disaster recovery frameworks to ensure high availability and fault tolerance in cloud applications. Experience with Cloud application container Security and Access Control via OAuth2.0/OIDC/SAML/ ...

IT Network Engineer

Hiring Organisation
US Tech Solutions, Inc
Location
Charlotte, North Carolina, United States
Employment Type
Permanent
Salary
USD Annual
technologies to ensure resilient and efficient network performance. Wireless Network Engineering Architect and manage wireless infrastructure with a focus on MIST-based solutions, ensuring high availability, performance, and seamless user experience. Network Automation & Scripting Develop and maintain automation workflows using Ansible, Terraform and other tools. Leverage APIs ...

Senior Machine Learning Engineer (KServe + Kubernetes, building Kubernetes Clusters, PyTorch, TensorFlow)

Hiring Organisation
Capital One
Location
New York, United States
Employment Type
Permanent
Salary
USD Annual
applications using existing and emerging technology platforms. You'll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and performance of our machine learning applications. You'll have the opportunity to continuously learn and apply the latest innovations and best ...

Senior Machine Learning Engineer (KServe + Kubernetes, building Kubernetes Clusters, PyTorch, TensorFlow)

Hiring Organisation
Capital One
Location
Mc Lean, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
applications using existing and emerging technology platforms. You'll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and performance of our machine learning applications. You'll have the opportunity to continuously learn and apply the latest innovations and best ...

DevSecOps Engineer

Hiring Organisation
Space Executive
Location
City of London, London, United Kingdom
with cutting-edge technologies, including GCP, Kubernetes, Terraform/OpenTofu, and modern CI/CD pipelines, while helping define best practices for a regulated, high-impact platform. What You’ll Do: Infrastructure & Automation: Build and maintain cloud infrastructure, CI/CD pipelines, and automated deployment workflows. Security & Compliance: Implement … securely. Incident Response: Contribute to operational and security incident handling to minimize downtime and risk. What You Bring: Strong experience designing, managing, and securing high-availability cloud infrastructure (GCP preferred). Expertise with Kubernetes (GKE), container runtime security, and cluster hardening. Solid skills in Infrastructure as Code (Terraform ...

AWS DevOps Engineer - Blackburn/ Hybrid

Hiring Organisation
Oscar Associates (UK) Limited
Location
Blackburn, Lancashire, North West, United Kingdom
Employment Type
Permanent
Actions, GitLab CI, or Jenkins) to move code from commit to production with zero friction. Kubernetes & Containers: Manage and scale EKS clusters, focusing on high availability, service mesh, and cost optimization. Serverless & Scaling: Architecting solutions utilizing Lambda, API Gateway, and DynamoDB to handle fluctuating global traffic. Security & Observability … Docker and orchestration (Kubernetes/EKS is a must). The MSP/Consultancy Edge: (Optional but preferred) Experience managing diverse client environments or high-traffic SaaS platforms. Problem Solver: The ability to troubleshoot complex distributed systems and perform root-cause analysis on production incidents. AWS DevOps Engineer - Blackburn ...

AWS Architect

Hiring Organisation
Ascendion
Location
London, United Kingdom
Employment Type
Permanent
secure AWS architectures for market data and pricing platforms Architect and govern API layers (REST/event-driven) for internal and external consumers Ensure high availability, performance, and resilience across critical financial systems Solutioning & DevOps Lead solution design across multiple delivery teams Define and embed DevOps and Infrastructure ...

AWS DATA ARCHITECT

Hiring Organisation
Aroha Technologies
Location
Torrance, California, United States
Employment Type
Permanent
Salary
USD Annual
Functions • Design data ingestion pipelines from structured and unstructured sources • Establish data governance, security, and compliance standards (IAM, encryption, data classification, access control) • Implement high availability, disaster recovery, and cost optimization strategies • Collaborate with Data Engineers on ETL/ELT pipeline design and performance tuning • Support advanced analytics ...

GCP Devops Lead

Hiring Organisation
Infoplus Technologies UK Ltd
Location
Bristol, Somerset, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
throughout the platform life cycle, safeguarding organisational and customer data. Performance & Reliability: Define, monitor, and operate against service level objectives (SLOs/SLIs), ensuring high availability, performance, and fault tolerance. Continuous Improvement: Drive automation, observability, and performance tuning to reduce manual effort and improve platform reliability. Collaboration: Work ...

Senior Infrastructure Engineer

Hiring Organisation
Reed Technology
Location
Abingdon, Oxfordshire, South East, United Kingdom
Employment Type
Permanent
Salary
£45,000
including Virtual Machines, Networking, Storage, Backup, and Monitoring. Maintain and secure Active Directory, Group Policy, DNS, DHCP, and Azure AD (Entra ID) synchronisation. Ensure high availability and resilience across all servers, networks, and databases supporting business-critical applications. Deliver advanced troubleshooting and root cause analysis for complex infrastructure ...

Senior Azure Cloud Engineer (Exchange / O365)

Hiring Organisation
Trigyn Technologies Inc
Location
Philadelphia, Pennsylvania, United States
Employment Type
Permanent
Salary
USD Annual
while driving improvements, supporting new projects, and making sound architectural decisions. Responsibilities: • Operate and maintain the existing Azure landing zone and subscription structure • Ensure high availability, performance, and reliability of Azure workloads • Manage core services including networking, compute, storage, identity, and monitoring • Serve as the primary Azure subject ...

Kafka Data Platform Engineer

Hiring Organisation
Brains Workgroup, Inc
Location
New York, United States
Employment Type
Permanent
Salary
USD Annual
Configure and manage Kafka components including brokers, Kraft, MirrorMaker 2, Explore Kafka Connect, and Schema Registry concepts and implementations. Ensure performance, reliability, scalability, and high availability of the Kafka platform. Implement cluster monitoring, logging, and alerting using enterprise observability tools. Manage capacity planning, partition strategies, retention policies ...