High Availability Jobs

1 to 25 of 466 High Availability Jobs

Senior Cloud & Infrastructure Engineer - MSP/MSSP with Security Clearance

Herndon, Virginia, United States
Industrial Security Integrators, LLC
Department, reporting to the Technical Support Manager. This role is responsible for technical execution, optimization, and automation of hybrid cloud, infrastructure, and MSP/MSSP tool stack solutions, ensuring high availability, security, and operational efficiency across both internal and client environments. This position is collaborative with Cybersecurity, Compliance, and Program Management teams to ensure that infrastructure solutions align … with security frameworks, operational best practices, and client needs. Success in this role is measured by improvements in customer retention, profitability, and customer satisfaction, achieved through strong client communication, high-performing teams, automation, and technical mentorship. Key Responsibilities Cloud & Infrastructure Engineering • Design, implement, and optimize hybrid cloud environments (Azure, AWS, GCC/GCC High) with a focus on … automation, scalability, and performance. • Develop and implement automation strategies (PowerShell, Python, Ansible) to streamline provisioning, monitoring, and system management. • Maintain 99.99% uptime and high availability through proactive infrastructure monitoring, redundancy strategies, and disaster recovery planning. • Ensure compliance with NIST 800-171, CMMC, and FedRAMP, partnering with Cybersecurity & Compliance teams. Technical Leadership & Tier 3 Support • Act as the highest More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Delta Capita
versioning. Containerization and Orchestration: Deploy, manage, and provide ongoing support for containerized applications using Kubernetes, including Amazon EKS (Elastic Kubernetes Service) and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance. Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability … OAuth2, and SAML Single Sign-On (SSO) to ensure secure authentication and authorization across services. Database Technologies: Manage and optimize database systems, including SQL databases and Mongo DB, ensuring high availability, performance tuning, and data security. CI/CD Practices: Automate manual processes to enhance operational efficiency, employing Continuous Integration/Continuous Deployment (CI/CD) best practices … languages such as Java, TypeScript, and Python to automate tasks and manage configurations. Load Balancing: Implement and maintain load balancing solutions to ensure optimal distribution of application traffic and high availability. Collaboration with Development Teams: Collaborate with software engineering teams to design, develop, and maintain robust systems and solutions, including RESTful APIs, ensuring seamless integration across platforms. Post-Mortem More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior SQL DBA - Sunderland

Sunderland, Tyne and Wear, Tyne & Wear, United Kingdom
Hybrid / WFH Options
Randstad Technologies Recruitment
join an experienced IT team and take a lead role in the ongoing development, optimisation, and resilience of the organisation's database environment. You'll be responsible for maintaining high availability, supporting integrations across platforms, and ensuring the reliability and performance of systems critical to business operations. Key Responsibilities: Configure and manage high availability and disaster … recovery solutions including Always On Availability Groups, mirroring, and clustering. Implement and test backup and recovery procedures to safeguard data. Monitor performance metrics and carry out tuning and optimisation as required. Support development and integration efforts across cloud and on-prem environments. Use version control tools (e.g. GitHub, Bitbucket) to manage database scripts and schema changes. Maintain comprehensive technical … services. Provide 2nd/3rd line support and investigate root causes of system issues. What We're Looking For: Solid experience in SQL Server database administration. Strong understanding of high availability, backup, and recovery strategies. Proficient in writing and troubleshooting T-SQL. Experience with ETL tools (e.g. SSIS, Azure Data Factory, Informatica, Talend). Familiarity with version control More ❯
Employment Type: Permanent
Salary: £50000 - £52000/annum
Posted:

Database Administrator with Security Clearance

Longmont, Colorado, United States
Caribou Thunder, LLC
Secret (TS/SCI Preferred) Build the mission-critical database infrastructure that powers national security operations. A Day in the Life - What you'll do • Design, build, and maintain high-availability database infrastructure for multi-user, mission-critical systems supporting satellite ground systems. • Implement database security, clustering, failover, and backup solutions to ensure continuous uptime, data integrity, and … Engineering, Information Systems, or related field) • Active Secret security clearance (TS/SCI strongly preferred) • Strong interpersonal skills; able to work cross-functionally with diverse technical teams • Passionate about high-availability systems, security hardening, and mission support Core Skills Database Platforms & Architecture: • PostgreSQL, MySQL, Microsoft SQL Server (Advanced experience required) • High-Availability Architecture, Clustering, Automated Failover More ❯
Employment Type: Permanent
Salary: USD 180,000 Annual
Posted:

Senior Software Engineer

London, United Kingdom
Visa Inc
taking responsibility for your services and the technology within them. These roles fit in to squads who are building out brand new parts to our payments platform, focusing on high availability, cloud native, microservice concepts. You will work as the Lead Engineer in your squad, leading on discussions around technical direction and systems design, as well as mentoring … teams (Scrum or Kanban) Expert knowledge of Docker, AWS (public cloud) and Kafka Ability to communicate effectively with technical and non-technical stakeholders Modern Cloud-Native architectures and practices (high availability, high scalability, microservices, 12-factor apps, CI/CD, automation and observability) TDD, BDD and Contract testing Experience in a DevOps environment or willingness to work More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Linux DevOps Engineer

Düsseldorf, Nordrhein-Westfalen, Germany
Utimaco Management Services GmbH
logging solutions with Grafana Loki to ensure system health, performance, and rapid issue detection. Deploy, configure, and optimize Linux applications like HAProxy, NGINX , and other critical services to ensure high availability and scalability. Drive Infrastructure as Code (IaC) automation using Ansible , enabling scalable, repeatable, and reliable infrastructure deployments. Oversee Linux systems across global data centers, ensuring uptime, implementing … on experience with centralized monitoring solutions like Prometheus and logging platforms like Grafana Loki . In-depth knowledge of Linux-based applications such as HAProxy, NGINX , and experience with high-availability configurations and performance optimization. Practical experience with Infrastructure as Code (IaC) using Ansible , and a mindset for automating operational processes. Familiarity with Linux security concepts (e.g., firewallD … SELinux, OpenSCAP ) and failover/high-availability strategies in distributed data center environments . Fluent in English (written and spoken), with strong documentation, teamwork, and problem-solving skills. Application Process After your application, the first thing you will do is get to know your potential new team leader in a Teams Interview. After that, we look forward to More ❯
Employment Type: Permanent
Salary: EUR Annual
Posted:

Senior Software Engineer, Transaction Tracing

London, United Kingdom
Chainalysis Inc
to our customers. Responsibilities Become part of an established team adept at collaboration and task allocation. A team which can focus on generating direct impact. Make key contributions to high availability solutions in close collaboration with your team through your ability and willingness to take ownership and assist where needed. Your contributions will be thorough and result in … Spring-based backend services. Experience in the full lifecycle of service management, from initial development to continuous operation. A deep understanding of the critical aspects of service scalability and high availability as well as monitoring and maintaining deployed features and services ensuring optimal performance and reliability. Database management systems experience including replication, high availability, performance tuning More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Developer

Guildford, Surrey, England, United Kingdom
Jonothan Bosworth
Are you an experienced systems-level engineer craving impact We’re seeking a Senior Software Developer versed in Rust, or an equivalent systems language, who thrives in high-availability, mission-critical environments. Our client is a fast-growing technology provider delivering next-generation communications solutions to a global customer base. With ambitious growth plans and investment backing, this … forward-thinking engineering team where you’ll make a real impact. You’ll play a key role in architecting, building, and optimising telecommunications systems in Rust , contributing to secure, high-performance, and scalable solutions used worldwide. You’ll collaborate closely across DevOps, API (Java), front-end, and database teams, and be empowered to drive meaningful architectural and reliability improvements. … What You’ll Do Build and scale : Develop production-grade features in Rust (or onboard quickly if transitioning from C/C++). High-availability focus : Help ensure uptime and performance in a real-time, mission-critical telecom platform. Collaborate broadly : Work across disciplines to improve stability, maintainability, and scalability. Mentor and learn : Benefit from peer mentoring, and More ❯
Employment Type: Full-Time
Salary: £65,000 - £75,000 per annum
Posted:

Infrastructure Engineer Operations

Bracknell, Berkshire, England, United Kingdom
Jam Management Consultancy Limited T/A JAM RECRUITMENT
Participate in incident response, threat simulation, and operational runbooks. Troubleshooting & Collaboration Provide 3rd line support, collaborating with 1st and 2nd line teams. Partner with developers to support seamless deployments. High Availability & Flexibility Implement high availability and disaster recovery systems. Support a global, always-on environment with adaptability to evolving needs. About You Bachelor’s degree in More ❯
Employment Type: Full-Time
Salary: £42,000 - £50,000 per annum
Posted:

Senior Database Administrator

Central London, London, United Kingdom
DXC Technology
Technology, you will be responsible for designing, implementing, and maintaining robust database solutions that support enterprise infrastructure services. This role involves working with cloud and on-premises databases, ensuring high availability, security, and performance while supporting database migrations, automation, and modernization efforts. You will collaborate with cross-functional teams to optimize database architectures and contribute to the continuous … improvement of infrastructure services. Key Responsibilities: Lead and manage database infrastructure services, including deployment, migration, and administration of databases (SQL, Oracle, PostgreSQL, etc.). Ensure high availability, security, and optimal performance of database environments. Support the development and execution of database infrastructure strategies and modernization projects. Monitor and maintain database health, backups, disaster recovery, and performance tuning. Collaborate More ❯
Employment Type: Permanent
Posted:

SRE/Infrastructure Engineer

Basingstoke, Hampshire, United Kingdom
InfoSum
management, deployment, and monitoring. Implement infrastructure as code (IaC) practices using tools such as Terraform and Ansible. Monitoring and Alerting: Implement monitoring solutions to track the health, performance, and availability of infrastructure components and applications. Configure alerting mechanisms to notify teams of potential issues and proactively address them before they impact users. Incident Response and Root Cause Analysis: Participate … enhancements to ensure optimal performance and resource utilization. Security and Compliance: Implement security controls, and respond to security incidents in accordance with established policies and procedures. Disaster Recovery and High Availability: Design and implement disaster recovery (DR) and high availability (HA) solutions to ensure business continuity and minimize downtime. Develop and test DR plans, implement failover More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

SQL DBA

Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom
Anson Mccade
You must have the permanent right to work in the UK. What You'll Be Doing Manage and maintain SQL Server environments (on-premise and Azure) Support and optimise high-availability configurations and backup strategies Collaborate with developers and technical teams to improve schema design, stored procedures, and overall SQL performance Implement and monitor SQL Server agent jobs … a genuine desire to learn and solve problems Strong appreciation for robust and well-documented systems A collaborative, solutions-oriented approach Nice-to-Haves Powershell scripting (dbatools, SqlServer modules) High Availability (Always-On, Basic Availability Groups) SQL Server configuration and partitioning SQL Snapshots, Change Data Capture Understanding of SAN arrays (Nimble preferred, PURE or others also suitable … services experience is a plus but not required Why Join Us? Make a real impact - have ownership from day one Work alongside top-tier engineering talent in a collaborative, high-performance team Solve real-world business problems closely aligned with market dynamics Engage directly with users - see the results of your work in production Learn and grow - deepen your More ❯
Employment Type: Permanent
Posted:

Senior Site Reliability Engineer

Addison, Texas, United States
INSPYR Solutions
reliability and innovation. Direct experience building, launching, configuring, and maintaining AWS and/or Microsoft Azure cloud resources. • Expertise preferred in implementing methodologies for Automation, Continuous Integration, Continuous Delivery, High Availability, High Scalability, Monitoring, Logging, Security and Governance Experience with Terraform and a strong understanding of Infrastructure as Code (IaC) principles. Strong scripting knowledge using languages such More ❯
Employment Type: Permanent
Salary: USD 150,000 Annual
Posted:

Senior Dev Ops Engineer

United Kingdom
Hybrid / WFH Options
Deekay Technical Recruitment
expertise in network appliances to join our growing infrastructure and DevOps team. The ideal candidate will play a key role in designing, implementing, migrating and maintaining secure, scalable, and high-performance infrastructure, with a particular focus on network and cloud environments. This role requires advanced hands-on experience with F5, Palo Alto, CheckPoint firewalls, Zscalar and AWS, alongside fluency … service, single pattern for the service. Configure and manage network security appliances, particularly Palo Alto and CheckPoint firewalls in enterprise environments. Design, deploy, and manage AWS cloud infrastructure, ensuring high availability, security, and per-formance. Implement and maintain CI/CD pipelines using tools like Jenkins, GitLab CI. Automate infrastructure provisioning using IaC/CaC tools such as … and more. Practical knowledge of DevOps tools: Git, Jenkins, Docker, Ansible, Terraform. Strong scripting skills (Bash, Python, or equivalent). Experience in managing production systems, preferably in regulated or high-availability environ-ments. Experience in mentoring junior members of the team More ❯
Employment Type: Contract
Rate: GBP 650 - 750 Daily
Posted:

Senior Dev Ops Engineer

North West, United Kingdom
Hybrid / WFH Options
Deekay Technical Recruitment
expertise in network appliances to join our growing infrastructure and DevOps team. The ideal candidate will play a key role in designing, implementing, migrating and maintaining secure, scalable, and high-performance infrastructure, with a particular focus on network and cloud environments. This role requires advanced hands-on experience with F5, Palo Alto, CheckPoint firewalls, Zscalar and AWS, alongside fluency … service, single pattern for the service. Configure and manage network security appliances, particularly Palo Alto and CheckPoint firewalls in enterprise environments. Design, deploy, and manage AWS cloud infrastructure, ensuring high availability, security, and per-formance. Implement and maintain CI/CD pipelines using tools like Jenkins, GitLab CI. Automate infrastructure provisioning using IaC/CaC tools such as … and more. Practical knowledge of DevOps tools: Git, Jenkins, Docker, Ansible, Terraform. Strong scripting skills (Bash, Python, or equivalent). Experience in managing production systems, preferably in regulated or high-availability environ-ments. Experience in mentoring junior members of the team More ❯
Employment Type: Contract
Rate: £650 - £750/day
Posted:

Postgres DBA (Experienced/Senior Level)

United Kingdom
Hybrid / WFH Options
Exact IT Resources Ltd
in the fast growing OpenSource Database market. Our client are looking for a talented and experienced PostgreSQL Database Administrator (DBA) to manage, optimize, and maintain PostgreSQL databases in a high-performance and mission-critical environment. You will work closely with teams across the organisation to ensure database availability, performance, security, and scalability. You will have strong knowledge in … database tuning, backups, high availability, monitoring systems, and automation (especially with tools like Ansible). Your responsibility will be across the following core areas: Installation Install and configure new database Servers using best practices. Knowledge of High Availability (HA) and Disaster Recovery (DR). Upgrade minor and major versions. Backup and recovery Ensure all database Servers More ❯
Employment Type: Permanent
Salary: GBP 65,000 - 70,000 Annual
Posted:

Global Platform Team Lead and Senior Director - IT Network

London, United Kingdom
The Boston Consulting Group GmbH
Senior Director - IT Network is responsible fordriving the strategy, execution, and optimization of BCG's global network infrastructureacrosson-premises, cloud, and hybrid environments. This role ensuresend-to-end automation, high availability, security, and scalabilityof network services while integratingSD-WAN, cloud networking, and AI-driven automationto supportglobal business operations. The leader will overseenext-generation network architecture, operations, and transformation … ensuring a seamless and high-performance connectivity experience. Key Responsibilities: Strategic Leadership & Transformation: Define and execute amodern network platform strategy, integratingcloud networking, software-defined networking (SDN), and AI-driven automation. Ensureend-to-end network automationto improve operational efficiency, agility, and reliability. Drivezero-trust network securityprinciples, ensuring compliance and proactive threat mitigation. Establish aglobal observability and telemetry frameworkforreal-time network … ensuring agility and operational efficiency. IT Service Management & Operational Excellence: Establishnetwork reliability objectives, includingSLOs, SLIs, and error budgets. Implementreal-time incident detection and responseusing AI-driven network analytics. Ensurehigh availability, network resilience, and 24x7 operational support. Develop afollow-the-sun support model, ensuringglobal network performance optimization. Implementnetwork observability and predictive analyticstoproactively prevent outages. Security, Compliance & Risk Management: Drivezero-trust More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Database Administrator with Security Clearance

Longmont, Colorado, United States
Kaztronix
maintaining the database infrastructure to support various product teams database needs. The focus of this role will be on creating a robust and reliable platform, with an emphasis on high availability, security, and scalability, rather than data management or analytics. The Database Engineer will work closely with the technical team to ensure the database infrastructure is properly secured … requirements. The ideal candidate will have a strong background in database infrastructure development, security, and maintenance, and will be able to collaborate effectively with the technical team to deliver high-quality solutions. Desired Skills: RHEL Linux knowledge and experience Advanced Experience with PostgreSQL, MySQL and Microsoft SQL Server database platforms High-availability Clustering and automated failover Architecture More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Linux Systems Engineer

Leeds, West Yorkshire, United Kingdom
Hybrid / WFH Options
Context Recruitment
Linux Systems Engineer - Leading Managed Services Provider A leading Managed Services Provider is seeking a skilled Linux Systems Engineer to join their high-performing infrastructure team. This role offers the opportunity to work on enterprise-scale environments, delivering secure, scalable solutions across a diverse client base. Responsibilities: Design, implement, and maintain Linux-based systems in high-availability … PowerShell, Bash, Ruby, and Python. Work with virtualisation platforms including VMware, and deploy containerisation technologies such as Docker. Manage load balancing and clustering solutions including HAProxy, VTMs, and other high-availability architectures. Collaborate with cross-functional teams to deliver resilient infrastructure solutions. Requirements: Proven experience in Linux systems engineering within enterprise or MSP environments. Strong understanding of networking More ❯
Employment Type: Permanent
Salary: £40000/annum
Posted:

Tier2 Deskside/Systems Technician

Arlington, Texas, United States
Dallas
environment that encourages career development, gaining valuable skills in time management, critical thinking, and complex problem-solving. The ideal candidate is passionate about providing exceptional customer service, thrives in high-availability environments, and enjoys the variety that comes with supporting both routine IT needs and emergency response situations. Duties and Responsibilities •Provide on-site 7x24 deskside support and … operations •Process tickets and update CMDB using ServiceNow ITSM system •Coordinate with Tier 3 teams for system changes and upgrades •Support disaster recovery and continuity of operations exercises •Maintain high-availability environments through virtualization and redundancy best practices •Perform system maintenance tasks including software upgrades, backups, and recovery •Review systems weekly for critical updates and security threats •Provide … certifications •Microsoft certifications - Experience in software testing, system administration, or IT operations •Background in enterprise IT environments and patch deployment •Knowledge of test automation frameworks •Experience supporting government or high-security environments •ITIL framework knowledge (certification training will be provided Education Bachelor's degree preferred, or equivalent relevant experience in IT support, system administration, or related technical field. Pay More ❯
Employment Type: Permanent
Salary: USD 3,442 Hourly
Posted:

Head of IT Operations

Woking, Surrey, England, United Kingdom
Hybrid / WFH Options
Michael Page Technology
operations, and commercial offices. This role is pivotal in ensuring IT operations are resilient, secure, and aligned with the dynamic needs of the maritime and port services industry, delivering high performance across a complex operational landscape. Client Details The employer is a well-established organisation in the leisure, travel, and tourism industry. Description The Head of IT Operations will … operations, and commercial offices. This role is pivotal in ensuring IT operations are resilient, secure, and aligned with the dynamic needs of the maritime and port services industry, delivering high performance across a complex, 24/7 operational landscape. Duties and Responsibilities: Infrastructure & Cloud Management * Lead the architecture and lifecycle management of hybrid infrastructure supporting all operations, both onshore … scalability, performance, and disaster resilience across geographically dispersed operations. * Implement automation, Infrastructure as Code (IaC), and DevOps practices to modernize deployments and reduce downtime. Network & Telephony * Ensure secure and high-availability networks across port terminals, remote logistics sites, and central offices-including LAN/WAN, fibre, Wi-Fi, SD-WAN, and VPN connectivity. * Manage operational and technical delivery More ❯
Employment Type: Full-Time
Salary: £90,000 - £110,000 per annum
Posted:

Systems Administrator IV with Security Clearance

Aberdeen Proving Ground, Maryland, United States
Caelum Research Corporation
recovery using scripting languages, such as Bash, PowerShell, Python, Ruby, and/or Perl. • Supports performance tuning, capacity planning and bottleneck resolution, administration of Microsoft Servers, building and maintaining high-availability, clustered and scalable systems, and load balancing • Manages production environments using automated install, automated build and deployment, automated configuration management, enterprise monitoring systems, server virtualization, load balancing … RAID 1-N • Experience in database design, data modeling and query execution, database replication, migration, backup and recovery • Experience performance tuning, capacity planning and bottleneck resolution, building and maintaining high-availability, clustered and scalable systems • Experience managing a production environment with the following: automated install, automated build and deployment, automated configuration management, enterprise monitoring systems, server virtualization, load More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Senior Network Administrator

Key West, Florida, United States
ManTech
Responsibilities include but are not limited to: The Senior Network Administrator leads the design, implementation, and maintenance of complex network infrastructure across multiple security domains, including classified networks, ensuring high availability and robust security They possess expert-level knowledge of advanced networking protocols, security best practices, and infrastructure technologies They manage complex network infrastructure devices, implement and enforce … security policies, configure advanced access control lists (ACL), and leverage software-defined networking (SDN) solutions They conduct advanced system troubleshooting, implement comprehensive backup and recovery strategies, and ensure the high availability and resilience of critical network services They lead the development and maintenance of detailed network documentation, provide expert technical guidance and mentorship to Junior and Mid-Level More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Network Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Precise Placements
experience Ideally a BSc in CompSci or Networking related fields. Relevant certifications (CCNP, CCIE, AWS, Azure etc.) Duties and Responsibilities: Contribute to global Wide Area Network (WAN) operations, ensuring high availability, scalability, and performance. Participate in the deployment and optimization of Software-Defined WAN (SD-WAN) solutions to enhance network agility and cost-efficiency. Design, implement, and maintain … meet organizational goals. Provide technical guidance and mentorship to junior team members. Maintain network uptime, security, and compliance with industry standards (e.g., ISO 27001, NIST). Configure monitoring tools, high availability setups, and disaster recovery plans for network infrastructure. Maintain detailed documentation of network configurations, policies, and procedures. Partner with IT leadership, application teams, and external vendors to More ❯
Employment Type: Full-Time
Salary: £70,000 - £80,000 per annum
Posted:

Senior Network Engineer with Security Clearance

Springfield, Virginia, United States
MKS2
have 5 or more years' of demonstrated understanding and hands-on experience in the following networking concepts: network traffic flow analysis, network management, network topology design, network security, performance, high availability, load balancing, and fault tolerant architectures. Shall have 5 or more years' of experience working with secure encrypted networking devices (i.e., Taclanes and Border Guards) and communication … requirement. Personal experience working with Cisco Adaptive Security Appliances (ASA) and like products is a plus. Demonstrated experience in networking engineering enterprise solutions to directly support a variety of high availability, fault tolerant, disaster recovery, and continuity of operations (COOP) scenarios. Demonstrated experience operating and maintaining the Riverbed Steelhead appliance product line to provide WAN Optimization and Deep More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:
High Availability
10th Percentile
£41,325
25th Percentile
£52,500
Median
£67,500
75th Percentile
£86,250
90th Percentile
£107,375