be crucial in ensuring the seamless operation of our applications, DevOps, middleware, security, and infrastructure components. Key Responsibilities: Provide 24/7 technical support for cloud-based solutions, ensuring high availability and performance across various applications and infrastructure components. Design, build, and maintain infrastructure and configuration as code using tools like Ansible and Terraform. Administer Dev, Test, and More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Leonardo UK Ltd
based engineering team working within a highly secure environment, with responsibility for providing a bridge between software development, testing, infrastructure, operations and system administrators to facilitate efficient, continuous and high-quality software and environment delivery. The processes and tooling implemented by the Principal DevSecOps Engineer will enable source code and associated artefacts to be installed and configured on scalable … code and supporting artefacts using appropriate tools. As required, lead on software/application development activities. Plan and undertake installations on development, test, reference and operational environments. Deliver consistent high-quality software and environmental builds through automation. Create and maintain automation frameworks for software and/or environment provisioning and ongoing operation. Manage and control software configuration for … projects including the source repository. Implement and maintain Continuous Integration (CI) and/or build pipelines where appropriate. Understand change control and release management practices. Ensure high availability and security of systems through robust monitoring, logging, and alerting. Assist in data engineering activities (data cleansing, integration, onward data analytics). Provide application support to existing deployed services. More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
CACI Limited
protection, regulatory compliance, and alignment with industry best practices (e.g., AWS IAM, encryption, VPC, security monitoring, and auditing). • Containerisation & Orchestration: Architect and oversee containerised environments using Kubernetes, ensuring high availability, scalability, and fault tolerance for critical applications. • Event-Driven Systems: Lead a team to architect event-driven systems using Kafka, designing and managing messaging frameworks to handle … Certified Security - Specialty) preferred. • Understanding of architectural standards and frameworks, e.g. TOGAF. Due to the industries we work in, we require the successful candidate to be able to obtain high-level security clearance. To qualify for this, you must be a British citizen and have lived permanently in the UK for the last 5 years. Why work for us More ❯
the infrastructure engineers: VMware Infrastructure Design and deployment of large-scale VMware environments Configuration and management of ESXi, vCenter, vSAN, and NSX Implementation of RBAC for VMware access control High availability, disaster recovery, and backup strategies Operating Systems Deployment, configuration, and management of Linux (various distributions) Windows Server setup, including Active Directory, DNS, and Group Policy Linux repositories More ❯
Proactively monitor and report on system capacity and performance. Provide 2nd and 3rd line technical support for Linux and IBM-Power platforms. Lead and contribute to infrastructure projects, delivering high-quality solutions aligned to business needs. Ensure availability of mid-range platforms, resolving service-affecting issues as necessary. Implement best practices across Linux platforms to meet availability … participate in out-of-hours support as part of a rota (37.5-hour week). Desirable Experience: IBM Power, AIX, VIO, NIM, CMC/HMC administration. Designing and supporting high availability architectures. Experience with public cloud environments (Azure and/or AWS). Job scheduling tools such as Redwood Cronacle/RunMyJobs. Understanding of project methodologies such as More ❯
South West London, London, England, United Kingdom
Oscar Technology
evolution Build and optimise CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins) Implement robust monitoring and alerting solutions (CloudWatch, Azure Monitor, Grafana, ELK) Own incident response processes, ensuring high availability and rapid resolution Collaborate with stakeholders to communicate solutions and technical trade-offs clearly Ideal Experience: 3-5 years SRE or DevOps experience across AWS and Azure More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
into a consultancy that delivers cutting-edge, secure, and scalable technical solutions. With teams deployed both on-site and remotely, we continue to grow, driven by demand for our high-trust, high-performance services. Now, we're looking for a Site Reliability Engineer (SRE) to join our fast-growing team. Why this Site Reliability Engineer role? Our SREs … networking fundamentals Experience with AWS (EC2, RDS, S3, Lambda) Desirables: Coding in Java, Go, Python, or similar Familiarity with cross-domain security principles Previous work in service management or high-availability environments Experience building or using reliability metrics Security Matters Due to the nature of our work, any offer will be subject to passing security screening and eligibility More ❯
plus. Developing automated tools and processes for building, deployment, monitoring, testing, and maintenance of infrastructure and applications. Maintaining security, compliance, and disaster recovery policies and procedures. Optimizing infrastructure for high availability, fault tolerance, and cost efficiency. Monitoring and improving infrastructure and application performance. Troubleshooting and resolving infrastructure and application issues. Managing and mentoring junior DevOps engineers. The role … secure products using appropriate technologies and patterns. Key cloud concepts and technologies. Designing and operating highly available shared solutions. Working in agile environments using Scrum and Kanban. Engaging with high-level stakeholders internally and externally. Technologies such as GitLab, Jenkins, Kubernetes, Docker, Terraform, Packer, Vault, Serverless, Elastic Stack, Prometheus, Grafana, Artifactory, Nexus. Due to the sector's nature, applicants … should hold high-level security clearance, which requires being a British passport holder and having lived permanently in the UK for the last 5 years. CULTURE & BENEFITS: We pride ourselves on providing a dynamic, inclusive, and team-led environment. Our culture combines the intimacy of a small business with the benefits of a larger organization. We offer a comprehensive More ❯
a Candidate Account, please Sign-In before you apply. Job Description: The primary responsibilities include: To form part of a critical operations function that is responsible for the monitoring, availability and performance of production services. Respond to stakeholder requests within agreed timescales or SLO. Drive automation to reduce failures and manual tasks, thereby improving overall application performance and availability. … resolve application issues. Thorough knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker containers. Experience of system/application administration in distributed, customer-facing, high-availability and large-scale environments. Experienced and confident in at least one scripting language such as Perl, shell, Ruby or Python. Experience of tuning and optimising monitoring systems. More ❯
scenarios (including cyber-related scenarios), identifying weaknesses, and remediating issues. Performance Tuning & Optimization: Analyse system performance, identifying bottlenecks, HA concerns and ways to improve resilience. Disaster Recovery (DR) & High Availability (HA): Create and maintain disaster recovery strategies and high-availability architectures, leveraging AWS Resilience Hub and other services like Route 53, Auto Scaling, and Elastic More ❯
to join our team. This role is ideal for professionals who excel in supporting, maintaining, and optimizing critical IT infrastructure. You will play a key role in ensuring the high availability, security, and performance of our systems while troubleshooting complex issues efficiently. Additionally, you will participate in an on-call rota to provide 24/7 support for … support role, managing complex IT environments. -Certifications in VMware, Microsoft, Dell, or security technologies (highly desirable). -Strong problem-solving skills, attention to detail, and a commitment to delivering high-quality solutions. -Familiarity with Hyper-converged Infrastructure. Desired Competencies: -Knowledge of cross-domain technologies (Tiger Traps, Garrison, OpsWat, NiFi). -Familiarity with DevOps tools (GitLab, Harbor, FluxCD, Kubernetes-based More ❯
Taunton, Somerset, United Kingdom Hybrid / WFH Options
Clear IT Recruitment Limited
including Active Directory, Azure AD and other cloud-based platforms. • Plan, implement, and manage migration projects, particularly around Microsoft 365 and other cloud solutions. • Monitor system performance and ensure high availability for business-critical services. • Liaise with external vendors and service providers to resolve issues when required. • Provide mentorship and guidance to junior support staff. • Microsoft 365 - advanced More ❯
Swindon, Wiltshire, United Kingdom Hybrid / WFH Options
Thames Water Utilities Limited
performance, and drive innovation in a business delivering life's essential service every day. What you'll do as a Waste Site SCADA Systems Engineer SCADA System Support: Maintain high availability of site-based SCADA platforms (targeting 99.2% uptime or higher). Perform essential planned maintenance, including health checks, patches, and hotfixes. Deliver site SCADA system updates and … systems. Proactively identify and mitigate risks related to system performance, integrity, and resilience. Participate in the out-of-hours support roster for SCADA fault response. Respond to and resolve high-impact incidents, using technical expertise and historical data. Escalate recurring faults and contribute to root cause analysis and long-term mitigation. Respond to SCADA-related service requests (TAPS) within … be given for what is undoubtedly an incredibly rewarding experience. It's also a great opportunity to learn more about our business and meet colleagues. Disclaimer: due to the high volume of applications we receive, we may close the advert earlier than the advertised date, so we encourage you to apply as soon as possible to avoid disappointment. More ❯
Gloucester, Gloucestershire, England, United Kingdom Hybrid / WFH Options
IMT Resourcing Solutions
benefits package, and a collaborative, forward-thinking environment where your voice and expertise are valued. If you're an experienced systems professional ready to take the lead in a high-impact, well-supported role, this is your chance to step into something truly rewarding. The Opportunity This is a pivotal role within a high-performing IT function that … Intune and SCCM for endpoint onboarding, policy deployment, and software packaging. Overseeing server patching processes and infrastructure monitoring, including WSUS/AUM configuration and third-party update scheduling. Supporting high-priority incident resolution, root cause analysis, and the documentation of best practices and knowledge articles. Driving continual service improvement and supporting major infrastructure and cloud transformation projects. What We … systems administration or infrastructure engineering. Strong knowledge across Active Directory, Windows Server, Microsoft 365, Exchange Hybrid, Azure, PowerShell scripting, and endpoint management tools like Intune and SCCM. Experience supporting high-availability environments, with a focus on security, resilience, and user experience. Excellent problem-solving skills and a proactive approach to automation and process improvement. Previous experience leading or More ❯
Company: Marsh Description: We are seeking a skilled and proactive Kubernetes Platform Engineer to provide configuration and maintenance support for Kubernetes clusters in high-availability environments within the EMEA region. The candidate will work closely with development teams and manage Kubernetes clusters using tools such as kubectl, Helm, or Kustomize. Requirements include in-depth knowledge of container technologies More ❯
centre facility in Longacre. This hands-on role is ideal for a technically minded individual with experience in mechanical and electrical (M&E) systems looking to develop within a high-performance, high-availability environment. Key Responsibilities * Carry out routine inspections, preventative maintenance, and corrective repairs on data centre infrastructure. * Respond to building management system (BMS) alarms and More ❯