based applications. Build and maintain deployment pipelines and configuration management for Windows workloads. Create tooling and automation around the deployment of a customer-specific Windows-based SaaS product. Ensure high availability, reliability, and scalability of Windows services. Integrate observability tooling (metrics, logs, traces) into IIS-hosted services. Harden Windows infrastructure for security, compliance, and operational best practices. Lead More ❯
City, Cardiff, United Kingdom Hybrid / WFH Options
SRT Marine Systems PLC
proper tagging of cloud resources. Implement basic IAM policies, roles, and access control best practices. Later phases to include more sophisticated test cases, building on previous phases, involving high availability and resiliency where needed in this hybrid non-prod environment. Document architecture, procedures, and deployment steps. Required Skills & Experience: Proven experience as a Cloud Engineer, DevOps Engineer More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
SRT Marine Systems PLC
proper tagging of cloud resources. Implement basic IAM policies, roles, and access control best practices. Later phases to include more sophisticated test cases, building on previous phases, involving high availability and resiliency where needed in this hybrid non-prod environment. Document architecture, procedures, and deployment steps. Required Skills & Experience: Proven experience as a Cloud Engineer, DevOps Engineer More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Astro Studios, Inc
work in isolation or as part of a larger team and able to use your own initiative rather than always relying on direction from others. Appreciate the importance of high availability and secure systems and infrastructure. Ability to communicate at all levels of an organisation with the ability to work effectively as part of a team. A keen More ❯
plus. Developing automated tools and processes for building, deployment, monitoring, testing, and maintenance of infrastructure and applications. Maintaining security, compliance, and disaster recovery policies and procedures. Optimizing infrastructure for high availability, fault tolerance, and cost efficiency. Monitoring and improving infrastructure and application performance. Troubleshooting and resolving infrastructure and application issues. Managing and mentoring junior DevOps engineers. The role … secure products using appropriate technologies and patterns. Key cloud concepts and technologies. Designing and operating highly available shared solutions. Working in agile environments using Scrum and Kanban. Engaging with high-level stakeholders internally and externally. Technologies such as GitLab, Jenkins, Kubernetes, Docker, Terraform, Packer, Vault, Serverless, Elastic Stack, Prometheus, Grafana, Artifactory, Nexus. Due to the sector's nature, applicants … should hold high-level security clearance, which requires being a British passport holder and having lived permanently in the UK for the last 5 years. CULTURE & BENEFITS: We pride ourselves on providing a dynamic, inclusive, and team-led environment. Our culture combines the intimacy of a small business with the benefits of a larger organization. We offer a comprehensive More ❯
New Milton, Hampshire, United Kingdom Hybrid / WFH Options
Appello Careline Limited
resilient, cost-effective, and aligned with business needs - all underpinned by ITIL best practices. As the Head of Service Delivery, you will oversee key functions including Service Level Management, Availability, and Capacity Management. You'll work closely with Technical Operations, Change Management, and third-party suppliers to maintain high service standards and drive continuous improvement. IT Operations Management … service issues. Drive root cause analysis and preventative actions. Service Level Management: Define and manage SLAs and OLAs. Monitor performance and lead service reviews. Identify and implement service improvements. Availability Management: Ensure services meet availability targets. Analyse and reduce service interruptions. Identify and mitigate single points of failure. Capacity Management: Forecast future capacity needs. Monitor system performance and … field Certifications: ITIL Foundation Certification is required (minimum requirement); additional ITIL Intermediate or Expert-level certifications are preferred. Experience managing teams and third-party suppliers. Deep understanding of SLA, availability, and capacity principles. Strong analytical and decision-making skills. Familiarity with monitoring and reporting tools. Advantageous but not essential: Experience in health tech or critical service delivery. Additional ITIL More ❯
suite. They are looking to take their business to the next level through various acquisitions and require someone who can help them achieve this. It is a hands-off, high-impact role which is ideal for someone looking to play a key part in leading design and delivery in a cross-functional team. Responsibilities Lead Technical Design & Architecture Help … design secure & high-availability systems with compliance in mind. Guide backend design for high-volume transactional systems. Collaborate Across Product & Business Work with Product Managers to understand requirements and create technical plans. Influence the roadmap with a clear understanding of feasibility and scale. Work with stakeholders at all levels to communicate technical risk. Drive Delivery & Execution Lead More ❯
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
Very Group
services that your squad are responsible for. You'll help shape best practices in software engineering, then both work to and coach other Squad members into driving and ensuring high-quality solutions. Key responsibilities: Work with and sometimes lead other Engineers, Application Specialists in your Squad/Performance Unit, Business Analysts, Product Owners and Architects to envision, design and … your Squad/Tribe and across The Very Group. About you. You're someone who thrives in a hands-on technical role and has a passion for building scalable, high-quality solutions. You've worked in a similar capacity and have a strong track record of enabling teams to deliver robust, secure, and observable systems. You're a natural … clear direction, coaching, mentoring and development support. Be passionate about code quality, testing and other engineering best practices. Significant experience of building software applications that have security, reliability, scalability, high availability and concurrency built in from the outset. A strong collaborator and communicator. Proven ability to influence and negotiate with Technical Stakeholders from all Software Engineering levels, to More ❯
Didcot, Oxfordshire, England, United Kingdom Hybrid / WFH Options
Corriculo Ltd
Engineer/AWS Cloud Engineer you will work in a project-focused capacity, playing a critical role in building, deploying, configuring, securing, and automating the infrastructure that supports a high-availability platform, working predominantly within AWS, using both Terraform and Ansible. Additionally, whether an area in which you already have experience, or one in which you would like … Linux-based systems. The Company The Cloud Infrastructure Engineer/AWS Cloud Engineer will be joining a company whose patient-monitoring platform has been scientifically proven to deliver safer, high-quality and efficient patient care, at an exciting time of growth and expansion into new markets. As a well-funded scale-up, they offer an entrepreneurial team spirit, where More ❯
East Hagbourne, Oxfordshire, United Kingdom Hybrid / WFH Options
Corriculo Ltd
Engineer/AWS Cloud Engineer you will work in a project-focused capacity, playing a critical role in building, deploying, configuring, securing, and automating the infrastructure that supports a high-availability platform, working predominantly within AWS, using both Terraform and Ansible. Additionally, whether an area in which you already have experience, or one in which you would like … Linux-based systems. The Company The Cloud Infrastructure Engineer/AWS Cloud Engineer will be joining a company whose patient-monitoring platform has been scientifically proven to deliver safer, high-quality and efficient patient care, at an exciting time of growth and expansion into new markets. As a well-funded scale-up, they offer an entrepreneurial team spirit, where More ❯
/TPU) and significantly improving computation efficiency. Collaborate cross-functionally with ML researchers, bioinformaticians, and scientists to translate research needs into scalable engineering solutions. Ensure system reliability, robustness, and high availability, proactively implementing comprehensive monitoring, logging, and alerting solutions. Champion infrastructure-as-code (IaC) practices, promoting clarity, reproducibility, security, and auditability. Expected Growth In 1 month you will … created a prototype model architecture, demonstrated significant algorithmic improvements, and contributed to scaling methods for large-scale data ingestion and training. In 6 months you'll have developed a high-performance version of a foundation model, implemented key algorithmic optimizations that boost scalability and throughput, and published internal benchmarks demonstrating significant research impact. Why Join Us: Meaningful Impact: Contribute More ❯
data warehouse. A broad understanding of financial products across all asset classes would be highly beneficial. Your primary focus will be the development of all server-side logic, ensuring high performance and efficient interaction with the data warehouse and the calculation services. Thus, a commitment to collaborative problem solving, sophisticated design, and product quality is essential. A competent level … a daily basis. Design, build, and maintain efficient, reusable, and reliable Python code Rapidly prototyping solutions to quickly prove/disprove new concepts Design and implementation of low-latency, high-availability, and performant applications Work directly with Risk analysts and Product owners to turn functional requirements into efficient and maintainable code Ensure the best possible performance, quality, and More ❯
data warehouse. A broad understanding of financial products across all asset classes would be highly beneficial. Your primary focus will be the development of all server-side logic, ensuring high performance and efficient interaction with the data warehouse and the calculation services. Thus, a commitment to collaborative problem solving, sophisticated design, and product quality is essential. A competent level … a daily basis. · Design, build, and maintain efficient, reusable, and reliable Python code · Rapidly prototyping solutions to quickly prove/disprove new concepts · Design and implementation of low-latency, high-availability, and performant applications · Work directly with Risk analysts and Product owners to turn functional requirements into efficient and maintainable code · Ensure the best possible performance, quality, and More ❯
Have a deep technical understanding and to support engineers in the build and development of POCs and have a hands-on approach to designing and implementing solutions which have high availability, achieve performance metrics and are scalable. Helping to shape, evolve and document the architecture that underpins the existing platforms and services and lead their on-going transition … organisational skills. A flexible and Agile way of working within a fast-paced and ever-changing environment. Attention to detail with a pragmatic and enthusiastic attitude to work. Demonstrates a high level of energy, enthusiasm, and tenacity to achieve a positive result. Is a self-starter, overcomes obstacles and is driven to succeed. Works well under pressure to meet deadlines More ❯
a deep technical understanding and to support engineers in the build and development of POCs and have a hands-on approach to designing and implementing solutions which have high availability, achieve performance metrics and are scalable. Helping to shape, evolve and document the architecture that underpins the existing platforms and services and lead their on-going transition … organisational skills. A flexible and Agile way of working within a fast-paced and ever-changing environment. Attention to detail with a pragmatic and enthusiastic attitude to work. Demonstrates a high level of energy, enthusiasm, and tenacity to achieve a positive result. Is a self-starter, overcomes obstacles and is driven to succeed. Works well under pressure to meet deadlines More ❯
Telford, Shropshire, West Midlands, United Kingdom
LA International Computer Consultants Ltd
SAS environments. Collaborate with data engineers and DevOps teams to integrate SAS with Oracle, GitLab CI/CD pipelines, and other enterprise platforms. Monitor and optimise platform performance, ensuring high availability and compliance with security standards. Provide technical support and troubleshooting for SAS users and developers. Automate environment provisioning and data transfers using approved EA integration patterns. Participate … tools - Airflow Strong documentation and communication skills. Ability to work collaboratively across multidisciplinary teams. Due to the nature and urgency of this post, candidates holding or who have held high level security clearance in the past are most welcome to apply. Please note successful applicants will be required to be security cleared prior to appointment which can take up More ❯
experience in building scalable microservices, modern API architectures, and cloud-native platforms using Google Cloud (GCP). You'll be working in an agile environment as part of a high-performing backend team, contributing to the design and deployment of key applications. As a Node.js Developer, you will: • Design, develop, and maintain scalable and performant backend microservices using Node.js … technologies such as Istio • Experience with ESB/SOA, GraphQL, Gradle/Maven Nice to Have: • Knowledge of Swagger, OpenAPI 3.0, SOAP, MQ, JSON • Experience with performance optimisation and high-availability architecture • Exposure to CI/CD automation and testing frameworks Node.js Developer Key Benefits: • Competitive daily rate: £475 Inside IR35 • Hybrid working across 5 UK locations • Agile More ❯
Northampton, West Northamptonshire, Northamptonshire, United Kingdom
Howdens Joinery
part-time hours (minimum 30 hours per week). What you will be doing as a Cloud Platform Engineer: - You will be responsible for creating, maintaining and deploying secure, highly available, fault-tolerant systems at scale via on-premise and cloud-native architectures for development, testing and production environments. - Developing and maintaining solutions for operational administration, system/data … and work successfully in cross-functional teams Desirable skills include: - Azure Active Directory (Identity Management, Service Principals, Security Groups, Role Management, MFA) - API Management - Cloud Monitoring - CDN (Akamai preferably) - High Availability (Zones and Regions, Service Recovery) What we can offer you: - Competitive salary and annual company bonus - Excellent pension scheme (company contribution of up to 12%) - 25 days More ❯
Job Description: The primary responsibilities include: To form part of a critical operations function that is responsible for the monitoring, availability and performance of production services. Responding to stakeholder requests within agreed timescales or SLO. Drive automation to reduce failures and manual tasks, thereby improving overall application performance and availability. … resolve application issues. Thorough knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker containers. Experience of system/application administration in distributed, customer-facing, high-availability and large-scale environments. Experienced and confident in at least one scripting language such as Perl, shell, Ruby or Python. Experience of tuning and optimising monitoring systems. More ❯
chain organisations. Tasks In this role, you will: Design and evolve our multi-cloud platform (Azure and AWS) using infrastructure-as-code and GitOps best practices. Build and maintain high-throughput Kubernetes clusters (EKS) with secure networking, service mesh, policy enforcement, and self-service tooling for application teams. Craft robust CI/CD pipelines that deliver containerised services from … Strengths in the following areas: Strong Language Proficiency: advanced in Python, Go, or Java with clean code principles, test coverage, and performance awareness. Service Architecture: experience building and operating high-availability, distributed systems (microservices and event-driven). API Design: REST and gRPC, with versioning, throttling, auth, and schema validation (OpenAPI, Pydantic, Protobuf). Datastores: deep knowledge of More ❯
Haywards Heath, Sussex, United Kingdom Hybrid / WFH Options
First Central Services
resolution of incidents and technical support for end-users, ensuring minimal disruption to business operations. You'll collaborate with cross-functional teams to investigate and remediate issues affecting platform availability and performance. You'll track and report the status of ongoing incidents, keeping stakeholders informed of progress. You'll work with third-party providers and vendors as an extension … best practices. You'll troubleshoot and resolve platform-related incidents in a timely manner, ensuring minimal disruption to business operations. You'll implement and manage monitoring solutions to ensure high availability, reliability, and performance of core platforms. You'll create and maintain detailed documentation for platform configurations, standards, and procedures. You'll share knowledge across teams to facilitate More ❯
Salford, Manchester, United Kingdom Hybrid / WFH Options
First Central Services
resolution of incidents and technical support for end-users, ensuring minimal disruption to business operations. You'll collaborate with cross-functional teams to investigate and remediate issues affecting platform availability and performance. You'll track and report the status of ongoing incidents, keeping stakeholders informed of progress. You'll work with third-party providers and vendors as an extension … best practices. You'll troubleshoot and resolve platform-related incidents in a timely manner, ensuring minimal disruption to business operations. You'll implement and manage monitoring solutions to ensure high availability, reliability, and performance of core platforms. You'll create and maintain detailed documentation for platform configurations, standards, and procedures. You'll share knowledge across teams to facilitate More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
First Central Services
resolution of incidents and technical support for end-users, ensuring minimal disruption to business operations. You'll collaborate with cross-functional teams to investigate and remediate issues affecting platform availability and performance. You'll track and report the status of ongoing incidents, keeping stakeholders informed of progress. You'll work with third-party providers and vendors as an extension … best practices. You'll troubleshoot and resolve platform-related incidents in a timely manner, ensuring minimal disruption to business operations. You'll implement and manage monitoring solutions to ensure high availability, reliability, and performance of core platforms. You'll create and maintain detailed documentation for platform configurations, standards, and procedures. You'll share knowledge across teams to facilitate More ❯
and best practices for IT infrastructure and operations. Lead cloud migration strategies and optimize Azure & AWS environments. Cloud & Infrastructure Management: Architect and implement Azure & AWS cloud solutions, ensuring high availability and performance. Oversee Unix & Windows server environments, ensuring optimal configuration and security. Design and implement backup and disaster recovery strategies to safeguard critical data. Network & Security Develop More ❯