We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are: You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and deploying scalable, reliable systems. You are passionate about automation, cloud technologies, and continuous improvement. Must have: Proven track record of … systems. Quaisr is an early-stage startup backed by Crane Venture Partners (UK) and Acequia Capital (USA), along with international angel investors. We were founded on partnerships with leading engineering and research institutions including Imperial College London and The Alan Turing Institute. Our customers are early-adopter multinational organisations in the pharmaceuticals, fast-moving consumer goods, defence, and energy
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and in conjunction with their US and Australian teams. This company are a market leader in student community management software … ensure high availability and performance Collaborate with product engineering teams to design/build fit-for-purpose and observable software Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, having previously worked in a Software Engineering role in .NET and C#, Java, or a similar OO development language. Proficiency in … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer Southampton HQ Hybrid Working Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure-as-code tools like CloudFormation or Terraform In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer Southampton HQ Hybrid Working Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure-as-code tools like CloudFormation or Terraform In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of
tools/systems in a really fast-paced, dynamic team and environment. The key values that we strive towards in Technology are Ownership, Continuous Improvement Mindset, Trust and Honesty. Delivery Engineering Team: Delivery Engineering is an internal platform team focused on building and improving systems to accelerate the delivery of software and infrastructure. Our team's first target will … be enabling DevOps practices due to its direct impact on the overall processes and culture. The primary goal of Delivery Engineering is to build the "Golden Path" to production by automating and concentrating improvements around bottlenecks such as integration points and handoffs. We will work closely with Development teams to break down silos, helping to build more resilient pipelines … equipping ourselves with the right tools like operating procedures (SOPs) and architectural diagrams Improve the team skill set by increasing and sharing knowledge, thus setting the quality and precision engineering benchmark Understand the importance of and be a strong advocate for non-functionals, e.g. monitoring, alerting, logging Required Skills and Experience 3+ years of experience in DevOps-oriented roles
the next level? We have a brand-new opportunity for a bright, driven, customer-focussed professional to join our Hybrid Cloud 'Delivery' team, and work alongside our Enterprise Data Engineering consultants to accelerate and drive data engineering opportunities. The Advisory and Professional Services (A&PS) delivery team within HPE Pointnext Services is responsible for bringing thought-leadership, industry … implementation of scalable clustered Big Data solutions, with a specific focus on automated dynamic scaling, self-healing systems. Participating in the full lifecycle of data solution development, from requirements engineering through to continuous optimisation engineering and all the typical activities in between Providing technical thought-leadership and advisory on technologies and processes at the core of the data … Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, Filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages The following Technical
Description Data System Reliability Engineer (dSRE) Role Overview: A crucial role in CME's Cloud data transformation, the data SRE will be aligned to data product pods, ensuring that our data infrastructure is reliable, scalable, and efficient as the GCP data footprint expands rapidly. Accountabilities: Automate data tasks on GCP Work with data domain owners, data scientists, and other … data engineers, data scientists, and software engineers to understand data requirements, troubleshoot issues, and support data-driven initiatives. Continuously assess and improve data infrastructure and data processes to enhance reliability, efficiency, and performance. Maintain clear and up-to-date documentation related to data systems, configurations, and standard operating procedures. Qualifications: Bachelor's or Master's degree in Computer Science … Software Engineering, Data Science or related field, or equivalent practical experience Experience as a Data Site Reliability Engineer or similar role, focusing on data infrastructure management Proficiency in data technologies, such as relational databases, data warehousing, big data platforms (e.g., Hadoop, Spark), data streaming (e.g., Kafka), and cloud services (e.g., AWS, GCP, Azure) Programming skills in Python
This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage, Network and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. Requirements: Proven experience managing and optimizing a diverse infrastructure stack. Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation). Familiarity … Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like AWS CloudWatch, Azure Monitor, and Google Cloud
and reliability of our worldwide 24x7 on-premises and cloud environments. As the first line of defense, this team has ownership of operations engineering, collaborating closely with IT, SRE, Network, and Data engineering teams, and key stakeholders across business, product, and software engineering teams. We play a crucial role in maintaining systems health, responding to incidents, and … of-contact team for operational issues, providing both internal and external teams with technical support and ensuring the issue remains in custody until resolution. Collaborate with product and software engineering teams to relay operational insights and requirements. Automation, Tooling & Research: Continuously identify opportunities for optimization and present findings to technical leads and management. Research and implement improvements enhancing systems … security & access management. Solid understanding of layer 2-7 networking fundamentals and the relationship between servers & services, and the transit of their packets through network hardware. In-depth experience engineering and maintaining a private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana
is dedicated to developing robust, scalable, and secure data solutions that empower our products to deliver exceptional value to our customers. Embracing a DevOps culture, the team adheres to Site Reliability Engineering principles, including proactivity, automation, cross-functional collaboration, and data-driven decision-making. How will you contribute? Support all systems and infrastructure associated with the day2 … authorization, encryption, and auditing. Attend team's standups in support of daily activities, as well as all Agile ceremonies in support of the team goals. Work closely with software engineering teams to support database architecture, provide guidance for database design, schema optimization, and query tuning. What will you bring? Bachelor's degree in Computer Science, Engineering, or a … related field, or equivalent practical experience. 4-6 years of experience in DevOps or software engineering roles. Proficiency in at least one programming language (e.g. Python, Java, Go). Familiarity with IaC/IfC tools (e.g. Terraform). Experience with cloud platforms (AWS, Azure, or GCP) and their associated data services. Hands-on experience with continuous integration and deployment
Eastbourne, England, United Kingdom Hybrid / WFH Options
AxisOps
Industries: Fintech, Legaltech, Edutech Location: Eastbourne, UK (hybrid working) We're looking for a thoughtful, automation-driven SysOps Engineer to join our infrastructure team, as part of the wider engineering team at AxisOps. This role is central to how we build, ship, and secure systems – enabling product teams to move fast without breaking things, and delivering operational stability that … infrastructure-as-code, containerisation, and deeply understand the value of reliable systems and clear documentation, this one’s for you. Who we are: AxisOps is a consultancy-led software engineering business that designs, builds, and operates intelligent software platforms. We work with fast-moving organisations in the Fintech, Legaltech, and Edutech sectors to deliver automation, infrastructure, and compliance-ready … to navigate complexity and bring ideas to life – from design and architecture through to production and operations. Our strength lies in software delivery, supported by deep expertise in platform engineering, built on an understanding of private cloud-native infrastructure, observability, and DevSecOps. Our culture: We value sharp thinking, clear communication, and teams that look out for each other. At
integration across hybrid and multi-cloud platforms (Azure, GCP, AWS) Define architectural design patterns and best practices for containerized microservices in event-driven, distributed computing systems Review and refine engineering designs to ensure maintainability, scalability, and performance Contribute directly to the codebase, developing key components in multiple languages (Java, C#, Python, Rust) Leverage Copilot, GPT-based tools, and Agentic … platforms to accelerate development and improve SDLC processes Build custom AI agents and tools for automated code generation, testing, refactoring, and design scaffolding Collaborate with engineering teams, QA, DevOps, and product managers to deliver integrated solutions Mentor engineers in architectural thinking and AI-assisted development Ensure architectural alignment across systems with observability using Prometheus, Grafana, ELK Stack Required Skills … Strong communication skills and ability to influence technical decisions Experience with Docker, Kubernetes, and cloud/hybrid infrastructure Database experience (PostgreSQL, SQL Server, MySQL, or Oracle) Familiarity with DevOps, SRE, and Agile frameworks Additional Skills a Plus: Experience with Agentic AI frameworks like LangChain, AutoGen, or OpenAgents Strong interest in improving engineering velocity through AI augmentation AWS/Azure
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
BAE Systems (New)
a mix of disciplines, which allows us to come up with cutting edge, high quality solutions. What background we are looking for: Experience working in a similar DevOps/SRE/Infrastructure role An appreciation of Infrastructure as Code, and CI/CD tooling Scripting abilities with languages such as Shell, Bash, or Python etc A working knowledge of Linux … Digital Intelligence We are embracing Hybrid Working. This means you and your colleagues may be working in different locations, such as from home, another BAE Systems office or client site, some or all of the time, and work might be going on at different times of the day. By embracing technology, we can interact, collaborate and create together, even
City of London, London, United Kingdom Hybrid / WFH Options
Explore Group
Site Reliability Engineer (Hybrid – London) | RegTech Innovator | AWS, Terraform, Kubernetes Location: London (Hybrid – 2-3 days in office) Are you passionate about scalable infrastructure and modern DevOps practices … Want to make a tangible impact in a fast-growing RegTech company that’s transforming how businesses navigate regulatory compliance? Join us as a Site Reliability Engineer (SRE) and help build and operate the infrastructure that powers cutting-edge compliance solutions used by global financial institutions. What You'll Do: Maintain and improve our AWS-based infrastructure using … Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For: Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS, Terraform, and Kubernetes Experience with production support, incident management, and RCA practices Comfortable working in a fast-paced
live and transferable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role: As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience: We're more interested in your
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies and
Software Engineering Manager (London) About Circadia Health: Circadia Health is a growth‐stage healthcare AI company transforming senior care with predictive analytics and agentic AI workflows. We have raised $30M+ from Village Global, Headline, and executives at Zoom, Walgreens, and leading healthcare operators, and our technology now touches 40,000+ patients daily across post‐acute and home‐health settings. … As we expand our platform globally, we are building a London engineering hub to own critical infrastructure that powers our Circadia Intelligence Platform (CIP) and next‐generation IoT devices. Why This Role Exists: Our AI agents rely on a robust, secure, and highly available infrastructure spanning cloud, edge, and on‐prem deployments inside skilled‐nursing facilities. We are seeking … Contribute to infrastructure roadmap, capacity planning, and cost optimisation initiatives. Minimum Qualifications: Proficiency in at least one programming language (Python, Go, or C++). 5+ years in DevOps/SRE/Infrastructure Engineering, delivering mission‐critical systems in production. Deep expertise with AWS or Azure, Kubernetes, Docker/OCI, and Terraform. Hands‐on experience with event‐streaming platforms
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
The Investigo Group
re a hands-on cloud engineer with a passion for building scalable infrastructure and empowering those around you. You thrive in collaborative environments and enjoy mentoring others while ensuring reliability, scalability, and security across systems. You bring clarity, energy, and technical credibility to every conversation. About The Team: You'll join a collaborative and growing Cloud function embedded within … to a more modern, proactive, automation-first culture with strong leadership backing. About The Role: We're looking for a Senior Cloud Engineer to help us transform our cloud engineering capability from reactive, manual operations to a proactive, automation-first approach embedded across the entire SDLC. You'll collaborate on our infrastructure strategy, tooling decisions, and reliability posture … in delivering high-performing, resilient products What We're Looking For: Proven experience as a Senior Cloud Engineer in a modern Agile environment Deep understanding of infrastructure automation frameworks, SRE principles, and continuous delivery Hands-on skills with tools like Terraform, Ansible, Docker, Kubernetes Familiarity with cloud platforms (AWS, Azure, GCP) Programming/scripting in Python, Bash, or similar Strong
to undertake other relevant and appropriate duties as reasonably required. Travel should be expected as part of this role. Experience required: 2-3 years of experience in a DevOps, SRE, or related engineering role. Hands-on experience with at least one major cloud provider (OCI preferred). Proficiency in scripting languages (e.g., Go, Python) and Linux server scripting. Experience
are seeking a foundational member for the Cloud Infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities … ensure cost efficiency. Ensure the security and compliance of our systems, adhering to industry standards and regulations. Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement. Stay current with emerging technologies and industry trends to improve our site reliability practices. Is this you? Proven expertise in Site Reliability Engineering with at least 7 years of hands-on experience. Deep understanding of system architecture and infrastructure design for high availability and performance. Bachelor's degree in Computer Science, Engineering, or a related field. Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring. Experience with cloud platforms like AWS, Azure, or