Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Spectrum It Recruitment Limited
SiteReliability Engineer Southampton HQ Hybrid Working Cloud, SaaS, AWS, We are working alongside one of our longstanding clients in helping them recruit a SiteReliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure-as-code tools like CloudFormation or Terraform In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
SiteReliability Engineer Southampton HQ Hybrid Working Cloud, SaaS, AWS, We are working alongside one of our longstanding clients in helping them recruit a SiteReliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints Practical experience using infrastructure-as-code tools like CloudFormation or Terraform In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI Strong understanding of More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Explore Group
SiteReliability Engineer (Hybrid – London) | RegTech Innovator | AWS, Terraform, Kubernetes Location: London (Hybrid – 2-3 days in office) Are you passionate about scalable infrastructure and modern DevOps practices … Want to make a tangible impact in a fast-growing RegTech company that’s transforming how businesses navigate regulatory compliance? Join us as a SiteReliability Engineer (SRE) and help build and operate the infrastructure that powers cutting-edge compliance solutions used by global financial institutions. What You'll Do Maintain and improve our AWS-based infrastructure using … Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS , Terraform , and Kubernetes Experience with production support, incident management, and RCA practices Comfortable working in a fast-paced More ❯
Southampton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
cause analysis, and implement observability best practices (metrics, logging, tracing). Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). Lead incident response, system reliability efforts, and infrastructure scalability initiatives. Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. Contribute to evolving our security posture, including secrets management … controls, and audit logging. Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. Strong proficiency with AWS, Linux, and containerized environments (Docker, Kubernetes). Deep understanding of CI/CD best practices and hands-on experience with tools like More ❯
Reigate, England, United Kingdom Hybrid / WFH Options
Willis Towers Watson
Summary : We are seeking a SiteReliability Engineer to join our SRE team based in Reigate. The ideal candidate will have excellent communication skills, experience working with multiple stakeholders, and a track record in Azure and Observability platforms. You will be joining Insurance Consulting and Technology (ICT) at an exciting time of transformation as we work on improving … multiple greenfield workstreams in the delivery family to deliver core foundational functionality that will be used by multiple SaaS product offerings across the business. You will be with other SiteReliability and Response teams as well as with the core Applications Teams, whose responsibility is to deliver and manage business critical services that are used 24×7 by … open to flexible and hybrid working arrangements, with presence in the Reigate office up to two days per week. The Role: Collaborate with cross-functional teams to ensure the reliability, availability, and performance of our client-facing services Maintain and configure observability platforms such as Datadog Proactive monitoring of production and other environments to ensure stability, availability, security and More ❯
High Wycombe, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery. Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). Lead incident response, system reliability efforts, and infrastructure scalability initiatives. Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. Strong proficiency with AWS, Linux, and containerized environments (Docker). Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub More ❯
Woking, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery. Work closely with the engineering team to support microservices architecture, focusing on latency-sensitive and high-availability services. Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing). … Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). Lead incident response, system reliability efforts, and infrastructure scalability initiatives. Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. Strong proficiency with AWS, Linux, and containerized environments (Docker). Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub More ❯
Brighton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. ● Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery across ● Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability efforts, and infrastructure scalability initiatives. ● Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. ● Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: ● Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. ● 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. ● Strong proficiency with AWS, Linux, and containerized environments (Docker, ● Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub Actions More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. ● Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery across ● Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability efforts, and infrastructure scalability initiatives. ● Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. ● Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: ● Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. ● 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. ● Strong proficiency with AWS, Linux, and containerized environments (Docker, ● Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub Actions More ❯
Reading, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. ● Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery across ● Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability efforts, and infrastructure scalability initiatives. ● Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. ● Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: ● Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. ● 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. ● Strong proficiency with AWS, Linux, and containerized environments (Docker, ● Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub Actions More ❯
Portsmouth, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. ● Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery across ● Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability efforts, and infrastructure scalability initiatives. ● Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. ● Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: ● Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. ● 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. ● Strong proficiency with AWS, Linux, and containerized environments (Docker, ● Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub Actions More ❯
Crawley, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
infrastructure (primarily AWS) to support Klear’s distributed systems and APIs. ● Manage and optimize CI/CD pipelines, ensuring rapid and reliable code delivery across ● Work closely with the engineering team to support microservices architecture, with focus on latency-sensitive and high-availability services. ● Monitor system performance, conduct root cause analysis, and implement observability best practices (metrics, logging, tracing … . ● Harden infrastructure and deployments with infrastructure as code (Terraform/CDK/CloudFormation). ● Lead incident response, system reliability efforts, and infrastructure scalability initiatives. ● Manage messaging queues (e.g., Kafka, RabbitMQ) and optimize for low-latency event handling and throughput. ● Contribute to evolving our security posture, including secrets management, access … controls, and audit logging. Qualifications: ● Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience. ● 5+ years of professional experience in a DevOps, SRE, or Cloud Infrastructure role. ● Strong proficiency with AWS, Linux, and containerized environments (Docker, ● Deep understanding of CI/CD best practices and hands-on experience with tools like GitHub Actions More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
AWS DevOps Engineer Senior SiteReliability Engineer - Infrastructure Contract - Inside IR35 - Fulltime Location: London - Hybrid (3 days per week onsite) SC Cleared or Eligible for SC Clearance Your responsibilities: Deploy, configure, and monitor AWS services ensuring high availability, scalability, and security. Respond to and resolve infrastructure and service incidents with root cause analysis and preventive measures. Handle change More ❯
Reading, England, United Kingdom Hybrid / WFH Options
Halian
United Kingdom 2 weeks ago Reading, England, United Kingdom 5 days ago Reading, England, United Kingdom 4 days ago Senior Engineer – Data Warehouse SiteReliabilityEngineering (SRE) Reading, England, United Kingdom 4 days ago Reading, England, United Kingdom 4 days ago Reading, England, United Kingdom 2 months ago Reading, England, United Kingdom 1 week ago We’re More ❯
Crawley, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
recruiter to learn more. Base pay range Direct message the job poster from Spectrum IT Recruitment Account Manager @ Spectrum IT | Recruitment, Customer Service SiteReliabilityEngineering (SRE) Lead - Up to £110,000 + Bonus & Benefits Location: Crawley, Hybrid (1 day a week onsite) Are you ready to lead a high-performing team and play a pivotal role … in shaping the future of infrastructure and reliabilityengineering? We're looking for an experienced SiteReliabilityEngineering (SRE) Team Lead/Technical Lead who is passionate about driving operational excellence and has a clear vision for building scalable, resilient systems. In this role, you'll lead from the front; mentoring a team of DevOps … securely. What You'll Be Doing: Lead, inspire, and develop a growing team of DevOps Engineers, setting clear goals and driving team performance. Define and deliver the company's SRE strategy, aligning it with wider technical and business goals. Collaborate across engineering, product, and security teams to ensure operational excellence and seamless delivery. Architect, implement, and manage secure, scalable More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
play a critical role in ensuring system reliability, scalability, and performance across both AWS and Azure environments. This is your opportunity to lead cloud-native transformation and embed SRE best practices into engineering at scale. What you’ll be doing as their SiteReliability Engineer: You’ll be the go-to expert for designing and maintaining … CI/CD pipelines to reduce toil and accelerate deployment frequency. Build observability into everything—own monitoring, alerting, and incident response to minimize MTTR and improve system health. Champion SRE culture and reliability-focused engineering—help shape sustainable engineering practices, SLAs, SLOs, and error budgets. Contribute across the stack with flexibility in tooling—experience with Python, Go … dental insurance 25 days annual leave + bank holidays R&D and personal training budgets And much more... This is an incredibly rare chance for a seasoned, high-performing SRE to leave your mark on high-impact transformation projects in a business that’s truly committed to doing things the right way. #J-18808-Ljbffr More ❯
Maidstone, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
s leading financial institutions to streamline international payments and ensure compliance at scale - all through smart automation and modern cloud-native infrastructure. They’re looking to bring on a SiteReliability Engineer with deep experience in observability . If you’ve worked with tools like Prometheus in AWS , supported development teams with tracing and performance insights , and thrive … great next step. What You’ll Be Doing: Managing and improving observability tools like Prometheus, Grafana, and CloudWatch Helping product teams with tracing and monitoring to improve performance and reliability Defining and improving … SLIs/SLOs , automating tasks, and reducing operational noise Working with AWS (EKS, EC2, Lambda, RDS), Terraform, and CI/CD tools What They’re Looking For: Experience in SRE or DevOps roles in a production environment Strong knowledge of observability tools , especially Prometheus in AWS Experience with tracing , metrics, and logs to support development teams Skills in Python or More ❯
Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Checkatrade
journey and providing support throughout the process. You will play a key role in shaping our platform's technical direction, working with modern technologies, and ensuring high standards of reliability, security, and performance. Location: Kings Cross, London or Portsmouth. Hybrid working. Where do you fit in? We're seeking a Senior Platform Engineer with a strong background in cloud … and maintaining our infrastructure using tools like Kubernetes, Terraform, Helm, and Datadog. You will drive the adoption of infrastructure-as-code practices, implement CI/CD pipelines, and champion SRE principles to ensure platform reliability and scalability. Collaborating with cross-functional teams, you'll contribute to a seamless developer experience and play a vital role in securing and optimizing … Experience with AWS is also valuable, with a willingness to work within a GCP environment. Experience with programming languages such as Golang, Python, and JavaScript. Passion for automation, DevOps, SRE, and observability practices. Proven leadership, management skills, and excellent communication abilities. We are an equal opportunities employer committed to diversity and inclusion in the workplace. About us We're Checkatrade More ❯
Woking, England, United Kingdom Hybrid / WFH Options
Plutus
a range of customers, primarily in Transportation, but also in Industrial and Motorsport environments. By applying our expertise, we deliver real benefits and pioneer a better future . Driving Engineering Excellence This isn’t just about writing great code; it’s about advocating practices that push high-quality engineering. You’ll be in a team reflecting on what went … edge products that span various market segments including transportation, automotive and motorsport. You will work collaboratively to solve interesting and challenging problems in a cross-functional team with other engineering disciplines including Data Scientists, Simulation Engineers, Software Engineers, Test Engineers, SiteReliability Engineers, Systems Engineers and Hardware Engineers. The Connected Intelligence Software & Analytics Team is missioned to More ❯
South East London, England, United Kingdom Hybrid / WFH Options
SiSU Health UK
Help build and maintain the infrastructure powering a leading digital health platform. SiSU Health is seeking an experienced DevOps Engineer to ensure the reliability, scalability, and security of our AWS infrastructure and IoT-connected health stations. This is a critical role supporting the operational excellence and future readiness of our infrastructure as we scale across the UK and internationally. … The DevOps Engineer maintains and optimises our cloud platform and health station infrastructure and ensures platform reliability, availability, scalability, recoverability, security and compliance to relevant standards. ️ In this role, you'll: Own and optimise our AWS stack and CI/CD pipelines Maintain our Health Station platform, networks and infrastructure Improve observability, resilience, compliance, security and uptime Collaborate across … and ensure platform stability Be part of the team designing our cloud platform Work at the intersection of healthcare, infrastructure, and innovation Must haves 4+ years in DevOps/SRE Strong AWS experience and familiarity with Well Architected frameworks Familiarity with Information Security management, ISO 27001 and digital healthcare technology, compliance, standards and risk management Familiarity with Azure IOT and More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Client Server
WFH: You can work from home most of the time, meeting up with colleagues in the London office once a week. About you: You have experience in similar DevOps, SRE or Infrastructure engineering positions You have expertise with Kubernetes and Helm, having built mission critical systems in production You have strong IaC, Terraform experience You have strong CI/ More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Unitary
SRE (Unitary AI) Description The company We are a rapidly growing startup developing solutions that blend human expertise and AI agents to handle manual customer and marketplace operations tasks. Our unique approach combines the strengths of human expertise (high accuracy and nuanced decision-making) with the advantages of AI automation (speed and cost efficiency). This cutting-edge technology helps … the beginning of our journey - and we are very excited about our plans for growth over the coming year and beyond! The role We are now looking for a SiteReliability Engineer to ensure our systems run smoothly and reliably at scale. Your expertise in monitoring, observability, and system automation will help maintain the high availability and performance … such as Terraform for scalable system deployment Are familiar with MLOps practices and tools, and monitoring machine learning systems in production This role will report to the VP of Engineering and can be based anywhere within a 3-hour time zone of the UK. Benefits About us The team Unitary is a remote-first team of c. 20 people More ❯
Crawley, England, United Kingdom Hybrid / WFH Options
James Chase
Engineer with a passion for leadership and AWS innovation? We’re partnering with a high-growth technology company that is seeking a SiteReliabilityEngineering (SRE) Team Lead/Technical Lead to join their world-class engineering function. This is not your average technical leadership role — you’ll be driving strategic reliability initiatives, shaping … practices, and leading a team of talented SREs committed to automation, scalability, and operational excellence. What You’ll Be Doing Lead, coach, and grow a high-performing DevOps/SRE team. Define and execute the SRE strategy to support scalability, performance, and resilience across critical systems. Own and evolve the AWS infrastructure – think EC2, RDS, ECS, Fargate, IAM, VPC and … and Datadog. Act as a technical mentor and thought leader within both your team and the broader engineering organisation. What We’re Looking For: Proven leadership experience within SRE, DevOps, or Infrastructure teams. Hands-on mastery of AWS services and cloud-native design patterns (microservices, containers, serverless). Proficient in Ansible (Terraform knowledge is a strong advantage). Strategic More ❯
Reading, England, United Kingdom Hybrid / WFH Options
Oracle
Who are we? We are a world class team of high calibre security tool services SiteReliability Engineers. We are an inclusive and diverse team with a full spectrum of experience distributed globally. We have the resources of a large enterprise and the energy of a start-up, working on a critical greenfield software assurance project collaboratively with … our cloud and mobile engineering teams. The Software Assurance organisation has the mission to make application security and software assurance, at scale, a reality. We are a dedicated team, leveraging each other’s insights and abilities to produce cutting edge solutions to difficult problems through automation and CI/CD. Join us to grow your career and create the … scale together. #LI-DNI Work You’ll Do: Learn and shape the newest industry trends and technologies Communicate and coordinate with external teams for release management, product management and engineering requirements within a globally distributed team Design, develop, implement and operate a third-party artifact repository Evaluate and improve the security of the repository Performance tune software application security More ❯
Portsmouth, England, United Kingdom Hybrid / WFH Options
BAE Systems (New)
is deployed. You will be working in a multiple disciplined team, and require a broad range of technical and soft skills to enable the team to implement sound DevOps engineering practices and deliver value quickly and continuously. These skills are categorized into the following domains: Automation skills : Automation is a key skill domain for DevOps engineers; they need to … Principles and be able to work within a team using Agile methodology. Scrum – DevOps engineers should be an active member of the scrum team and contribute to sprint ceremonies. SRE – Should understand SRE principles and apply these to constantly improve reliability and minimize the support burden within the team. Please note that it is essential that you currently hold More ❯