Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
SiteReliability Engineer (SRE) - Market leading company - Milton Keynes (Tech stack: .Net, C#, ASP.Net Core, SQL Server, PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes) Help shape the tech future of UK market leader! Backed by a major financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. … We're hiring a Lead SiteReliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and resilience. Tech stack you'll work with: .NET, C#, ASP.NET Core, SQL Server … PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes We want to hear from you if: As a SiteReliability Engineer (SRE) you've delivered scalable systems using .NET, C#, and ASP.NET Core , with real-world experience managing production workloads You've automated operations using PowerShell, Azure CLI, and Bash to reduce toil and boost efficiency More ❯
Engineering Manager, Reliability Because your new ideas are our new ways of working. Evolve, your way. Our Technology team is actively shaping the next wave of advancements. Engaged with innovative initiatives, your expertise will propel our business into the future. Collaborating with a creative team of tech enthusiasts, you'll contribute your unique skills to fuel our technological … advancements. The purpose of EngineeringReliability Manager is to enable smooth operations and to … increase reliability of live products & services. This role will facilitate resolution of incidents that block customer outcomes and embed and advocate for SiteReliabilityEngineering (SRE) principles. This role may sit across a single product group or multiple product groups within the channels domain. What You'll Get People are at the heart of what we More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Senior SiteReliability EngineerLondon - Hybrid£80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for SiteReliability Engineer to join a forward-thinking and high-growth technology company offering a Hybrid work environment, a great benefits, and opportunities for further progression!This company operates … performance. With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries.In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems.The ideal candidate … and conduct chaos engineering experiments*Monitor and maintain Kafka clusters for performance and reliability*Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering*Strong experience with AWS, EKS/Kubernetes, and Terraform*Familiar with Kafka and observability tools like Datadog or Grafana*Able to troubleshoot issues across infrastructure More ❯
Vacancy for Snr SiteReliability Engineer (SRE) at Preservica Abingdon/Remote, UK About You You have a proven track record in DevOps and software development, with a passion for creating reliable solutions to deploy software at scale and speed. You are eager to challenge the status quo, learn, and adopt new technologies. Excellent communication skills across all … Our team is small but growing, so self-motivation, organization, and the ability to multitask and prioritize are crucial. The Role Serve as a primary visionary for DevOps/SiteReliabilityEngineering across the entire technology organization. Eliminate process bottlenecks to enable frictionless, reliable, and high-velocity feature development through automation of Build, Test, Deploy, and Operate More ❯
SiteReliability Engineer - Microsoft Admin (Windows Server, IIS, MS SQL Server) Team Summary The Reliability Engineer (SRE) is a member of a cross-functional Operations & Infrastructure team responsible for running our Visa Spend Clarity for Enterprises production infrastructure and ensuring the highest levels of availability, performance, and operational excellence. What a SiteReliability Engineer does … at Visa: The SRE is responsible for finding the right way to run robust applications in our environments. In this role, you will balance engineering improvements, systems operations, and contributions to strategic initiatives. You will work closely with all members of the Technology Group to improve the reliability, availability, performance, monitoring, and operations of Visa Spend Clarity for More ❯
Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Ordnance Survey Limited
hear from you. Essential Criteria Good knowledge of Azure Cloud hosting technologies Experience with PostgreSQL databases (including PostGIS spatial extension) Good understanding of SiteReliabilityEngineering (SRE) and software engineering best ractices Experience investigating the root cause of failures to understand why they have occurred and propose/enact solutions, and work with external suppliers if More ❯
in high-impact delivery teams that support some of the worlds most well-known organisations. Youll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such … mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User Desirable Experience DevOps or SiteReliabilityEngineering (SRE) experience Automation with Terraform or similar tools Building CI/CD pipelines Experience with Docker and Kubernetes for packaging and deployment Ability to adapt to new technologies in fast-paced More ❯
Hemel Hempstead, Hertfordshire, South East, United Kingdom Hybrid / WFH Options
Eckoh PLC
regulations, as well as enhancing their brand and increasing customer satisfaction. The Role: This is an exciting opportunity to join Eckoh's Cloud Native Platform team in a senior engineering capacity. You will play a key role in designing, building, and operating a secure, highly available, PCI-compliant AWS platform that underpins Eckoh's mission-critical services. As a … senior member of the team, you will drive improvements in platform reliability, observability, and operational excellence. You will collaborate closely with development teams to enable secure, automated delivery of services while championing DevSecOps principles. This role offers the chance to shape the future of Eckoh's cloud environments, mentor other engineers, and have a real impact on system performance … Contribute to the evolution of CI/CD pipelines, IaC, and automation tooling to improve delivery speed and reliability. About you: Required skills and qualifications: 2+ years in a SiteReliabilityEngineering, DevOps, or Software Engineering role focused on platform development. 3+ years of development experience with a high-level programming language (e.g., Python, Go, JavaScript More ❯
Watford, Hertfordshire, South East, United Kingdom
La Fosse
SiteReliability Engineer (Python) £70,000 pa Hertfordshire My client, a leading entertainment group, are looking for a mid level SRE to join their platform team in their Hertfordshire office. In the role you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/Grafana More ❯
Wokingham, Berkshire, United Kingdom Hybrid / WFH Options
Nordcloud
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure SiteReliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies and … solving We encourage you to apply , even if you don't meet all of the requirements. We value your growth potential and enthusiasm! This role is required to on site in Wokingham twice a week, please do not apply if this is not possible for you. What we offer: Individual training budget and exam fees for certifications Flexible working More ❯
Milton Keynes, Buckinghamshire, South East, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
are looking for a skilled Azure DevOps Engineer with strong hands-on experience in IaC (infrastructure as code) using Terraform. As a DevOps Engineer, youll be joining the Cloud Engineering team of 6 engineers working to drive solutions for design, build and operational aspects of our infrastructure in Azure Cloud. You will work closely with the engineering and … technical support teams to proactively improve our offerings from a scalability, reliability, security and speed to market perspective. You will boast: 3+ years of experience in an operational role with a cloud platform, specifically Microsoft Azure. Extensive hands-on experience with test Automation using Terraform for Infrastructure-as-Code (IaC). Experience of Configuration-as-Code, Containerisation and Orchestration … CI/CD. Proficiency with Kubernetes, Docker and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliabilityengineering principles. Proficiency in SQL and experience working with relational databases. This is a fully remote (UK only) position within a fabulous team. Lots of flexibility, opportunity to learn (and grow) and all within a fabulous More ❯
Farnborough, England, United Kingdom Hybrid / WFH Options
Addition+
cutting-edge tech solutions into Defence and National Security — helping drive innovation in some of the UK's most sensitive environments. With a reputation for high-quality, no-nonsense engineering, they’re now hiring a Senior Platform Engineer to lead on some of their most high-impact work. Role Overview: Location: Farnborough (Hybrid) Package: Up to £550 outside IR35 … emerging tech and helping to shape the future of internal tooling and delivery Main Skills Needed: Experience with with Azure cloud is a must Strong experience in Platform or SiteReliabilityEngineering (5+ years ideally) Proven background with Kubernetes, CI/CD tooling (e.g. GitLab, Jenkins), and IaC (Terraform, Ansible) Confident with monitoring tools (e.g. Prometheus, Grafana … Git proficiency and solid repository management knowledge Comfortable leading technical decisions and collaborating with engineering teams What’s in It for You: A genuinely collaborative, no-blame engineering culture Interesting, high-stakes work in secure, meaningful environments 25 days’ holiday plus bank holidays Private medical, life cover, and a strong pension Bonus scheme and professional development support On More ❯
robust IAM and security controls Collaborate across teams to embed DevOps best practices and improve the software delivery lifecycle THE IDEAL CANDIDATE WILL HAVE: 5+ years in DevOps or SiteReliabilityEngineering, with deep expertise in Azure and Azure DevOps Hands-on experience managing production Kubernetes environments and deploying with Helm Strong skills in Terraform, CI/ More ❯
MySQL, Vue.js, and AWS. Participating in an on-call roster is required as part of this role. This is a hybrid role with 2 days in the office. Senior SRE Position We are seeking a Senior SRE with experience of working with scaled SaaS production infrastructure. The successful candidate will work as part of a team focused on sitereliability, security, and scalability as we manage our rapid growth. Monitoring the above environments and reacting to alerts and issues that may arise in day-to-day operation of their product line. They will participate in an on-call rota for priority-1 level health, security, stability, and uptime of production, staging, and development environments. More ❯
guildford, south east england, united kingdom Hybrid / WFH Options
Experis
Role Title: SiteReliability Engineer (SRE) Duration: 5 month contract Location: Wokingham (Reading). Hybrid, 60% remote and 40% onsite Clearance required: Active SC is essential Key Skills/requirements Detect and mitigate system issues to ensure high availability. Automate operational tasks to improve efficiency and reduce manual intervention. Prepare disaster recovery plans and ensure business continuity. Monitor … Implement CI/CD pipelines for seamless deployment and release management. Ensure compliance with security standards, governance policies, and regulatory requirements. Required Skills & Experience Expertise in software development and engineering for large-scale distributed systems. Strong proficiency in programming languages such as Golang, Java, or Python. Extensive experience with cloud infrastructure providers (AWS, Azure, or GCP). Deep knowledge More ❯
Wokingham, England, United Kingdom Hybrid / WFH Options
eTeam
We are a Global Recruitment specialist that provides support to the clients across EMEA, APAC, US and Canada. We have an excellent job opportunity for you. Role Title: Principal SRE Location: Wokingham (Reading). Hybrid, 60% remote and 40% onsite Duration: Until 30/01/2026 Rate: £580 per day Inside IR35 through an Umbrella Company C ontractor Must … Hold Active SC Clearance Role Description: Key Responsibilities: Lead and drive platform-first initiatives to improve scalability, reliability, and performance. Design, build, and maintain resilient infrastructure supporting distributed systems. Implement monitoring and alerting systems to ensure high availability and performance. Collaborate with engineering teams to enhance system reliability and mitigate risks. Develop and maintain CI/CD … Continuously evaluate and recommend improvements to platform infrastructure and processes. Ensure compliance with security standards, governance policies, and regulatory requirements. Required Skills & Experience Proven expertise in software development and engineering for large-scale distributed systems. Strong proficiency in programming languages such as Golang, Java, or Python. Extensive experience with cloud infrastructure providers (AWS, Azure, or GCP). Deep knowledge More ❯
DevOps or similar tooling Write and maintain Infrastructure as Code using Terraform or OpenTofu Develop scripts and automation to support infrastructure and deployment workflows - PowerShell is preferred Collaborate with engineering teams to support platform reliability and enable delivery Maintain visibility and awareness through monitoring and logging tools such as Datadog, Azure Monitor, App Insights etc. Support incident resolution … and participate in an on-call rota to help maintain service uptime Qualifications The Requirements: Essential Experience: Proven experience in a Platform, Infrastructure, or DevOps engineering role Hands-on experience operating 24x7 services in a public cloud, ideally Azure Strong experience managing infrastructure using Terraform or OpenTofu Experience managing and scaling Kubernetes clusters in production environments Proficient with CI … Flux Familiarity with Configuration as Code tools like Ansible or Puppet Exposure to large-scale distributed systems or high-volume web APIs Awareness of incident response processes and platform reliability best practices Equal Opportunity Employer At WTW, we believe difference makes us stronger. We want our workforce to reflect the different and varied markets we operate in and to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
EngineerLondon - Hybrid£80,000-£90,0000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension + Package Excellent opportunity for a Data-focused SiteReliability Engineer or a Data Platform Engineer with some DevOps principals to join a forward-thinking and high-growth Fintech company offering a hybrid work environment, a great … data systems, a solid understanding of monitoring a observability, and hands-on experience working with AWS, Apache Flink, Kafka, and Python.This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud native environment, with full ownership of incident management, architecture, and performance. The Role: *Maintaining and monitoring real-time and batch … SLIs/SLOs, and drive reliability *Woking closely with various other departments and teams to architect scalable, fault-tolerant data solutions The Person: *Experience in a data-focused SRE, Data Platform, or DevOps role*Strong knowledge of Apache Flink, Kafka, and Python in production environments*Hands-on AWS experience with AWS (Lambda, EMR, Step Functions, Redshift, etc.)*Comfortable with More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
EngineerLondon - Hybrid£80,000-£90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension + Package Excellent opportunity for a Front-end SiteReliability Engineer or Senior React Engineer with strong Kubernetes, AWS, and CI/CD to join a forward-thinking and high-growth Fintech company offering a hybrid work … role you'll take ownership of the reliability, performance, and deployment of React-based user interfaces used by clients around the world. You'll work very closely with SRE, DevOps, and Product Teams, and drive front-end observability, automate deployments, and play a key part in optimising UX at scale.The ideal candidate will be someone who combines deep React … CD Pipelines for automation testing, canary deployments, and rollbacks*Optimising front-end delivery via CDN configuration, caching strategies, and real user monitoring (Datadog/Sentry)*Collaborate with platform and SRE teams to ensure seamless delivery of React applications on Kubernetes and AWS The Person: *Strong commercial experience with React (ideally with Next.js or SSR experience)*Comfortable managing deployments using CI More ❯
for delivering open system database infrastructures with operational excellence, being the industry leader in preventive and predictive operations driven by continuous innovation through self-heal and automated ecosystems, architecting, engineering, and building the reliable, scalable, secure, available, and cost-effective database infrastructure solutions for product development partners, working closely with them to use database products to best effect, meeting … an engineer in this team, the individual will be involved in the build and run activities related to NoSQL database technology and infrastructure. The role will contribute to solution engineering and support as well as being responsible for delivering database projects, maintaining running systems and performing problem analysis and troubleshooting. The individual should be well versed in database administration … and engineering practices and principles. The individual will share the responsibility of overall database build, deployment and support. He or she will provide technical influence and subject matter expertise for the database team, responsible for solving complex problems and taking a broad perspective to identify innovative solutions. The role serves as a technical and functional specialist and must be More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
BOSS Professional Services LTD
SRE Engineer Full-time UK - Remote/Hybrid My client is a high growth ecommerce business which runs it technology stack on AWS. Due to the nature of the business the SRE Engineer will need to support sudden peaks in traffic smoothly scaling. They also host other ecommerce platform for other brands which also need supporting. As an SRE Engineer … you will maintain a scalable and reliable production environment for running software services while helping grow the customer base and product offering. For the SRE Engineer role we are seeking: Technology stack: Kubernetes, MySQL, PostgreSQL, PHP, Python, Docker, AWS Lambda, AWS, Redis, ELK, monitoring: Prometheus, Grafana or Loki You have previous experience of working within SRE capacity or experience in … Assist and support the DevOps engineers: setting up the infrastructure for microservices Work closely with rest of the DevOps and QA team to load test applications Responsibilities for the SRE Engineer include: Create sustainable systems and services through automation and uplifts Partner with development teams to improve services Gather and analyse metrics from both operating systems and applications Participate in More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Pontoon
Subject Matter Expert to join their IT Infrastructure team. This role will be key in improving performance monitoring and observability across business-critical applications. You will work closely with SiteReliability Engineers, Application Stewards, and external software vendors to ensure a reliable and efficient monitoring setup. Key Responsibilities :Build and maintain automated monitoring of key business processes using … insights Additional Skills : Ability to manage competing priorities in a fast-paced environmentStrong communication and problem-solving skillsDelivery-focused with a flexible, can-do attitudeBackground in resilience or uptime engineering is a plus Apply Now , if this is relevant to you. Please be advised if you haven't heard from us within 48 hours then unfortunately your application has More ❯
optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit, enhance street safety, and drive toward a sustainable future. Job Summary: The Cloud Fleet Engineering team at Hayden AI is at the forefront of managing and scaling our fleet of AI-powered edge devices deployed across global transportation agencies. As a senior engineer, you … and remediation efforts, learning and applying best practices. Design, build, and maintain scalable cloud services that support device observability, OTA updates, and fleet operations. Lead efforts to improve the reliability, security, and performance of multi-region AWS infrastructure using Infrastructure as Code (IaC) tools. Own CI/CD pipelines for cloud services, enabling efficient, secure, and automated delivery processes. … and operate scalable, secure APIs and microservices. Strong communication skills and a track record of leading cross-functional technical initiatives. Experience working in agile environments with modern DevOps and SRE practices. More ❯
Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Checkatrade
journey and providing support throughout the process. You will play a key role in shaping our platform's technical direction, working with modern technologies, and ensuring high standards of reliability, security, and performance. Location: Kings Cross, London or Portsmouth. Hybrid working. Where do you fit in? We're seeking a Senior Platform Engineer with a strong background in cloud … and maintaining our infrastructure using tools like Kubernetes, Terraform, Helm, and Datadog. You will drive the adoption of infrastructure-as-code practices, implement CI/CD pipelines, and champion SRE principles to ensure platform reliability and scalability. Collaborating with cross-functional teams, you'll contribute to a seamless developer experience and play a vital role in securing and optimizing … Experience with AWS is also valuable, with a willingness to work within a GCP environment. Experience with programming languages such as Golang, Python, and JavaScript. Passion for automation, DevOps, SRE, and observability practices. Proven leadership, management skills, and excellent communication abilities. We are an equal opportunities employer committed to diversity and inclusion in the workplace. About us We're Checkatrade More ❯
Cloud Platform Engineer - Hybrid/Farnborough, Circa £80k + bonus + excellent benefits SC Eligible (UK Sole Nationals only) Are you an experienced Platform Engineer with strong SRE and DevOps capabilities to join a team delivering secure, cloud-first infrastructure in the defence and security space. This is a hands-on role, owning critical platform components across CI/CD … similar) * Configuring Kubernetes clusters for secure, scalable deployments * Building automation across infrastructure provisioning and testing * Implementing monitoring and alerting (e.g., Prometheus, Grafana) * Managing repositories and version control (Git) * Driving SRE practices around performance, resilience, and supportability * Working closely with dev teams to integrate platform tooling into workflows * Supporting infrastructure security, maintainability and scalability initiatives What we're looking for * 5+ … years in platform/SRE/DevOps roles * Strong Kubernetes experience (config and deployment) * Deep CI/CD experience - Jenkins, GitLab CI/CD or similar * Skilled with infra observability tooling (Prometheus, Grafana, etc.) * Confident with Git and repo management workflows * Strong automation mindset - reducing manual intervention wherever possible * Cloud experience (AWS, Azure or GCP) * Must be a sole UK More ❯