SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliability Engineer (SRE) on a … permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as … ReliabilityEngineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £70,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliability Engineer (SRE) on a … permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as … ReliabilityEngineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principals, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software … ensure high availability and performance Collaborate with product engineering teams to design/build fit-for-purpose and observable software Required Skills and Experience: Proven experience in a SRE/DevOps/Platform Engineering role and having previously worked with.Net, Azure and C# technologies. Proficiency in C# language – alongside knowledge of scripting languages like Bash, Python or PowerShell … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more. More ❯
Job Title: SiteReliability Engineer - 12 Month Contract - No Sponsorship Available Job Description Our client are hiring a SiteReliabilityEngineering to join their SiteReliabilityEngineering group whose main objective is to ensure their services are consistently reliable for their customers. As part of adynamicand self-organizing team, you will have … and databases, providing the resources to build, deploy, and operate their own software. Responsibilities Support applications in production, including responding to incidents and conducting post-incident reviews. Apply observability engineering to proactively detect system degradation, understand system state, and quickly diagnose issues. Investigate and resolve production issues effectively. Build automation tools to reduce operational toil and enhance developer productivity. … Scope technical projects and break them down into user stories and tasks within an engineering team. Directly contribute to the design and coding of software systems. Contribute to building systems that are secure, reliable, scalable, and extensible. Make informed technical decisions with input from teammates and engage in technical discussions with other engineering teams. Build and maintain CI More ❯
Cambridge, Cambridgeshire, East Anglia, United Kingdom
RedTech Recruitment
are already renowned as having game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for someone to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge Salary: £32,000 £60,000 + excellent benefits (£32,000 for a new Graduate) Requirements for SiteReliability … of a role involving lots of problem solving identifying the root causes of issues. Good logical reasoning Responsibilities for SiteReliability Engineer Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will be diagnosing issues within complex systems and identifying … emailing (if this email address has been removed by the job-board, full details for contact are available on our website). Keywords- SiteReliability Engineer/SRE/DevOps/Software Engineering/Software Development/Engineering/Physics/Astrophysics/Python/Computer science/Cloud/Mathematics/AWS/Azure/ More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
SiteReliability Engineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site … and remotely, supporting mission-critical systems where performance and reliability are paramount. The SiteReliability Engineer Role: We are seeking a SiteReliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the SiteReliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and evaluate More ❯
# SiteReliabilityEngineering ProfessionalJob Req ID: 51667Posting Date: 1 Oct 2025Function: EngineeringUnit: BusinessLocation: Snowhill, Birmingham, United KingdomSalary: Competitive + great benefitsJob Req ID: 51667Posting Date: 1st October 2025Function: EngineeringLocation: Birmingham, 3 days in officeSalary: Competitive + excellent benefits Why this job matters This role is key to supporting HMRC by delivering and enhancing network tooling, automation … CI/CD pipelines, automation frameworks, and Infrastructure as Code (IaC) solutions using Cloud, Git, and container technologies. • Maintaining and enhancing monitoring tools to maximise system uptime, performance, and reliability, while actively contributing to improvements in SiteReliabilityEngineering practices and workflows. • Installing, configuring (including networking), administering, troubleshooting, and managing virtual machines, along with integrating tools More ❯
SiteReliability Engineer Central London (3 days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package We are working with an exciting technology company that are looking to bring in a SiteReliability Engineer to help scale their cloud infrastructure and DevOps capability. Theyve built a high-performing … engineering team and are now investing further into the platform side of things as demand grows. Think modern, cloud-native architecture, and a real emphasis on automation, scalability, and developer enablement. Youll have the autonomy to make technical decisions and help shape how platform engineering is done as the team continues to scale. Tech stack AWS (Core services … days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package Click APPLY NOW to be considered for this position! AWS, SRE, Cloud, Kubernetes, EKS, Terraform, CI/CD, Automation etc. More ❯
SiteReliability Engineer Central London (3 days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package We are working with an exciting technology company that are looking to bring in a SiteReliability Engineer to help scale their cloud infrastructure and DevOps capability. They’ve built a high … performing engineering team and are now investing further into the platform side of things as demand grows. Think modern, cloud-native architecture, and a real emphasis on automation, scalability, and developer enablement. You’ll have the autonomy to make technical decisions and help shape how platform engineering is done as the team continues to scale. Tech stack AWS … days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package Click APPLY NOW to be considered for this position! AWS, SRE, Cloud, Kubernetes, EKS, Terraform, CI/CD, Automation etc. More ❯
Join our team as a MongoDB SiteReliability Engineer, where you'll be at the forefront of designing and maintaining robust, high-performance systems that power critical financial services. In this dynamic and fast-paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure, and scalable. You'll work on automating operations, enhancing system … If you're motivated by solving, multi-layered problems and building systems that perform reliably amid shifting priorities, we encourage you to apply. To be successful as a MongoDB SiteReliability Engineer, you should have experience with: Working in SiteReliabilityEngineering, DevOps, and MongoDB administration in financial services. Using MongoDB features like replicaset, sharding More ❯
Security Cleared SiteReliability Engineer - Contract Outside IR35 - 3 months+ -Hybrid We are seeking a Lead Operations/SiteReliability Engineer to take ownership of day-to-day operations across a legacy technology estate. The role will focus on maintaining service stability, ensuring operational readiness, and leading the response to incidents and outages. The Lead Operations …/SiteReliability Engineer will play a pivotal role during the transition phase by embedding operational standards, improving monitoring and support processes, and enabling knowledge transfer into ongoing service delivery teams. Key Responsibilities: Lead daily operational support of legacy systems, ensuring availability, performance, and resilience. Manage incident, problem, and change activities in line with ITIL and enterprise service More ❯
Must hold UKIC DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
Must hold UKIC DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a SiteReliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability-while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
TixTrack
the West End, we are passionate about our work and driven to innovate with our customers to solve current and future challenges with new solutions. We are transforming our engineering culture to align with an Empowered Teams model, embracing Continuous Delivery and focusing on building capabilities to drive higher quality, reliability, and speed. About The Role The Staff … modern frontend frameworks (React, Angular, or Vue) to build scalable, high-performance applications. It also requires a deep appreciation for the principles of SiteReliabilityEngineering (SRE) and DevOps to ensure our systems are stable, observable, and rapidly deployable. Beyond technical execution, this position is a force multiplier; you will mentor engineers, assist the team in adopting … to the decoupling of our monolithic application, designing and implementing new service-oriented architectures that enable independent deployment and team autonomy. Elevate Quality and Reliability: Introduce and champion SRE principles and automated quality gates into our development lifecycle to ensure high system reliability. Be a Force Multiplier for the UK Team: Serve as the most senior technical leader and More ❯
Core, BCG X, and CT worldwide. This role is also accountable for embedding security within DevSecOps practices, enforcing automation at scale, and applying SiteReliabilityEngineering (SRE) principles across all security services. The role requires strong partnership with ISRM, with a focus on balancing and prioritizing security requirements, automation opportunities, user experience needs, and broader business outcomes. … that support modern work scenarios, remote access, zero-trust networking, and AI/ML workloads. Leverage automation frameworks and IaC to improve scalability and reduce manual intervention. Operational Security, SRE & Assurance: Ensure security platforms are resilient, continuously monitored, and designed for 24x7 support and incident response readiness. Embed security telemetry and observability to enable proactive threat detection and automated response. … Apply SRE principles to improve reliability, performance, and maintainability of security services. Lead platform health, patching automation, and vulnerability remediation workflows. Define service level objectives (SLOs) and key performance indicators (KPIs) for all security services. Compliance, Governance & Risk Management: Ensure alignment with global compliance requirements such as ISO 27001, NIST, SOC 2, GDPR, and others. Partner with governance, legal More ❯
enabling innovation and agility across BCG Core, BCG X, and CT worldwide. This role is accountable for embedding security within DevSecOps practices, applying SiteReliabilityEngineering (SRE) principles across all security services, and aligning with privacy, compliance, and business leaders to maintain trust and regulatory compliance. Key Responsibilities: Strategic Leadership & Transformation: Define and execute a unified security … remote access, zero-trust networking, and protection of sensitive data in AI/ML workloads. Leverage automation frameworks and IaC to improve scalability and reduce manual intervention. Operational Security, SRE & Assurance: Ensure security platforms are resilient, continuously monitored, and designed for 24x7 support and incident response readiness. Embed security telemetry and observability to enable proactive threat detection and automated response. … Apply SRE principles to improve reliability, performance, and maintainability of security services. Define service level objectives (SLOs) and key performance indicators (KPIs) for all security services. Compliance, Governance & Risk Management: Ensure alignment with global compliance requirements such as ISO 27001, NIST, SOC 2, GDPR, and others. Partner with governance, legal, and ISRM teams to implement enforceable policies and standards More ❯
Bath, Somerset, South West, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
SiteReliability Engineer Work From Home (WFH) + Quarterly Visits to Bath Full Time, Initial 12 Month Fixed Term Contract Salary DOE ( ?????? £45k - ? £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented SiteReliability Engineer to join their IT Operations … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and More ❯
SiteReliability Engineer We are excited to be able to offer this SiteReliability Engineer role working for an industry-leading software company in Cambridge. This company has won several awards and is pioneering in their machine learning technology. Founded 8 years ago, with a team of 150 brilliant engineers, they are already renowned as having More ❯
Junior DevOps Engineer (SRE) £417/day Inside IR-35 Wokingham - 2 days per week 2 month contract with the possibility to extend beyond Role Description: Work with Agile teams to automate infrastructure, ensure platform reliability, and support continuous delivery. Use modern cloud-native tools and CI/CD platforms to maintain secure, observable, and high-performing environments. Relevant … Skills: Experience working in Agile environments Strong understanding of SiteReliabilityEngineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS More ❯
Reading, Berkshire, United Kingdom Hybrid / WFH Options
Octopus Computer Associates
SiteReliability Engineers (OpenShift is a MUST) - SC CLEARED - Remote and Wokingham ( Reading) (OpenShift, Prometheus, Grafana, Loki, Tempo, Kustomize, Helm, CICd, CI Cd, CI-CD, CI/CD, Tekton, ArgoCD) One of our Blue Chip Clients is urgently looking for several SiteReliability … Engineers. You must have very strong experience with OpenShift. CANDIDATES MUST HOLD ACTIVE SC CLEARANCE. 60% remote and 40% onsite in Reading MUST BE PAYE THROUGH UMBRELLA Role Description: SRE in CNI: OpenShift Experience is a MUST Collaborate with Agile teams to automate deployment, monitoring, and infrastructure management. Ensure platform and business application reliability and performance against strict SLAs … and ArgoCD. Support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ). Conduct security reviews and implement controls aligned with national infrastructure standards. Mentor junior engineers and promote SRE best practices. Collaborate with vendors and IT teams for incident resolution and platform improvements. Required Skills: Strong communication skills (written and verbal). Experience in remote team collaboration. Deep expertise More ❯
Security Cleared SiteReliability Engineer - Contract Outside IR35 - 3 months+ -Hybrid We are seeking a Lead Operations/SiteReliability Engineer to take ownership of day-to-day operations across a legacy technology estate. The role will focus on maintaining service stability, ensuring operational readiness, and leading the response to incidents and outages click apply for More ❯
Wokingham, Berkshire, South East, United Kingdom Hybrid / WFH Options
Sanderson Government and Defence
Position: Senior & Principal SiteReliability Engineer Location: Wokingham/Home-Based Salary: £500-£590 p.d inside IR35 (Depending on seniority) Security Clearance: SC Clearance Required We're looking for a sharp-minded SiteReliability Engineer to join our cloud-native mission in Azure. If you thrive in Agile teams, live for automation, and know your way … around observability stacks and CI/CD pipelines - this is your playground. What you'll be doing: Automating deployment, monitoring & infrastructure with precision Owning platform reliability, performance & SLAs Building IaC with Helm/… Kustomize Managing CI/CD pipelines via Tekton & ArgoCD Supporting OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Implementing security controls aligned with national infra standards Mentoring junior engineers & championing SRE best practices Collaborating across dev teams, vendors & IT to drive operational excellence If you're interested in hearing more, please do send your profile to Reasonable Adjustments: Respect and equality More ❯
software-defined networking principles. Embed zero-trust principles and user-centric design into all remote connectivity services. Align remote connectivity architecture with broader enterprise network, security, and cloud strategies. Engineering & Operations: Lead the engineering, deployment, and lifecycle management of remote access solutions such as Cisco AnyConnect, Zscaler, and other mainstream VPN … platforms. Drive automation of remote access provisioning, policy enforcement, and configuration management through Infrastructure as Code (IaC) and zero-touch deployment practices. Apply SiteReliabilityEngineering (SRE) principles to improve performance, availability, and troubleshooting. Establish observability practices across all access points with real-time metrics, logs, and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and … segmentation, and endpoint-based access control. Proven ability to scale remote connectivity solutions to tens of thousands of users and devices. Experience with IaC, network automation, observability tooling, and SRE methodologies. Preferred Qualifications: Certifications such as CCNP, CCIE, PCNSE, Zscaler Certified, or equivalent. Familiarity with secure hybrid work and cloud networking models. Background in network performance optimization, user-centric design More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Inspire People
team at the heart of the global economy! The Department for Business and Trade ('DBT') and Inspire People are partnering together to bring you an exciting opportunity for Senior SiteReliability Engineers to join a team that ensures DBT's digital services work as users expect, working with development teams giving them the tools for their job, including … service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute to the growth of the SRE function. Technologies you will work with include AWS, Azure, Terraform/CloudFormation, Docker, ECS, ECR, ElasticSearch, Python/Django, PostgreSQL (RDS), Redis, and more. Essential Criteria - Cloud experience with AWS … application will be assessed against these requirements before being progressed to DBT. Shortlisted candidates will then be invited to interview and technical exercise. If you are a DevOps Engineer, SRE, or Systems Administrator looking to make a real impact across government digital services, apply today or contact Keesha Paulsen at Inspire People in confidence for more information. More ❯
Darlington, County Durham, North East, United Kingdom Hybrid / WFH Options
Inspire People
team at the heart of the global economy! The Department for Business and Trade ('DBT') and Inspire People are partnering together to bring you an exciting opportunity for Senior SiteReliability Engineers to join a team that ensures DBT's digital services work as users expect, working with development teams giving them the tools for their job, including … service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute to the growth of the SRE function. Technologies you will work with include AWS, Azure, Terraform/CloudFormation, Docker, ECS, ECR, ElasticSearch, Python/Django, PostgreSQL (RDS), Redis, and more. Essential Criteria - Cloud experience with AWS … application will be assessed against these requirements before being progressed to DBT. Shortlisted candidates will then be invited to interview and technical exercise. If you are a DevOps Engineer, SRE, or Systems Administrator looking to make a real impact across government digital services, apply today or contact Keesha Paulsen at Inspire People in confidence for more information. More ❯