SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliability Engineer (SRE) on a … permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as … ReliabilityEngineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
Lead SiteReliability Engineer Are you ready to take your career to the next level in a role that’s critical to the reliability, scalability, and performance of cutting-edge systems? We’re on the lookout for a Lead SiteReliability Engineer to bring innovation, leadership, and technical excellence to our growing team. What You … and uphold best security practices. Contribute to quality systems through deviation management, CAPA follow-up, and root cause investigations. What We’re Looking For: 5+ years of experience in SiteReliabilityEngineering or a related field. Hands-on experience with Biosafety and GMP environments. Strong foundation in Lean Six Sigma principles. Proven problem-solving skills with a … knack for performance tuning. Effective communicator and team player. Formal education in an engineering-related discipline. More ❯
Lead SiteReliability Engineer Are you ready to take your career to the next level in a role that’s critical to the reliability, scalability, and performance of cutting-edge systems? We’re on the lookout for a Lead SiteReliability Engineer to bring innovation, leadership, and technical excellence to our growing team. What You … and uphold best security practices. Contribute to quality systems through deviation management, CAPA follow-up, and root cause investigations. What We’re Looking For: 5+ years of experience in SiteReliabilityEngineering or a related field. Hands-on experience with Biosafety and GMP environments. Strong foundation in Lean Six Sigma principles. Proven problem-solving skills with a … knack for performance tuning. Effective communicator and team player. Formal education in an engineering-related discipline. More ❯
london, south east england, united kingdom Hybrid / WFH Options
X4 Technology
Role: SiteReliability Engineer Domain: Energy trading Project: Algorithmic derivatives trading platform Day rate: Circa £800/d (inside IR35) Location: London (hybrid - 3 days per week) Contract: 6 months initial (multi-year scope) X4 Technology are partnered with a global energy trading client, offering the opportunity for a Contract SiteReliability Engineer to join a … small high-performing team working on an algorithmic derivatives trading platform . You’ll work closely with product and engineering teams to ensure end-to-end workflows run securely, efficiently and compliantly (MiFIDII, RTS6...). You'll also optimise connectivity to global futures exchanges, integrate market data (Bloomberg, Refinitiv...) and troubleshoot FIX connections, EMS/OMS platforms, network protocols … and real-time data systems. Responsibilities for the Contract SiteReliability Engineer (Algorithmic Trading) Deploy applications following best practices and manage vendor relationships Monitor and troubleshoot systems proactively, owning reliability improvements Work closely with trading teams to understand the full trade lifecycle Develop and maintain technical solutions (Python, PowerShell, C#, and SQL) Requirements for the Contract SiteMore ❯
slough, south east england, united kingdom Hybrid / WFH Options
X4 Technology
Role: SiteReliability Engineer Domain: Energy trading Project: Algorithmic derivatives trading platform Day rate: Circa £800/d (inside IR35) Location: London (hybrid - 3 days per week) Contract: 6 months initial (multi-year scope) X4 Technology are partnered with a global energy trading client, offering the opportunity for a Contract SiteReliability Engineer to join a … small high-performing team working on an algorithmic derivatives trading platform . You’ll work closely with product and engineering teams to ensure end-to-end workflows run securely, efficiently and compliantly (MiFIDII, RTS6...). You'll also optimise connectivity to global futures exchanges, integrate market data (Bloomberg, Refinitiv...) and troubleshoot FIX connections, EMS/OMS platforms, network protocols … and real-time data systems. Responsibilities for the Contract SiteReliability Engineer (Algorithmic Trading) Deploy applications following best practices and manage vendor relationships Monitor and troubleshoot systems proactively, owning reliability improvements Work closely with trading teams to understand the full trade lifecycle Develop and maintain technical solutions (Python, PowerShell, C#, and SQL) Requirements for the Contract SiteMore ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
X4 Technology
Role: SiteReliability Engineer Domain: Energy trading Project: Algorithmic derivatives trading platform Day rate: Circa £800/d (inside IR35) Location: London (hybrid - 3 days per week) Contract: 6 months initial (multi-year scope) X4 Technology are partnered with a global energy trading client, offering the opportunity for a Contract SiteReliability Engineer to join a … small high-performing team working on an algorithmic derivatives trading platform . You’ll work closely with product and engineering teams to ensure end-to-end workflows run securely, efficiently and compliantly (MiFIDII, RTS6...). You'll also optimise connectivity to global futures exchanges, integrate market data (Bloomberg, Refinitiv...) and troubleshoot FIX connections, EMS/OMS platforms, network protocols … and real-time data systems. Responsibilities for the Contract SiteReliability Engineer (Algorithmic Trading) Deploy applications following best practices and manage vendor relationships Monitor and troubleshoot systems proactively, owning reliability improvements Work closely with trading teams to understand the full trade lifecycle Develop and maintain technical solutions (Python, PowerShell, C#, and SQL) Requirements for the Contract SiteMore ❯
SiteReliability Engineer | Contract | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) - Up to £650 per day (Inside IR35) - 2 days per week onsite in London I'm working with a leading media and technology client that's building next-generation digital platforms used by millions across the UK. They're looking for an experienced SiteReliability Engineer to join their growing team and help drive automation, reliability, and performance across complex systems. What you'll do Collaborate with cross-functional teams to design and deliver reliable, scalable, customer-focused solutions Automate and enhance software deployments and delivery pipelines Support both on-prem and cloud infrastructure (mainly AWS, with some GCP exposure) Work … Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation If you're a SiteReliability Engineer who enjoys working at scale, loves automation, and wants to make an impact on highly visible digital products, I'd love to chat. - Apply or drop More ❯
SiteReliability Engineer | Contract | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) - Up to £650 per day (Inside IR35) - 2 days per week onsite in London I'm working with a leading media and technology client that's building next-generation digital platforms used by millions across the UK. They're looking for an experienced SiteReliability Engineer to join their growing team and help drive automation, reliability, and performance across complex systems. What you'll do Collaborate with cross-functional teams to design and deliver reliable, scalable, customer-focused solutions Automate and enhance software deployments and delivery pipelines Support both on-prem and cloud infrastructure (mainly AWS, with some GCP exposure) Work … Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation If you're a SiteReliability Engineer who enjoys working at scale, loves automation, and wants to make an impact on highly visible digital products, I'd love to chat. - Apply or drop More ❯
SiteReliability Engineer (SRE) - eDV Cleared Location: London (On-site) Salary: Up to £75,000 + Clearance Bonus + Company Bonus Clearance: eDV (Enhanced Developed Vetting) required Are you an experienced SiteReliability Engineer (SRE) with active eDV Clearance Do you want to work on mission-critical systems that directly support UK National Security Join … brightest minds in the industry, ensuring the reliability, scalability and performance of complex, high-assurance systems that protect the nation. The Role: As a key member of the SRE team, you'll design, build and maintain reliable infrastructure and automation solutions to keep vital services running smoothly. You'll drive continuous improvement across monitoring, deployment, and incident response for … performance bonus . Opportunity to work on high-impact, national security projects . Career development within one of the UK's most respected secure consultancies. If you're an SRE with eDV clearance looking to make a real impact in a secure and rewarding environment, we'd love to hear from you. Apply now or reach out directly to Dominic More ❯
Ubuntu | Redhat | RHEL | Docker | Docker Swarm | Linux | Systems Engineer | SRE | SiteReliability Engineer | DevOps | Ansible | Python Are you looking for an opportunity that's giving back to society and genuinely changing the world? A role that you can leave the day proud of what you're contributing to? I’ve partnered with a cutting-edge quantum technology company … at robin.shaw@opusrs.com and we can then find time to set up a call and run through all the details. Ubuntu | Redhat | RHEL | Docker | Docker Swarm | Linux | Systems Engineer | SRE | SiteReliability Engineer | DevOps | Ansible | Python More ❯
london, south east england, united kingdom Hybrid / WFH Options
Switch Tech Talent
Role: SiteReliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a SiteReliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a SiteReliability Engineer with the above, we want to hear from you More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Switch Tech Talent
Role: SiteReliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a SiteReliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a SiteReliability Engineer with the above, we want to hear from you More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Switch Tech Talent
Role: SiteReliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a SiteReliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a SiteReliability Engineer with the above, we want to hear from you More ❯
Job Summary This role is to design, build, and scale enterprise cloud platforms with a strong focus on automation, reliability, and developer experience. As part of the Cloud Infrastructure & DevOps team, you will build multi-cloud infrastructure that powers hundreds of production services, including critical Salesforce DevOps pipelines. You’ll partner closely with development, security, and operations teams to … Drive infrastructure compliance, DevSecOps, and policy-as-code practices. What we expect of you Minimum 5 years of experience in Platform Engineering, SiteReliabilityEngineering (SRE), or DevOps roles supporting cloud-native enterprise environments Proficient in Microsoft Azure and AWS platforms with hands-on experience in Kubernetes (AKS/EKS), Helm charts, and service mesh technologies … or HashiCorp Terraform Associate are advantageous Strong interpersonal skills including clear communication, collaboration across teams, adaptability in fast-paced environments, and a proactive mindset with a focus on reliability, performance, and developer enablement More ❯
Job Summary This role is to design, build, and scale enterprise cloud platforms with a strong focus on automation, reliability, and developer experience. As part of the Cloud Infrastructure & DevOps team, you will build multi-cloud infrastructure that powers hundreds of production services, including critical Salesforce DevOps pipelines. You’ll partner closely with development, security, and operations teams to … Drive infrastructure compliance, DevSecOps, and policy-as-code practices. What we expect of you Minimum 5 years of experience in Platform Engineering, SiteReliabilityEngineering (SRE), or DevOps roles supporting cloud-native enterprise environments Proficient in Microsoft Azure and AWS platforms with hands-on experience in Kubernetes (AKS/EKS), Helm charts, and service mesh technologies … or HashiCorp Terraform Associate are advantageous Strong interpersonal skills including clear communication, collaboration across teams, adaptability in fast-paced environments, and a proactive mindset with a focus on reliability, performance, and developer enablement More ❯
SiteReliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
london (city of london), south east england, united kingdom
Arrows
SiteReliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
SiteReliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
architectures, and serverless computing. Hands-On Implementation: Lead the hands-on deployment, configuration, and management of secure, high-performance cloud environments (e.g., AWS, Azure, GCP) for critical workloads. DevOps & SRE Leadership: Instil best practices in DevOps, GitOps, and SiteReliabilityEngineering (SRE) to ensure system reliability, scalability, and performance. Security Integration: Work hand-in-glove with … Develop our cloud service offerings, create best practices, and eventually build and lead a team of cloud engineers. Who You Are: You have 8+ years of experience in cloud engineering and architecture, with at least 2+ years in a leadership or team lead position. You are an expert in containerisation and orchestration, with profound, hands-on experience with Kubernetes More ❯
Stroud, south east england, united kingdom Hybrid / WFH Options
Ecotricity
Visibility & Observability: Help create and maintain a single source of truth for our technology ecosystem, including a central application directory, system diagrams, and standardized monitoring and alerting dashboards. Enhancing Reliability: Work to improve system reliability by helping to define key metrics, implement robust monitoring solutions, and support the development of a company-wide backup and business continuity plan. … Agile development environment. Refining tickets, estimating work and breaking user stories down into smaller tasks. What Success Looks Like (The First Year) As a founding member of the Platform Engineering team, your impact will be a key part of our department's transformation. Within your first year, we expect to see tangible progress on our core initiatives, specifically: Centralized … the entire department. Infrastructure Governance: You will be heavily involved in the effort to identify and eliminate all inefficient applications and infrastructure, improving overall security and cost management. Enhanced Reliability: You will play a key role in developing a Business Continuity Plan (BCP) playbook for the company's core applications, significantly improving our resilience. Desirable Skills (Bonus) Prior experience More ❯
Founding SiteReliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Founding SiteReliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
london (city of london), south east england, united kingdom
Maze
Founding SiteReliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
issues. Ensure adherence to SLAs and help improve operational support efficiency. Participate in on-call rotations to provide 24/7 platform coverage. Continuously optimize monitoring, alerting, and platform reliability processes. Demonstrate a "can do" attitude, with flexibility to work occasional overtime when incidents extend beyond normal working hours. Profile Required … Skills & Qualifications Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience). Proven experience in technical support, sitereliabilityengineering (SRE), or platform operations. Strong knowledge of Linux/Unix and Windows environments. Familiarity with cloud platforms (AWS, Azure, GCP). Hands-on experience with CI/CD tools (Jenkins, GitHub More ❯
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯