SiteEngineer III The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control … of all day-to-day maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control of all day-to-day … maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead the planning and execution of … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
Site Reliability Engineer Central London (3 days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package We are working with an exciting technology company that are looking to bring in a Site Reliability Engineer to help scale their cloud infrastructure and DevOps capability. Theyve built a high-performing … and CI/CD (GitHub Actions or similar) Solid scripting/Automation experience with Python, Bash or Go A good communicator who enjoys working collaboratively across product and engineering Site Reliability Engineer Central London (3 days a week in the office) £65,000 - £75,000 per annum + Bonus + Generous Benefits Package Click APPLY NOW to be More ❯
Site Reliability Engineer (SRE) – eDV Cleared Location: London (On-site) Salary: Up to £75,000 + Clearance Bonus + Company Bonus Clearance: eDV (Enhanced Developed Vetting) required Are you an experienced Site Reliability Engineer (SRE) with active eDV Clearance ? Do you want to work on mission-critical systems that directly support UK National Security ? Join More ❯
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a Site Reliability Engineer (SRE) to join their platform team and … company with a modern cloud-native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to More ❯
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a … Leading incident response, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). … Background in web applications and distributed systems. Desirable Skills: Knowledge of Microservices Architecture. Familiarity with Kanban. Experience with Puppet or Chef If you’re passionate about Site Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes More ❯
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote THIS IS AN AZURE FOCUSED ROLE, IF YOU APPLY AND DO NOT WORK EITHER SOLEY OR MAINLY ON AZURE YOU WILL NOT BE CONSIDERED. Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start Date … ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure … Leading incident response, postmortems, and continuous improvement processes. Driving cost optimisation, capacity planning, and load testing. Championing best practices in cloud security and resilience. Key Skills & Experience Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). More ❯
Role: Site Reliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a Site Reliability Engineer with the above, we want to hear from you More ❯
Site Reliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead Site Reliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and More ❯
Site Reliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead Site Reliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and More ❯
Senior Site Reliability Engineer At UnlikelyAI, we are building the future of AI: one that is reliable, accurate and transparent. Our neurosymbolic technology harnesses the power of LLMs and generative AI, and combines it with classical symbolic technology to produce hallucination-resistant artificial intelligence for high-trust applications. To support our rapidly increasing commercial momentum, we're looking … for an experienced and pragmatic site reliability engineer to join our exceptional team. This role is ideal for someone who has successfully scaled systems from prototype to production and enjoys working in cross-functional teams to champion cloud-native engineering. We are looking for someone with the experience and expertise to define, and own, our approach to building … to many classical Agile working practices, and our teams regularly change to adapt to changing projects and business priorities. What we're looking for We are looking for a Site Reliability Engineer with demonstrable experience running production systems at scale. To thrive in this role, you: Excel across the stack with deep expertise in backend development on AWS More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
AWS GCP SRE Site Reliability Engineer Terraform Cloudformation ECS ELK Elasticsearch Logstash Kabana Cloudwatch Grafana Windows Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … some growth and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to or apply! AWS GCP SRE Site Reliability Engineer Terraform Cloudformation ECS ELK Elasticsearch Logstash Kabana Cloudwatch Grafana Windows Observability More ❯
East London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
Site Reliability Engineer Team Lead – Leadership, Azure, SolarWinds, SLI/SLO, Infrastructure, Risk, Incident Management, Monitoring, Automation – Financial Services – Up to £110,000 Base + Bonus My client, a leading Private and Commercial Bank is seeking an experienced SRE Lead to join their London based team on a permanent basis. In this role, you will define and evolve More ❯
Overview Site Reliability Engineer, Region Services Job ID: AWS EMEA SARL (UK Branch) Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future generations? AWS builds and … you'll be part of a world-class team in a dynamic environment that has the entrepreneurial feel of a start-up. This is an opportunity to operate and engineer systems on a massive scale, and to gain world class experience in cloud computing. You'll be surrounded by people who are passionate about cloud computing, believe that first … Build and operate distributed systems Design and build the tools and utilities that are part of the AWS fleet running our internal services Key job responsibilities The Systems Development engineer will be a key member of a new team pioneering automated build and deployment of Windows based services. The team is adopting a code-first and hands off CI More ❯
Site Reliability Engineer | Trading Platform, Systematic Hedge Fund | £300k+ Our client is a $30bn AUM systematic hedge fund focused on HFT and Start-Up equities. They are 13% up in 2025 and have been the number 1 performing quant fund in Europe since 2021. As part of their aggressive growth plans, they are looking for a pragmatic and More ❯
Site Reliability Engineer | Trading Platform, Systematic Hedge Fund | £300k+ Our client is a $30bn AUM systematic hedge fund focused on HFT and Start-Up equities. They are 13% up in 2025 and have been the number 1 performing quant fund in Europe since 2021. As part of their aggressive growth plans, they are looking for a pragmatic and More ❯
london (city of london), south east england, united kingdom
Hunter Bond
Site Reliability Engineer – Tier-One Quantitative Fund | LDN (Hybrid) Salary: £130,000 + Bonus/Perks An elite, tech-driven trading firm widely regarded as one of the most selective and innovative in the space is expanding its high-performance Application Support team in London. This is not your typical support role. You’ll be joining a small More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Motive Group
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry More ❯
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry More ❯
london, south east england, united kingdom Hybrid / WFH Options
Motive Group
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry More ❯