SiteReliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a SiteReliability Engineer (SRE) to join their platform team and … performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incident response , post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What Were Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations … engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | SiteReliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | SiteReliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … growth and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | SiteReliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
SiteReliability Engineer (SRE) – eDV Cleared Location: London (On-site) Salary: Up to £75,000 + Clearance Bonus + Company Bonus Clearance: eDV (Enhanced Developed Vetting) required Are you an experienced SiteReliability Engineer (SRE) with active eDV Clearance ? Do you want to work on mission-critical systems that directly support UK National Security ? Join … brightest minds in the industry, ensuring the reliability, scalability and performance of complex, high-assurance systems that protect the nation. The Role: As a key member of the SRE team, you’ll design, build and maintain reliable infrastructure and automation solutions to keep vital services running smoothly. You’ll drive continuous improvement across monitoring, deployment, and incident response for … performance bonus . Opportunity to work on high-impact, national security projects . Career development within one of the UK’s most respected secure consultancies. If you’re an SRE with eDV clearance looking to make a real impact in a secure and rewarding environment, we’d love to hear from you. 📩 Apply now or reach out directly to Dominic More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
in Office] Role Overview We’re representing a global trading and digital assets firm at the forefront of high-performance technology and infrastructure innovation. The business is seeking a SiteReliability & Infrastructure Engineer to help design, automate, and scale the systems that underpin its global trading platforms. This role sits within a high-performing 11-person infrastructure team … that combines SiteReliability and Core Infrastructure responsibilities - owning everything from AWS cloud systems to on-prem deployments. The team is expanding to meet new strategic demands, including increased automation, enhanced observability, and the rollout of new colocation environments to support lower-latency trading. It’s a technically hands-on position that blends architecture, build, and operational ownership … latency engineering practices into the infrastructure Optimise Linux systems for performance and reliability, including kernel tuning and networking configuration Partner with development and platform teams to embed SRE best practices, reducing manual toil through automation and observability Drive improvements in monitoring, alerting, and log collection pipelines to enhance system insight and uptime Participate in architecture and design reviews More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Sanderson
UK Engineering and SRE Lead Location: London (Hybrid) Salary: Up to £85,000 + benefits Shape the Future of Fintech Infrastructure in the UK If you’re passionate about designing cloud-native platforms, enabling high-throughput, low-latency systems, and ensuring flawless reliability, this is your opportunity to make a tangible impact in the UK fintech ecosystem. About … the Role As the UK Engineering and SRE Lead (Backend) , you’ll own the performance, scalability, and operational resilience of our UK backend systems — including payment gateways, settlement, fraud monitoring, datalake, and core banking services. You’ll combine deep backend engineering expertise with a strong SiteReliabilityEngineering mindset, leading a hybrid team of software … DevOps, and SRE engineers. This role blends architecture, leadership, and hands-on delivery — you’ll design systems that scale and perform flawlessly, automate everything you can, and guide your team through both innovation and incident. What You’ll Do Lead end-to-end development and deployment of mission-critical backend systems, targeting 99.999% availability. Architect and deliver high-throughput, cloud More ❯
do at CMC Markets, and staying true to that has been pivotal to our success. CMC Markets is seeking an experienced and proactive SiteReliabilityEngineering (SRE) Manager to establish and lead a new SRE function within the IT Production department. This is a key leadership role responsible for defining the SRE strategy, implementing best practices, and … resilience across the trading platforms Ensure new systems are aligned with best practices Drive improvements and alignment in observability and monitoring tools, improving MTTD and MTTR Produce analysis on SRE function performance Provide guidance, recommendations and hands-on support to teams, promoting SRE best practices Develop and maintain a roadmap for continuous improvement of support and observability Maintain personal/… role Read and comply with CMC policies and procedures as they relate to your employment Complete all mandatory compliance training KEY SKILLS AND EXPERIENCE 2 years experience in a SRE function or similar in hybrid cloud/on prem environment 7 years experience in IT operational roles working with highly reliable systems Experience in modern development methodologies and languages Proficiency More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Switch Tech Talent
Role: SiteReliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a SiteReliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a SiteReliability Engineer with the above, we want to hear from you More ❯
SiteReliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
Senior & Lead SiteReliability Engineers (SRE) – SC Cleared 🔒 Must hold current SC Clearance 📅 3-month initial contract 💷 Up to £650/day (higher rates considered for exceptional candidates) 📍 Hybrid: 2 days/week in London office, 3 days remote 🕒 Start: ASAP We’re hiring 2 x Senior SREs and 1 x Lead SRE to join a high-impact … functional teams. You’ll help design, build, and deploy cloud-native solutions, contributing to the core cloud platform and enabling secure, scalable infrastructure. Key Skills & Experience Strong DevOps and SRE background Proven Azure migration experience (CAF, WAF, AzureRM) Infrastructure as Code (Terraform) CI/CD pipelines using GitHub Actions Azure services: AKS, Front Door, SQL, Load Balancers Networking: firewalls, routing More ❯
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on sitereliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on sitereliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson Mccade
Professional-level Google Cloud certification required. Proven expertise in Google Cloud Platform services (Compute Engine, App Engine, GKE, Cloud Storage, IAM, VPC, etc.). Strong experience in architecting and engineering cloud-based solutions that meet both functional and non-functional requirements. Solid understanding of cloud networking and security (e.g. firewalls, encryption, identity management). Experience implementing foundational cloud platforms … communication and stakeholder management skills across technical and non-technical audiences. Desirable Experience with multi-cloud environments (AWS, Azure, hybrid). Familiarity with sitereliabilityengineering (SRE) principles and production systems support. Experience driving innovation, technical transformation, and improvements in ways of working. What's on Offer Competitive base salary between £75,000 and £90,000 . More ❯
SiteReliability Engineer Team Lead – Leadership, Azure, SolarWinds, SLI …/SLO, Infrastructure, Risk, Incident Management, Monitoring, Automation – Financial Services – Up to £110,000 Base + Bonus My client, a leading Private and Commercial Bank is seeking an experienced SRE Lead to join their London based team on a permanent basis. In this role, you will define and evolve the organisation’s SRE practice by establishing principles, objectives, and measurable … consistently meet reliability and performance goals while driving automation to eliminate manual effort and improve efficiency. Experience & Skills To Be Successful: Proven experience leading and managing technical or SRE teams within Financial Services Strong Hands on Experience with Solar Winds Currently Leading a Small-Medium size team Hands-on expertise with cloud platforms (Azure) and infrastructure-as-code tools More ❯
SiteReliability Engineer | Trading Platform, Systematic Hedge Fund | £300k+ Our client is a $30bn AUM systematic hedge fund focused on HFT and Start-Up equities. They are 13% up in … and have been the number 1 performing quant fund in Europe since 2021. As part of their aggressive growth plans, they are looking for a pragmatic and commercially oriented SRE to design, implement and maintain scalable and reliable systems. Tech Stack: Python/C++, Terraform, Prometheus, Kubernetes, Cloud Computing The core function of the role is to monitor and maintain More ❯
City of London, London, United Kingdom Hybrid / WFH Options
ECS
SC Cleared - Senior SRE Initial 6-month Contract Role Hybrid working, twice per week in London office £450 - £500, Inside IR35 Please note, current and active SC Clearance is mandatory to be considered We're partnering with a Global IT Provider who are seeking a highly skilled and experienced Senior SiteReliability Engineers with good proficiency in cloud More ❯
Tech Lead | Azure, Terraform, Kubernetes | Bank-Grade Cloud Platform Build Salary: £100,000 - £115,000 Hybrid: London (2 days/week on-site) The Role Join a newly formed Cloud Platform Engineering team building a greenfield Azure platform for a regulated banking venture. As Tech Lead , you’ll drive technical delivery across a team of ten engineers, staying … Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in SiteReliabilityEngineering or DevOps transformation Exposure to FinOps/cost optimisation Familiarity with Azure Enterprise Scale Landing Zones Accelerators Microsoft certifications (AZ-400/AZ-305 or … greenfield banking platform from day one , defining the standards, practices, and automation that will underpin a regulated enterprise cloud You’ll have genuine technical ownership, influence how a modern engineering culture takes root, and see your work directly impact a mission-critical delivery. Tech Lead | Azure, Terraform, Kubernetes | Bank-Grade Cloud Platform Build More ❯
Lead | Azure, Terraform, Kubernetes | Bank-Grade Cloud Platform Build £600-700 p/d (Outside IR35) 5-month initial contract (extensions likely) Hybrid: London (2 days/week on-site) The Role Join a newly formed Cloud Platform Engineering team building a greenfield Azure platform for a regulated banking venture. As Tech Lead , you’ll drive technical delivery … Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in SiteReliabilityEngineering or DevOps transformation Exposure to FinOps/cost optimisation Familiarity with Azure Enterprise Scale Landing Zones Accelerators Microsoft certifications (AZ-400/AZ-305 or … greenfield banking platform from day one , defining the standards, practices, and automation that will underpin a regulated enterprise cloud You’ll have genuine technical ownership, influence how a modern engineering culture takes root, and see your work directly impact a mission-critical delivery. (Option to extend or convert to permanent after initial term.) Tech Lead | Azure, Terraform, Kubernetes | Bank More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Motive Group
Senior/Staff SiteReliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on … the planet. Think large-scale GPU clusters, global telemetry systems, and distributed training environments used by leading research and enterprise teams. They’re looking for a Senior or Staff SRE with deep experience in observability at massive scale - someone who’s tuned Prometheus/Mimir, Loki, or Tempo clusters beyond 100M+ series or 10TB/day logs, and who thrives … in highly technical, fast-moving environments. You’ll be working on: Designing and scaling observability for globally distributed GPU infrastructure Building automation that cuts operational toil and improves reliability Partnering with platform and infrastructure teams to deliver true visibility across complex AI systems If you’ve built or operated telemetry stacks for large-scale, GPU-heavy, or multi-tenant More ❯
with SQL and Python Data Visualisation skills with PowerBI, other Automation and Metrics knowledge handy. Proficiency with tools like Jira, Confluence, Excel, and SharePoint Familiarity with Agile, DevOps, and SiteReliabilityEngineering Excellent communication and stakeholder management skills More ❯
engineers to run their applications at the bleeding edge. Required Skills & Experience: Experience supporting mission critical systems and high performance applications Minimum of 5 years working in trade support, sitereliabilityengineering Bachelor’s degree in STEM or related field Familiarity with trading platforms and financial markets Thrives in high-pressure situations while working alongside traders, developers … and other engineering teams Strong problem-solving skills and the ability to troubleshoot technical issues under pressure Knowledge of Linux/Unix environments Experience with scripting languages such as Python and Bash for automation tasks Ability to devise complex SQL database queries and updates Basic networking knowledge, including multicast, TCP/IP, DNS, DHCP and common network troubleshooting tools More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst leading … competes heavily with their sizeable competitors. To be considered for this opening you’ll need at least 7-8 years’ experience, encompassing the following: Recent experience in a Lead SRE capacity, coaching/mentoring other engineers Hands-On Cloud experience with AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
a multitude of great technologies. They are heading increasingly towards containerisation/Kubernetes. Kafka is the key platform you'll be building. Required Experience: Kafka Devops/Platform/SRE specialisation Scripting (python/powershell/bash) Kubernetes, Docker Some on-prem exposure including Linux & Windows Beneficial: Financial Services Grafana, Splunk, ELK, OpenShift More ❯
out private cloud technical solutions. Define an API-driven approach to infrastructure. Develop tools and operators to enforce compliance, standardise environments, and eliminate manual tasks. Collaborate with InfoSec and SRE teams to embed security into the software delivery lifecycle. Implement control gates and security infrastructure (e.g. secrets management, encryption at rest/in transit). Add low-latency capabilities to More ❯
Excited by the prospect of joining a scale up who're holding onto their closeknit feel? I've partnered a scaling and backed SaaS on their search for an SRE to work on RKE Kubernetes that scale their customers product. Working on advisory service as well as AI driven products to enable global leaders to make the work life place More ❯