SiteReliabilityEngineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a SiteReliabilityEngineer (SRE) to join their platform … performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incident response , post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What Were Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations … native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | SiteReliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | SiteReliabilityEngineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … growth and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | SiteReliabilityEngineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
SiteReliabilityEngineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliabilityEngineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
in Office] Role Overview We’re representing a global trading and digital assets firm at the forefront of high-performance technology and infrastructure innovation. The business is seeking a SiteReliability & Infrastructure Engineer to help design, automate, and scale the systems that underpin its global trading platforms. This role sits within a high-performing 11-person infrastructure … team that combines SiteReliability and Core Infrastructure responsibilities - owning everything from AWS cloud systems to on-prem deployments. The team is expanding to meet new strategic demands, including increased automation, enhanced observability, and the rollout of new colocation environments to support lower-latency trading. It’s a technically hands-on position that blends architecture, build, and operational … low-latency engineering practices into the infrastructure Optimise Linux systems for performance and reliability, including kernel tuning and networking configuration Partner with development and platform teams to embed SRE best practices, reducing manual toil through automation and observability Drive improvements in monitoring, alerting, and log collection pipelines to enhance system insight and uptime Participate in architecture and design reviews More ❯
SiteReliabilityEngineer Team Lead – Leadership, Azure, SolarWinds, SLI …/SLO, Infrastructure, Risk, Incident Management, Monitoring, Automation – Financial Services – Up to £110,000 Base + Bonus My client, a leading Private and Commercial Bank is seeking an experienced SRE Lead to join their London based team on a permanent basis. In this role, you will define and evolve the organisation’s SRE practice by establishing principles, objectives, and measurable … consistently meet reliability and performance goals while driving automation to eliminate manual effort and improve efficiency. Experience & Skills To Be Successful: Proven experience leading and managing technical or SRE teams within Financial Services Strong Hands on Experience with Solar Winds Currently Leading a Small-Medium size team Hands-on expertise with cloud platforms (Azure) and infrastructure-as-code tools More ❯
SiteReliabilityEngineer | Trading Platform, Systematic Hedge Fund | £300k+ Our client is a $30bn AUM systematic hedge fund focused on HFT and Start-Up equities. They are 13% up in … and have been the number 1 performing quant fund in Europe since 2021. As part of their aggressive growth plans, they are looking for a pragmatic and commercially oriented SRE to design, implement and maintain scalable and reliable systems. Tech Stack: Python/C++, Terraform, Prometheus, Kubernetes, Cloud Computing The core function of the role is to monitor and maintain More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Motive Group
Senior/Staff SiteReliabilityEngineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on … the planet. Think large-scale GPU clusters, global telemetry systems, and distributed training environments used by leading research and enterprise teams. They’re looking for a Senior or Staff SRE with deep experience in observability at massive scale - someone who’s tuned Prometheus/Mimir, Loki, or Tempo clusters beyond 100M+ series or 10TB/day logs, and who thrives … in highly technical, fast-moving environments. You’ll be working on: Designing and scaling observability for globally distributed GPU infrastructure Building automation that cuts operational toil and improves reliability Partnering with platform and infrastructure teams to deliver true visibility across complex AI systems If you’ve built or operated telemetry stacks for large-scale, GPU-heavy, or multi-tenant More ❯
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on sitereliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on sitereliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
SiteReliabilityEngineer – Tier-One Quantitative Fund | LDN (Hybrid) Salary: £130,000 + Bonus/Perks An elite, tech-driven trading firm widely regarded as one of the most selective and innovative in the space is expanding its high-performance Application Support team in London. This is not your typical support role. You’ll be joining a … engineering and automation work. Key Responsibilities: Supporting ultra-low-latency trading systems across global markets Troubleshooting and resolving time-critical production issues Building automation and tooling to enhance system reliability and efficiency Collaborating closely with traders, developers, and infrastructure engineers Required Experience: Around 5 years of experience in technical support or engineering in a trading or finance environment Strong More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … competes heavily with their sizeable competitors. To be considered for this opening you’ll need at least 7-8 years’ experience, encompassing the following: Recent experience in a Lead SRE capacity, coaching/mentoring other engineers Hands-On Cloud experience with AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
Excited by the prospect of joining a scale up who're holding onto their closeknit feel? I've partnered a scaling and backed SaaS on their search for an SRE to work on RKE Kubernetes that scale their customers product. Working on advisory service as well as AI driven products to enable global leaders to make the work life place More ❯