SiteEngineer III The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control … of all day-to-day maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
The SiteEngineer III (SE) core responsibility is to perform infrastructure related activities in our Data Centers to ensure a reliable and efficient operation of the technical infrastructure. This is achieved through maintenance, testing and surveillance of all assets installed. The SiteEngineer III is responsible for the planning and control of all day-to-day … maintenance activities in cooperation with the Lead Engineer and provide leadership to the site teams, including project management for maintenance related activities. This role is critical to meet Service Level Agreements (SLAs) and deliver client satisfaction, and when applicable being part of a 7x24 shift rotation. What youll do Operations and Compliance: Lead the planning and execution of … process to guarantee smooth transitions and minimize downtime in data center operations. Responsible for contributing to the DC operating efficiency and the implementation of assigned optimization measures. Consult with site management to provide solutions and adhere to SLAs agreed in customer contracts. Ensure site overall appearance and cleanliness, escalating shortfalls when necessary. Proactively indicate and report if SLAs More ❯
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a Site Reliability Engineer (SRE) to join their platform team and … company with a modern cloud-native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to More ❯
Site Reliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead Site Reliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability Are you looking for a genuinely Remote opportunity? Somewhere you're part of something bigger, working on a global product within a close-knit SRE team? I've partnered a WebApp that provide an end to end event management for some … Grafana, ELK stack, and cost optomisation for the product as they continue scaling. Working across the glove their multi-tenanted, AWS environments requires someone who is able to reverse engineer product faults, or post incident audits to ensure future resolutions in a timely way and preventing downtime. The future of the product will be looking to improve up-time … and training and development through certifications too! If it suits, and you'd like to hear more, send over a CV to robin.shaw@opusrs.com or apply! AWS | GCP | SRE | Site Reliability Engineer | Terraform | Cloudformation | ECS | ELK | Elasticsearch | Logstash | Kabana | Cloudwatch | Grafana | Windows | Observability More ❯
Site Reliability Engineer Team Lead – Leadership, Azure, SolarWinds, SLI/SLO, Infrastructure, Risk, Incident Management, Monitoring, Automation – Financial Services – Up to £110,000 Base + Bonus My client, a leading Private and Commercial Bank is seeking an experienced SRE Lead to join their London based team on a permanent basis. In this role, you will define and evolve More ❯
Site Reliability Engineer | Trading Platform, Systematic Hedge Fund | £300k+ Our client is a $30bn AUM systematic hedge fund focused on HFT and Start-Up equities. They are 13% up in 2025 and have been the number 1 performing quant fund in Europe since 2021. As part of their aggressive growth plans, they are looking for a pragmatic and More ❯
london (city of london), south east england, united kingdom
Hunter Bond
Site Reliability Engineer – Tier-One Quantitative Fund | LDN (Hybrid) Salary: £130,000 + Bonus/Perks An elite, tech-driven trading firm widely regarded as one of the most selective and innovative in the space is expanding its high-performance Application Support team in London. This is not your typical support role. You’ll be joining a small More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Motive Group
Senior/Staff Site Reliability Engineer - Observability | London (Hybrid) If you care deeply about building and operating world-class infrastructure for AI at scale , this one’s worth your time. We’re working with a company that builds the backbone powering some of the most demanding AI workloads on the planet. Think large-scale GPU clusters, global telemetry More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
in Office] Role Overview We’re representing a global trading and digital assets firm at the forefront of high-performance technology and infrastructure innovation. The business is seeking a Site Reliability & Infrastructure Engineer to help design, automate, and scale the systems that underpin its global trading platforms. This role sits within a high-performing 11-person infrastructure team … that combines Site Reliability and Core Infrastructure responsibilities - owning everything from AWS cloud systems to on-prem deployments. The team is expanding to meet new strategic demands, including increased automation, enhanced observability, and the rollout of new colocation environments to support lower-latency trading. It’s a technically hands-on position that blends architecture, build, and operational ownership, suited … to an engineer with curiosity, precision, and a drive to constantly improve how infrastructure is built and run... Key Responsibilities Design, build, and maintain highly available infrastructure across both cloud (AWS) and on-prem environments Implement automation across the stack using Infrastructure-as-Code principles (Terraform, Ansible, or similar) Administer and optimise Kubernetes clusters across multiple regions, improving resilience More ❯
london (city of london), south east england, united kingdom
Maze
Principal Platform Engineer | Fintech | London | Up to 200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. Theyre creating the worlds first open-source, AI-native Thin Ledgerset to replace legacy infrastructure at Tier 1 banks. We're looking for a … Principal Platform Engineer to drive the infrastructure behind mission-critical systems: think active-active, five-nines uptime, and real-time observability at global scale. What You'll Do: Own platform architecture for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI More ❯
role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on site reliability, where you'll be troubleshooting, supporting traders, and interacting with multiple teams across various locations to ensure systems stay resilient and fast. Responsibilities: • Support traders and trading applications … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on site reliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location: City … of London (on-site) If the thought of joining an already globally established, but fast growing quantitative trading firm interests you, apply today More ❯
Ansible | Linux | Ubuntu | On-premise | Bare Metal | Kubernetes | Containerisation | K8s | Containers | PostgresSQL | GCP | Google Cloud | Openshift | RHEL Enjoy getting into the weeds of Containers? Working on Bare Metal Servers? Excited by the prospect of joining a scale up who're More ❯