slough, south east england, united kingdom Hybrid / WFH Options
X4 Technology
Role: SiteReliability Engineer Domain: Energy trading Project: Algorithmic derivatives trading platform Day rate: Circa £800/d (inside IR35) Location: London (hybrid - 3 days per week) Contract: 6 months initial (multi-year scope) X4 Technology are partnered with a global energy trading client, offering the opportunity for a Contract SiteReliability Engineer to join a … small high-performing team working on an algorithmic derivatives trading platform . You’ll work closely with product and engineering teams to ensure end-to-end workflows run securely, efficiently and compliantly (MiFIDII, RTS6...). You'll also optimise connectivity to global futures exchanges, integrate market data (Bloomberg, Refinitiv...) and troubleshoot FIX connections, EMS/OMS platforms, network protocols … and real-time data systems. Responsibilities for the Contract SiteReliability Engineer (Algorithmic Trading) Deploy applications following best practices and manage vendor relationships Monitor and troubleshoot systems proactively, owning reliability improvements Work closely with trading teams to understand the full trade lifecycle Develop and maintain technical solutions (Python, PowerShell, C#, and SQL) Requirements for the Contract SiteMore ❯
SiteReliability Engineer | Contract | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) - Up to £650 per day (Inside IR35) - 2 days per week onsite in London I'm working with a leading media and technology client that's building next-generation digital platforms used by millions across the UK. They're looking for an experienced SiteReliability Engineer to join their growing team and help drive automation, reliability, and performance across complex systems. What you'll do Collaborate with cross-functional teams to design and deliver reliable, scalable, customer-focused solutions Automate and enhance software deployments and delivery pipelines Support both on-prem and cloud infrastructure (mainly AWS, with some GCP exposure) Work … Familiarity with Kafka, Akamai or Fastly, and databases like MySQL or MongoDB Excellent problem-solving and communication skills Comfortable participating in an on-call rotation If you're a SiteReliability Engineer who enjoys working at scale, loves automation, and wants to make an impact on highly visible digital products, I'd love to chat. - Apply or drop More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Switch Tech Talent
Role: SiteReliability Engineer 🌍 Location: London/Hybrid (3 days a week in office) 💰 Salary: £90,000 🛠 Key Skills: AWS, IaC, Docker, Scripting As a SiteReliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless, reliable … Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, etc. If you're a SiteReliability Engineer with the above, we want to hear from you More ❯
Job Summary This role is to design, build, and scale enterprise cloud platforms with a strong focus on automation, reliability, and developer experience. As part of the Cloud Infrastructure & DevOps team, you will build multi-cloud infrastructure that powers hundreds of production services, including critical Salesforce DevOps pipelines. You’ll partner closely with development, security, and operations teams to … Drive infrastructure compliance, DevSecOps, and policy-as-code practices. What we expect of you Minimum 5 years of experience in Platform Engineering, SiteReliabilityEngineering (SRE), or DevOps roles supporting cloud-native enterprise environments Proficient in Microsoft Azure and AWS platforms with hands-on experience in Kubernetes (AKS/EKS), Helm charts, and service mesh technologies … or HashiCorp Terraform Associate are advantageous Strong interpersonal skills including clear communication, collaboration across teams, adaptability in fast-paced environments, and a proactive mindset with a focus on reliability, performance, and developer enablement More ❯
SiteReliability Engineer (Lead Level) | London | Up to £600 Inside IR35 | Hybrid (2 Days Onsite) | 6 months I’m partnered with a major media and tech company looking for a Lead SiteReliability Engineer to support and scale their Video on Demand (VOD) infrastructure. You’ll work across modern tech stacks including AWS, GCP, Cassandra, and … performance systems used by millions. What you’ll do Lead project delivery while supporting day-to-day operations and incident management Build and manage infrastructure as code to improve reliability, scalability, and performance Design and implement new architectures and best practices for infrastructure and delivery Drive automation across monitoring, CI/CD, and deployment pipelines Mentor engineers and guide … troubleshooting in live environments 💰 Up to £600 per day (Inside IR35) 📍 London | Hybrid (2 days onsite) 📅 6-month contract, with strong potential to extend If you’re an experienced SRE who enjoys taking ownership, leading technical delivery, and working on large-scale content platforms, I’d love to chat. 👉 Apply or message me if you’d like to hear more. More ❯
Founding SiteReliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in sitereliabilityengineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliability Engineer with a strong focus on Google Cloud Platform (GCP … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
quants, and developers to ensure the systems are ultra-reliable, scalable, and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. More ❯
Lead | Azure, Terraform, Kubernetes | Bank-Grade Cloud Platform Build £600-700 p/d (Outside IR35) 5-month initial contract (extensions likely) Hybrid: London (2 days/week on-site) The Role Join a newly formed Cloud Platform Engineering team building a greenfield Azure platform for a regulated banking venture. As Tech Lead , you’ll drive technical delivery … Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in SiteReliabilityEngineering or DevOps transformation Exposure to FinOps/cost optimisation Familiarity with Azure Enterprise Scale Landing Zones Accelerators Microsoft certifications (AZ-400/AZ-305 or … greenfield banking platform from day one , defining the standards, practices, and automation that will underpin a regulated enterprise cloud You’ll have genuine technical ownership, influence how a modern engineering culture takes root, and see your work directly impact a mission-critical delivery. (Option to extend or convert to permanent after initial term.) Tech Lead | Azure, Terraform, Kubernetes | Bank More ❯
with SQL and Python Data Visualisation skills with PowerBI, other Automation and Metrics knowledge handy. Proficiency with tools like Jira, Confluence, Excel, and SharePoint Familiarity with Agile, DevOps, and SiteReliabilityEngineering Excellent communication and stakeholder management skills More ❯
would like to find out more about Trustonic's culture code. A bit about the team and what you’ll deliver We are seeking a world-class SVP of Engineering to lead and strengthen the build, delivery and technical ownership of Trustonic’s Telecoms Platform. This is a mission-critical role, reporting directly to the CEO and working in … close partnership with peers across Product and Operations. You will be responsible for delivering a highly reliable, scalable, and secure platform (TP3) while embedding the engineering discipline, governance, and metrics required to support Trustonic’s next stage of growth. This role requires both strategic oversight and operational excellence, ensuring that Trustonic’s technology remains scalable, performant, reliable and aligned … of scalable architectures (multi-tenant, API-first, streaming/events, caching) and resiliency patterns (graceful degradation, bulkheads, back pressure), empowering teams to deliver robust solutions. Experience leading and governing SRE/DevOps practices at scale, including CI/CD, incident management, post-mortems, and capacity & cost management, fostering a culture of operational excellence and continuous improvement. Sponsor and champion security More ❯
Multiple Junior SRE/DevOps/Linux Engineers | Systematic Trading Firm | Up to £125,000 TC A high-growth systematic trading firm are actively looking to make a large number of junior hires across there technology function spanning SRE, DevOps, and Linux Engineering. Applicants with less than 4 years of relevant experience will be considered across multiple functions, for which … growth Highly collaborative culture – flat structure, cross-team interaction encouraged, direct access to senior leadership Benefits: Free lunch and snacks, 10% pension, life insurance, educational reimbursement, no non-compete SRE/DevOps Team Join a team delivering critical platform infrastructure across design, monitoring, automation, and tooling creation. The division is made up of several specialist sub-teams: Observability – Drive platform … GitLab Database – Improve database automation and performance using Python Reliability – Proactively identify platform weaknesses to improve reliability and stability Requirements: Up to 4 years’ experience in DevOps, SRE, or Platform Engineering Scripting in Python or Bash BSc in Computer Science, Engineering, or related field Experience with Git and Linux environments Bonus: GitLab, Docker, Ansible, Terraform, HPC More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Client Server
to work from home twice a week (please note there is an on-call requirement). About you: You have commercial experience as a Platform Engineer, DevOps Engineer or SRE within an HFT, low latency or web scraping environment You have strong bash or Python scripting skills You have multi-cloud experience (AWS and/or Azure preferred) You have More ❯
Heart Mind Talent are partnering with Verified Global to hire a Reliability & Operations Engineer based in Central London. Verified Global builds cutting-edge algorithms to flip the odds in sports betting. Every hour, millions of fans place sub-optimal bets. We’re changing that—delivering market-beating tips, insights, and data powered by our world-class in-house models. … flagship consumer platform launched in 2024 and is scaling fast, supported by industry-leading social channels with nearly two million highly engaged followers. The Role We’re hiring a Reliability & Operations Engineer to keep our products fast, accurate, and always on. You’ll sit at the heartbeat of our daily operations, monitoring systems and content pipelines, resolving issues in … moments: Join a on-call rotation and occasionally cover evenings/weekends during major sports events. What you’ll bring 1–3 years in technical operations, reliability/SRE, support engineering, or data operations. Comfortable with a scripting language (JavaScript or Python); familiarity with a strongly-typed language (Rust/Java/C/C++) is a plus. More ❯
Production Support Engineer | SiteReliability Engineer – Market Making London | £200,000 – £300,000 (base + performance bonus) | 4 days in office We’re partnering with a leading market making firm seeking a highly skilled Production Support Engineer to join their London team. This is a hands-on role at the heart of a fast-paced trading environment, blending … SQL Bring trading systems online and provide Tier 1 and Tier 2 operational support across trading sessions Streamline, automate, and enhance operational workflows and scripts to improve efficiency and reliability Collaborate closely with traders, quants, and developers to design new tools and refine existing systems Required Skills & Experience Strong Python programming skills, including data analysis with Pandas Advanced SQL More ❯
My client, a successful quantitative investment manager, is looking for a Senior DevOps engineer to join their ML ops team and to implement testing, development, automation tools, and IT infrastructure for the ML platform team and its users. They are More ❯
We are looking for an experienced Technical Lead with active eDV/UK*C/UKIC clearance to guide engineering teams in delivering high-quality, maintainable, and secure software solutions. Acting as a bridge between architecture and implementation, you will play a critical role in shaping technical direction, supporting decision-making, and enabling teams to deliver effectively in highly … practices, and fostering a culture of collaboration and continuous improvement. Key Responsibilities Provide technical leadership and direction across multiple teams to ensure aligned, high-quality delivery Bridge architecture and engineering, translating strategic objectives into robust technical solutions Hands-on involvement in coding, reviewing, and ensuring standards for maintainability and performance Mentor and coach engineers, actively supporting professional growth and … Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript/ More ❯
Eton, Berkshire, United Kingdom Hybrid / WFH Options
Sky
still. We optimise and innovate. We turn big ideas into the products, content and services millions of people love. And we do it all right here at Sky. The SRE team, is responsible for the implementation and administration of the infrastructure for the Video On Demand (VOD) content. The infrastructure consists of many technology stacks, i.e., AWS, GCP, Cassandra, Kafka … clear articulation of design choices along the way. What you'll bring Experience in solving technical issues, managing technical projects, and mentoring engineers Experience working in a multi technical SRE department, with the ability to break down complex problems and find practical solutions Strong interpersonal skills which will allow you to collaborate with stakeholders and communicate effectively with your peers … an on-call working rota for this role Team overview Global Streaming Technology Our team develops and supports market-leading video streaming services, underpinned by state-of-the-art engineering principles. We do this at huge scale: for over 50 million customers globally, spanning NBCUniversal Peacock in the US and Sky, NOW and SkyShowtime across Europe. No matter the More ❯
Windsor, Berkshire, United Kingdom Hybrid / WFH Options
Sky
still. We optimise and innovate. We turn big ideas into the products, content and services millions of people love. And we do it all right here at Sky. The SRE team, is responsible for the implementation and administration of the infrastructure for the Video On Demand (VOD) content. The infrastructure consists of many technology stacks, i.e., AWS, GCP, Cassandra, Kafka … clear articulation of design choices along the way. What you'll bring Experience in solving technical issues, managing technical projects, and mentoring engineers Experience working in a multi technical SRE department, with the ability to break down complex problems and find practical solutions Strong interpersonal skills which will allow you to collaborate with stakeholders and communicate effectively with your peers … an on-call working rota for this role Team overview Global Streaming Technology Our team develops and supports market-leading video streaming services, underpinned by state-of-the-art engineering principles. We do this at huge scale: for over 50 million customers globally, spanning NBCUniversal Peacock in the US and Sky, NOW and SkyShowtime across Europe. No matter the More ❯
Slough, Berkshire, United Kingdom Hybrid / WFH Options
Sky
still. We optimise and innovate. We turn big ideas into the products, content and services millions of people love. And we do it all right here at Sky. The SRE team, is responsible for the implementation and administration of the infrastructure for the Video On Demand (VOD) content. The infrastructure consists of many technology stacks, i.e., AWS, GCP, Cassandra, Kafka … clear articulation of design choices along the way. What you'll bring Experience in solving technical issues, managing technical projects, and mentoring engineers Experience working in a multi technical SRE department, with the ability to break down complex problems and find practical solutions Strong interpersonal skills which will allow you to collaborate with stakeholders and communicate effectively with your peers … an on-call working rota for this role Team overview Global Streaming Technology Our team develops and supports market-leading video streaming services, underpinned by state-of-the-art engineering principles. We do this at huge scale: for over 50 million customers globally, spanning NBCUniversal Peacock in the US and Sky, NOW and SkyShowtime across Europe. No matter the More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Intellect Group
systems (Kafka, RabbitMQ) for low-latency event handling. Enhance reliability, scalability, and infrastructure security through DevSecOps practices. What we’re looking for ✅ 4–6 years’ experience in DevOps, SRE, or Cloud Infrastructure Engineering . ✅ Strong background in AWS and Linux systems . ✅ Skilled in Terraform/IaC and CI/CD automation . ✅ Experience with Docker/Kubernetes More ❯
critical quant trading platform for a systematic hedge fund. This isn’t your typical ops role - they're looking for Engineers who can write code to eliminate toil, improve reliability and automate release, monitoring and recovery processes. You'll build and maintain automated tools in Python for deployment, health checks, alerts and runbooks whilst focusing on reliabilityengineering … a lean, global team here. There's plenty of scope to take ownership, modernise tooling, and influence infrastructure direction. Up to £90k + bonus Central London 5x days on site (4x after passing probation More ❯
days onsite) 🧭 Permanent | Full-time About the Organisation Our client is a global technology-led business consultancy with a mission to help clients innovate, scale, and modernise. Their London engineering hub is home to a diverse international team focused on building next-generation platforms that empower developers and drive automation at scale. They are now looking for a Platform … Engineer, Databases to join their growing Platform Engineering function and take ownership of how database services are automated, delivered, and operated. The Mission This is a hands-on and strategic role … within a fast-evolving Internal Platform initiative. You’ll design and implement database services that can be consumed on demand — secure, compliant, and self-service. Working closely with Platform, SRE, and DevOps teams, you’ll bring automation, observability, and scalability to their database layer, enabling hundreds of developers to ship faster with confidence. What You’ll Do 💾 Design, build, and More ❯
Engineer to join their team in London. You will be supporting the AWS Public Cloud infrastructure and implementation of IaC using Terraform. The role will work closely with the SRE and Engineering teams to ensure that the Cloud environment has sufficient observability and is appropriately managed. Skills and experience required: Strong technical operational skills in supporting AWS Cloud Hosted More ❯