Site Reliability Engineering Jobs in London

1 to 25 of 169 Site Reliability Engineering Jobs in London

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Stratospherec Ltd
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team works on a fully remote basis, in conjunction with their US and Australian teams. This company is a market leader in student community management software … ensure high availability and performance. Collaborate with product engineering teams to design/build fit-for-purpose and observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, having previously worked in a Software Engineering role in .NET and C#, Java, or a similar OO development language. Proficiency in … and this job is part of a large programme of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.
Employment Type: Permanent
Salary: £85,000 - £90,000 per annum, excellent benefits package

Senior Site Reliability Engineer (SRE) / Unix

London, United Kingdom
Morgan Hunt UK Limited
occasional travel to Scotland. Employment Type: 6-month contract. Rate: £550 per day, outside IR35. Role Overview: Morgan Hunt are seeking an experienced Site Reliability Engineer (SRE)/Unix Infrastructure Engineer to support the deployment, migration, and optimisation of critical infrastructure services. The role involves ensuring high availability, disaster recovery readiness, and automation-driven improvements across RHEL …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer SRE / Unix

London, South East, England, United Kingdom
Morgan Hunt Recruitment
occasional travel to Scotland. Employment Type: 6-month contract. Rate: £550 per day, outside IR35. Role Overview: Morgan Hunt are seeking an experienced Site Reliability Engineer (SRE)/Unix Infrastructure Engineer to support the deployment, migration, and optimisation of critical infrastructure services. The role involves ensuring high availability, disaster recovery readiness, and automation-driven improvements across RHEL …
Employment Type: Contractor
Rate: £550 per day

Vice President, DevOps Engineer (NE) (London)

Highgate, Greater London, UK
Hybrid / WFH Options
BlackRock, Inc
operating our infrastructure, middleware, and CI/CD systems to ensure our teams have access to the best tools available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will: Use your site reliability … internal services, improving their performance, availability, scalability, latency and efficiency. Drive technical excellence in everything we do, fostering a culture of data-driven reliability, monitoring and automation, following SRE best practices. Work alongside development teams to design and build scalable and highly available services, while establishing effective build frameworks for continuous deployment and self-service automation. Work on incident …
Employment Type: Full-time

Senior SRE

London, United Kingdom
Board Intelligence
and capable of adapting to changing customer needs. This role offers full-time working from our Central Stockholm office. The Opportunity: As a Senior Site Reliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs … be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce toil, and for responding to incidents as part of our 24/7 SRE on-call team. Reliability Engineering at Board Intelligence: The SRE team: Strives to provide the highest standards of Availability, Scalability, Performance and Security for our Software as a … work. Proactively monitors our platform and responds to incidents as part of a 24/7 rota. Key responsibilities of the role: We're looking for a great Senior SRE to be a hands-on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve: Project work. Hands-on work …
Employment Type: Permanent
Salary: GBP Annual

Senior Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Nordcloud group
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems. Proactively investigate and identify performance anomalies and …
Employment Type: Permanent
Salary: GBP Annual

Site reliability engineer

London, United Kingdom
writer.com
are seeking a foundational member for the Cloud Infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities … ensure cost efficiency. Ensure the security and compliance of our systems, adhering to industry standards and regulations. Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement. Stay current with emerging technologies and industry trends to improve our site reliability practices. Is this you? Proven expertise in Site Reliability Engineering with at least 7 years of hands-on experience. Deep understanding of system architecture and infrastructure design for high availability and performance. Bachelor's degree in Computer Science, Engineering, or a related field. Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring. Experience with cloud platforms like AWS, Azure, or …
Employment Type: Permanent
Salary: GBP Annual

DV Cleared Site Reliability / DevOps Engineer

London, United Kingdom
JAM Recruitment
live and transferrable DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role: As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience: We're more interested in your …
Employment Type: Permanent
Salary: GBP Annual

DV Cleared Site Reliability / DevOps Engineer

London, United Kingdom
JAM Recruitment Ltd
live and transferrable DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role: As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience: We're more interested in your …
Employment Type: Contract
Rate: GBP 500 - 550 Daily

DV Cleared Site Reliability / DevOps Engineer

South West London, London, United Kingdom
JAM Recruitment Ltd
live and transferrable DV Clearance. Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role: As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience: We're more interested in your …
Employment Type: Contract
Rate: £500 - £550 per day + Umbrella, inside IR35

SRE Engineer

London, South East, England, United Kingdom
Robert Walters
industry-recognised certifications, strong mentorship, and technical development programmes, you will have every chance to advance your career while working on cutting-edge AWS native databases and automation projects. SITE RELIABILITY ENGINEER. Salary: £400 - £500 per day, inside IR35. Location: London. You will be part of a close-knit team that values knowledge sharing, continuous learning, and professional … technical development programmes, you will have every chance to advance your career while working on cutting-edge AWS native databases and automation projects. What you'll do: As a Site Reliability Engineer based in London, you will play an integral role in supporting a wide range of AWS native databases including RDS, Aurora, Neptune, as well as CockroachDB. … to enhancing product observability and telemetry by supporting ongoing modernisation efforts within the infrastructure. * Collaborate closely with engineering teams to brainstorm ideas that simplify infrastructure management and streamline SRE practices. What you bring: * Proficiency in Python or Unix Shell scripting combined with solid SQL skills enables you to automate tasks efficiently across complex environments. * A good understanding of development …
Employment Type: Contractor
Rate: £400 - £500 per day

DevOps Solution Architect (London)

London, UK
Wipro Technologies
strong background in DevOps design and transformation, cloud-native engineering, and modern DevOps tooling. The ideal candidate will also bring expertise in Site Reliability Engineering (SRE) principles and practices, with a focus on building scalable, reliable, and resilient systems. Key Responsibilities: • Architect and implement scalable, secure, and high-performance DevOps solutions. • Lead DevOps transformation initiatives across … enterprise environments. • Design and implement cloud-native solutions on Azure, AWS, or GCP. • Apply SRE principles to ensure system reliability, availability, and performance. • Build and maintain CI/CD pipelines and infrastructure as code (IaC). • Evaluate and integrate modern DevOps tools and practices. • Collaborate with cross-functional teams to align DevOps and SRE strategies with business goals. • Mentor … and lead DevOps teams, fostering a culture of innovation and continuous improvement. • Leverage AI and machine learning to optimize DevOps and SRE processes. • Ensure compliance, security, and operational excellence in all DevOps practices. Required Qualifications: • 15+ years of experience in IT, with a strong focus on DevOps and cloud architecture. • Proven experience in DevOps design and transformation across multiple projects. …
Employment Type: Full-time

Cloud Software Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Anson McCade
Cloud Software Engineering. Location: Hybrid – London. Salary: £70,000 – £85,000 (depending on experience) + 10% bonus. Are you a software engineer with deep cloud expertise, looking to shape complex platform solutions at scale? We’re seeking a Cloud Software Engineer with strong engineering foundations and hands-on experience delivering cloud-native, microservices-based applications in enterprise environments. … CD tooling. Excellent problem-solving skills with a detail-oriented mindset. Strong interpersonal skills and ability to work in a collaborative team environment. Bachelor’s degree in Computer Science, Engineering, or related field. Preferred Skills: Experience with front-end … frameworks such as React. Working knowledge of database technologies: SQL and NoSQL. Exposure to project management and Agile delivery tools (e.g., Jira, Confluence). Understanding of FinOps, SRE practices, and cloud service optimisation. This is more than a typical cloud engineer role: it's an opportunity to lead delivery across cloud platforms while applying solid software engineering …

Cloud Software Engineer

London Area, United Kingdom
Hybrid / WFH Options
Anson McCade
Cloud Software Engineering. Location: Hybrid – London. Salary: £70,000 – £85,000 (depending on experience) + 10% bonus. Are you a software engineer with deep cloud expertise, looking to shape complex platform solutions at scale? We’re seeking a Cloud Software Engineer with strong engineering foundations and hands-on experience delivering cloud-native, microservices-based applications in enterprise environments. … CD tooling. Excellent problem-solving skills with a detail-oriented mindset. Strong interpersonal skills and ability to work in a collaborative team environment. Bachelor’s degree in Computer Science, Engineering, or related field. Preferred Skills: Experience with front-end … frameworks such as React. Working knowledge of database technologies: SQL and NoSQL. Exposure to project management and Agile delivery tools (e.g., Jira, Confluence). Understanding of FinOps, SRE practices, and cloud service optimisation. This is more than a typical cloud engineer role: it's an opportunity to lead delivery across cloud platforms while applying solid software engineering …

Head of SRE and Production Engineering (London)

London, UK
SS&C Technologies
distributor services across asset managers, insurance companies, retirement providers, and wealth management platforms. Job Overview: As the Head of Production Engineering and Site Reliability Engineering (SRE) for the GIDS organisation, you will lead a team responsible for the scalability, resilience, performance, and reliability of cloud and hybrid infrastructure powering some of the most critical client … with metrics, and build systems and teams that proactively address issues before they impact clients. Key Responsibilities: Define and execute the vision and roadmap for Production Engineering and SRE within GIDS. Build and lead globally distributed, high-performance teams with a focus on talent development, SRE culture, and operational excellence. Collaborate cross-functionally with Engineering, Product, Compliance, and … in around-the-clock operations, including tooling, automation, and shift rotation planning. Qualifications Required: 10+ years of experience in engineering, with 5+ years in a leadership role in SRE, DevOps, or Production Engineering. Proven track record managing reliable, scalable systems in a high-compliance environment (e.g., FinTech, HealthTech). Strong understanding of modern software development lifecycle, CI/CD …
Employment Type: Full-time

Lead Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Lloyds Bank plc
a Service capability consisting of a Token Exchange component and the provision of a signed token for Channels to use downstream, allowing management of their access. As a Lead SRE, you'll be responsible for ensuring the reliability, scalability, and operational excellence of distributed systems. This includes driving cloud adoption, automating workflows, and embedding SRE principles into the software … coaching, and developing peers across the teams in the lab, while the other major part of the role will be acting as a completely hands-on Lead DevOps/SRE to lead by example. Our engineers are passionate about ensuring the services, both internal and external, have unwavering reliability, uptime appropriate to users' needs, resiliency, architectural simplicity, as well … our 26 million customers. We're growing with purpose. Join us on our journey and you will too. What you'll need: A software engineer within the field of SRE with an exceptional understanding of SRE & DevOps that you can explain in critical detail to your mentees. Production Kubernetes experience and debugging all services that run within the K8s ecosystem, including Istio …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
NinjaOne, LLC
we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability … and SOPs. Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure. Other duties as needed. About You: 5+ years' experience in Site Reliability Engineer roles. Expert+ level Linux administration, scripting, and troubleshooting. Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog). Comprehensive experience with AWS (Amazon Web …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
NinjaOne, LLC
we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineer to join our SRE team in the Platform Engineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability … and SOPs. Develop software, scripts, or tooling to improve efficiency and reduce delivery time of applications and infrastructure. Other duties as needed. About You: 5+ years' experience in Site Reliability Engineer roles. Expert+ level Linux administration, scripting, and troubleshooting. Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog). Comprehensive experience with AWS (Amazon Web …
Employment Type: Permanent
Salary: GBP Annual

Principal SRE Engineer

London, South East, England, United Kingdom
Robert Walters
A leading global financial institution is seeking a Principal Site Reliability Engineer to provide essential support for their Foreign Exchange (FX) desk, focusing on trading and risk applications, including an advanced algorithmic ultra-low latency stack. … This is a unique opportunity to play a pivotal role in ensuring the reliability, performance, and scalability of a real-time trading environment by applying best-in-class SRE principles. PRINCIPAL SITE RELIABILITY ENGINEER. Salary: £110,000 - £125,000. Location: London. A leading global financial institution is seeking a Principal Site Reliability Engineer to provide essential … This is a unique opportunity to play a pivotal role in ensuring the reliability, performance, and scalability of a real-time trading environment by applying best-in-class SRE principles. You will work directly with senior traders and developers on the trading floor, optimising workflows, troubleshooting complex issues, and driving ongoing improvements across both processes and technology. What you …
Employment Type: Full-Time
Salary: £110,000 - £125,000 per annum

DevOps Solution Architect (London)

London, UK
Join DevOps
strong background in DevOps design and transformation, cloud-native engineering, and modern DevOps tooling. The ideal candidate will also bring expertise in Site Reliability Engineering (SRE) principles and practices, with a focus on building scalable, reliable, and resilient systems. Key Responsibilities: • Architect and implement scalable, secure, and high-performance DevOps solutions. • Lead DevOps transformation initiatives across … enterprise environments. • Design and implement cloud-native solutions on Azure, AWS, or GCP. • Apply SRE principles to ensure system reliability, availability, and performance. • Build and maintain CI/CD pipelines and infrastructure as code (IaC). • Evaluate and integrate modern DevOps tools and practices. • Collaborate with cross-functional teams to align DevOps and SRE strategies with business goals. • Mentor … and lead DevOps teams, fostering a culture of innovation and continuous improvement. • Leverage AI and machine learning to optimize DevOps and SRE processes. • Ensure compliance, security, and operational excellence in all DevOps practices. Required Qualifications: • 15+ years of experience in IT, with a strong focus on DevOps and cloud architecture. • Proven experience in DevOps design and transformation across multiple projects. …
Employment Type: Full-time

Senior Production Support Engineer

London, United Kingdom
TP ICAP Group
Essential: At least 7 years' hands-on support experience within a financial institution (buy-side, sell-side, venue/platform provider). Experience with Site Reliability Engineering (SRE) practices, including monitoring, incident response, and post-mortem analysis. Hands-on experience with containerization technologies such as Docker and Kubernetes. Proven experience managing cloud-based infrastructure and services, including AWS …
Employment Type: Permanent
Salary: GBP Annual

Senior DevOps Engineer - FinOps - Enabling Services

London, United Kingdom
Hybrid / WFH Options
Lloyds Bank plc
changing needs of our 26 million customers. We're growing with purpose. About this opportunity: A fantastic opportunity has arisen for a Senior DevOps Engineer to join the FinOps Engineering team within Enabling Services. In this pivotal role, you'll enable teams to build cost-effective solutions on GCP while maintaining agility and fostering innovation. This position is perfect … for engineers who are passionate about optimising cloud usage, enhancing cost observability, and championing a FinOps culture. What you'll do: Partner with engineering, finance and product teams to drive cost-efficiency across GCP. Design and implement automation to boost cost optimisation. Build infrastructure and pipelines using Git, Terraform and Harness. Contribute to cost visibility by using cost and … offs to customers. Work collaboratively across teams to embed cost-awareness into design, development, deployment, and monitoring. What you'll need: Experience in a DevOps, Platform Engineering or SRE role, with strong hands-on experience in GCP. Clear understanding of FinOps principles and how they apply to engineering responsibilities. Experience of Infrastructure as Code, CI/CD and …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Randstad Technologies Recruitment
Job Title: Senior SRE - Site Reliability Engineering for Observability. Location: London (Mostly Remote | 1 Day/Week in Office). Pay Rate: £50 - £62 per hour (Inside IR35). Contract Duration: Initial 12 Months. Working Hours: 11:00 AM - 7:00 PM. About the Role: We're looking for a Senior Site Reliability Engineer (SRE) to join … team within a leading global tech environment. This is a hands-on, senior-level role focused on building and scaling large-scale monitoring and logging platforms that ensure service reliability, performance, and visibility. If you're passionate about distributed systems, high-throughput data pipelines, and enabling engineering teams with top-tier observability tooling, this is the role for … infrastructure using Terraform and configuration with Ansible. Participating in on-call rotations to ensure platform uptime and responsiveness. What We're Looking For: 5+ years of experience in SRE/DevOps roles, managing large-scale systems. Strong technical knowledge of Linux (Ubuntu/Debian) environments. Proven experience with observability tools such as: ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, Grafana …
Employment Type: Contract
Rate: £50 - £62/hour

Lead Site Reliability Engineer

London, United Kingdom
Lloyds Banking Group
a Service capability consisting of a Token Exchange component and the provision of a signed token for Channels to use downstream, allowing management of their access. As a Lead SRE, you'll be responsible for ensuring the reliability, scalability, and operational excellence of distributed systems. This includes driving cloud adoption, automating workflows, and embedding SRE principles into the software … coaching, and developing peers across the teams in the lab, while the other major part of the role will be acting as a completely hands-on Lead DevOps/SRE to lead by example. Our engineers are passionate about ensuring the services, both internal and external, have unwavering reliability, uptime appropriate to users' needs, resiliency, architectural simplicity, as well … our 26 million customers. We're growing with purpose. Join us on our journey and you will too. What you'll need: A software engineer within the field of SRE with an exceptional understanding of SRE & DevOps that you can explain in critical detail to your mentees. Production Kubernetes experience and debugging all services that run within the K8s ecosystem, including Istio …
Employment Type: Permanent
Salary: GBP Annual

Senior Director - Operations and Reliability Engineering

London, United Kingdom
Boston Consulting Group
that allow our clients to thrive. What You'll Do: The Senior Director - Operations and Reliability Engineering is responsible for blending Site Reliability Engineering (SRE), DevOps, and traditional operations models to build a next-generation Reliability Engineering function. This role ensures end-to-end automation at scale, 24x7 operational excellence, and high availability … ITSM) processes across all teams, ensuring compliance with standardized frameworks and operational excellence. Key Responsibilities: Strategic Leadership & Transformation: Define and execute a modern Reliability Engineering strategy, integrating SRE, DevOps, and automation-first operational models. Drive end-to-end automation to eliminate toil, improve efficiency, and enhance operational resilience. Lead the transition from traditional IT operations to a proactive … stack observability. IT Service Management & Operational Excellence: Mandate and assure the adoption of IT Service Management (ITSM) processes across all teams, ensuring standardized, efficient, and effective service delivery. Establish SRE-based operational metrics, including SLOs, SLIs, and error budgets. Oversee incident response, problem resolution, and root cause analysis with AI-driven remediation. Ensure high availability, performance, and security compliance for …
Employment Type: Permanent
Salary: GBP Annual

Site Reliability Engineering, London
10th Percentile: £65,000
25th Percentile: £81,250
Median: £105,000
75th Percentile: £118,125
90th Percentile: £138,750