Site Reliability Engineer Jobs in London

18 of 18 Site Reliability Engineer Jobs in London

Site Reliability Engineer (SRE)

London, South East, England, United Kingdom
Charles Simon Associates Ltd
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE … on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
Employment Type: Full-Time
Salary: £75,000 - £85,000 per annum
Posted:

Site Reliability Engineer (SRE)

EC1A, Farringdon, Greater London, United Kingdom
Charles Simon Associates Ltd
Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote THIS IS AN AZURE FOCUSED ROLE, IF YOU APPLY AND DO NOT WORK EITHER SOLEY OR MAINLY ON AZURE YOU WILL NOT BE CONSIDERED. Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start … Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
Employment Type: Permanent
Salary: £80000 - £95000/annum
Posted:

Site Reliability Engineer

City of London, London, United Kingdom
REVYBE IT RECRUITMENT LIMITED
Site Reliability Engineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a Site Reliability Engineer (SRE) to join their platform … performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incident response , post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What Were Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations … native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the More ❯
Employment Type: Permanent
Salary: £75,000
Posted:

Azure Site Reliability Engineer

City of London, London, England, United Kingdom
Opus Recruitment Solutions Ltd
Azure Site reliability Engineer|6 month contract|Onsite 2/3 days per week|£650 per day InsideIR35 Opus RS are looking for a Senior Site Reliability Engineer with deep expertise in Azure cloud migration and a strong DevOps background to join our clients team. What We're Looking For Previous experience as a … Site Reliability Engineer Strong skills in Terraform, GitHub, AKS, and networking (load balancing, Firewalls, routing). Proven track record in Agile delivery and DevOps practices. Extensive experience with Azure and cloud migration using frameworks … like CAF and WAF. Ability to communicate effectively with technical and non-technical stakeholders. Familiarity with change control processes and performance monitoring. If you're a results-driven Senior SRE ready to tackle a new cloud challenges and deliver innovative solutions, we'd love to hear from you, please contact me at More ❯
Employment Type: Contractor
Rate: £650 per day
Posted:

Senior Site Reliability engineer

london, south east england, united kingdom
Axiom Software Solutions Limited
Role- Senior Site Reliability Engineer (SRE) Location - London (full onsite- 5 days every week) Perm up to 80K gross Minimum 12+ year profile are required PFB updated JD Core Competency, • Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin etc. • Efficiency in creating Dashboard for Infra/APM/E2E workflows. • Monitoring, logging, Alerting and Error budget , 99.99 More ❯
Posted:

Site Reliability Engineer

london, south east england, united kingdom
Hybrid/Remote Options
GSR Markets
connecting teams with the capital, market access, and insights they need to build what's next. About the Role We are seeking a Site Reliability Engineer (SRE) to design, optimize, and support highly available systems across our global trading infrastructure. As part of GSR's SRE team, you will manage a multi-regional cloud environment while integrating … . You will work across all layers of infrastructure, including: Networking & Exchange Connectivity Linux Systems & Kubernetes Administration Microservice Orchestration & Observability Disaster Recovery & Security Optimization Your mission is to improve reliability, security and performance, underpinning GSR's success as the world's premier crypto partner. We value engineers who take ownership and full-spectrum responsibility for their services, with a More ❯
Posted:

Senior Site Reliability Engineer

london, south east england, united kingdom
Bumble
application, please feel free to note which pronouns you use (For example: she/her, he/him, they/them, etc). We are looking for an experienced engineer with strong Linux and system-level expertise who can operate autonomously in complex production environments. You must be able to independently troubleshoot incidents, lead … and support post-incident service recovery, and drive improvements to overall system stability, performance, and observability. We are looking for a hands-on Site Reliability Engineer (SRE) with a strong background in Linux infrastructure and third-party system operations. This role focuses on managing and optimizing large-scale environments (5,000+ hosts) running technologies like Kafka, Redis … and Kubernetes. The position does not involve application development but requires deep operational expertise and solid troubleshooting skills. Qualifications 5+ years of experience in Linux system administration or SRE roles Proven experience managing large-scale infrastructure environments Strong troubleshooting and performance tuning skills at the infrastructure level Basic scripting/automation experience (Bash, Python) Familiarity with IaC tools (e.g., Ansible More ❯
Posted:

Site Reliability Engineer (London) - Banking & Finance

London, United Kingdom
Hamilton Barnes Associates Limited
With a strong culture of collaboration and technical excellence, the organisation continues to push the boundaries of low-latency infrastructure and reliable system design. The team is hiring a Site Reliability Engineer (London) to build, monitor, and optimise mission-critical trading systems. The role will focus on automation, system scalability, and incident response to maintain maximum uptime … support new trading systems, continuously improving the infrastructure. Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: 1-5 years working experience The right candidate will come from a financial services background … Solid experience with Linux Systems administration and troubleshooting. Hands-on experience with Kubernetes for container orchestration. Proficient in Python scripting for automation and system management. A mindset focused on site reliability and performance. Strong troubleshooting skills and a proactive approach to problem-solving. Benefits: Lucrative bonus scheme Salary: Up to £90,000 base salary More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineering

london, south east england, united kingdom
Ntt Data
The team you';ll be working with: We are seeking an experienced Site Reliability Engineer (SRE)/Observability Technical Lead to join our team and drive the strategy and execution of observability and reliability projects across our clients. The ideal candidate will have deep expertise in Application Performance Monitoring (APM), Infrastructure as Code (IaC), automation, and … using OpenTelemetry. As a lead, you will guide the design, implementation, and continuous improvement of observability solutions, ensuring system reliability, performance, and scalability while fostering best practices in SRE and DevOps. What you';ll be doing: Lead the strategic development and management of observability and reliability frameworks across the organization, ensuring alignment with business goals and technical requirements. … collaborative relationships. Mentor and develop junior engineers and analysts, fostering a culture of reliability, observability, and operational excellence. What experience you';ll bring: 5+ years of experience in SRE, Observability, or DevOps roles, with leadership responsibilities. Proven expertise with Application Performance Monitoring (APM) tools such as New Relic, Datadog, AppDynamics, or Dynatrace. Hands-on experience with OpenTelemetry (OTel) for More ❯
Posted:

Site Reliability Engineer

london, south east england, united kingdom
Hybrid/Remote Options
ZILO
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … creates real impact. If you're ready to shape the future, let's talk. Requirements: We're looking for a Site Reliability Engineer to join our SRE team — someone who thrives on solving complex production issues, understands how applications behave in the real world, and takes pride in keeping systems reliable and performant. This is not a … trace and fix issues affecting reliability or performance. Data Fixes: Perform data investigation and fixes using Postgres. Operational Excellence: Patch and maintain Kubernetes clusters and other production systems. SRE Roadmap: Contribute to the continuous improvement of our observability, reliability, and automation initiatives. This role is hybrid and will require regular weekly attendance at our London office. Requirements Solid More ❯
Posted:

Site Reliability Engineer, Region Services

London, United Kingdom
Amazon
Overview Site Reliability Engineer, Region Services Job ID: AWS EMEA SARL (UK Branch) Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future generations? AWS builds … you'll be part of a world-class team in a dynamic environment that has the entrepreneurial feel of a start-up. This is an opportunity to operate and engineer systems on a massive scale, and to gain world class experience in cloud computing. You'll be surrounded by people who are passionate about cloud computing, believe that first … Build and operate distributed systems Design and build the tools and utilities that are part of the AWS fleet running our internal services Key job responsibilities The Systems Development engineer will be a key member of a new team pioneering automated build and deployment of Windows based services. The team is adopting a code-first and hands off CI More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - London

London, United Kingdom
Hybrid/Remote Options
Valarian Technologies Limited
software, platforms, and infrastructure. The Role Join us as a Site Reliability Engineer and help us build the future of data sovereignty! We're seeking an SRE passionate about creating high-performance, scalable, and reliable services for our production infrastructure. You'll have a direct impact, improving existing systems and developing innovative solutions to complex challenges. Our … implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the ACRA platform for high availability and fault tolerance. This includes ensuring resilience against Cloud Availability Zone outages and the ability to gracefully handle node failures. Guarantee 99.9% uptime … capacity planning, and optimization of resource utilization. Collaborate closely with the product engineering team to influence the design and implementation of new products and features, ensuring they meet our reliability and scalability standards from the outset. Preferred Qualifications Bachelor's degree (or equivalent) in Computer Science or a related field; relevant practical experience will also be considered Proficiency with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Team Lead - Site Reliability Engineering

London, United Kingdom
Arbuthnot Latham
The Team Lead - Site Reliability Engineering is responsible for ensuring the effective and efficient running of the current NOC team with a view to transition to an SRE function over time. The team is responsible for enabling innovation and velocity of change while ensuring system reliability focusing on the critical features and functionality within products and platforms. … Customer Excellence and Continual Service Improvement within the team. I dentify, develop, communicate, and implement process changes within the team. Act as a point of escalation for the team. SRE responsibilities: Help define the SRE practice for the organisation, collaborate with other stakeholders to select the relevant SRE principles, define the objectives and measurements of the outcomes. Collaborate with stakeholders … to product owners and key stakeholders. Design, code, test and deliver solutions to automate manual operation (i.e., "TOIL"). Participate in operations support and on-call rotation shifts, for SRE supported systems and products. Participate in or lead problem management activities , including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer/Azure Engineer

London Area, United Kingdom
Cognizant
We are seeking highly skilled and experienced Senior Site Reliability Engineers with good proficiency in cloud migration, using Azure CAF. This role will report into the Cloud Platform Lead. The role will be an important cog in the technical implementation for delivering further services to the cloud. This will deliver solutions using traditional style infrastructure to Azure cloud … and the core cloud platform for the products. Role Requirements You should have strong DevOps background, and experience in adoption to public cloud following agile methodology. You must have SRE experience, follow a good development lifecycle, be able to communicate with both technical and less technical colleagues and be able to document both design and handover documentation with high quality. … Cyber Security, Testing the Application Team and the Cloud Operations team. You must be familiar with good change control processes. If you are a passionate and results-focused Senior SRE with expertise in Azure, Terraform, GitHub, and networking, we encourage you to apply for this challenging and rewarding position. Candidates should be SC cleared or SC Eligible. Minimum Essential Criteria More ❯
Posted:

Senior Engineer, SRE

London, United Kingdom
London Stock Exchange Group
We're looking for a Site Reliability Engineer to join our LSEG's (London Stock Exchange Group) Real Time SRE team. As part of the SRE team, you will play a central role in supporting our large, globally-distributed market data infrastructure. You'll be responsible for providing support for critical Real Time applications, identifying and automating … Level Objectives and implement monitoring to ensure our systems are available and healthy WHAT YOU'LL BRING: At least 5 years of hands-on industry experience in DevOps or SRE, or relevant equivalent (experience in supporting financial markets, real-time data, or client-facing applications a plus) Hands-on experience supporting cloud applications (AWS/Azure preferred) Fluent in writing More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
IntaPeople
with a long-standing client in the data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential … Experience with scalable, cloud-native applications This is a brilliant opportunity to work with a forward-thinking tech team that values autonomy, innovation, and impact. If you re an SRE who thrives in fast-paced environments and loves solving complex problems, I d love to hear from you. Apply now to learn more. More ❯
Employment Type: Contract
Rate: GBP 450 - 550 Daily
Posted:

Site Reliability Engineer

Sutton, Greater London, United Kingdom
IntaPeople
with a long-standing client in the data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They’re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential … Experience with scalable, cloud-native applications This is a brilliant opportunity to work with a forward-thinking tech team that values autonomy, innovation, and impact. If you’re an SRE who thrives in fast-paced environments and loves solving complex problems, I’d love to hear from you. Apply now to learn more. More ❯
Employment Type: Contract
Rate: £450 - £550/day
Posted:

Senior Site Reliability Engineer - HFT Team

London, England, United Kingdom
Selby Jennings
My client, a successful quantitative investment manager, is looking for a Senior DevOps engineer to join their ML ops team and to implement testing, development, automation tools, and IT infrastructure for the ML platform team and its users. They are looking for a senior with 10+ years of experience and proficient in AWS and Terraform. Key Responsibilities: Implement testing More ❯
Posted:
Site Reliability Engineer
London
10th Percentile
£72,169
Median
£85,000
75th Percentile
£150,000