Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
excellence. Develop and implement strategic plans to enhance the reliability, scalability, and efficiency of our infrastructure. Collaborate with cross-functional teams to align SRE initiatives with broader organizational goals. Establish and maintain SLIs, SLOs, and SLAs for critical systems and services. Drive the adoption of best practices in automation … and management solution that helps organizations harness AI's potential while ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA/SLO/SLI management. Experience leading teams and implementing Scrum processes. Excellent communication and leadership skills. … Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal SRE to shape and implement the SRE strategic plan. Lead the SRE team in translating strategy into actionable plans, coordinating these through the Scrum process. Address wellbeing and performance concerns, fostering a positive and productive team environment. Work with …
london, south east england, United Kingdom Hybrid / WFH Options
MarkJames Search
Job Title: Site Reliability Engineering (SRE) Lead – Observability Location: Stratford, London (Hybrid – 2 days per week onsite) Contract Length: 6 months Rate: £450–£500 per day (Inside IR35) Industry: Financial Services A leading Financial Services organisation in London is seeking a Site Reliability Engineering (SRE) Lead – Observability to join their team on a 6-month contract. This is a hybrid role requiring two days per week onsite at their Stratford, London offices. The role sits Inside IR35. Key Responsibilities: Lead the SRE Observability team and champion observability practices across multiple product groups. … creation and QA of project-level Observability Plans. Input into and assure the quality of testing strategies and results. Requirements: Proven experience in an SRE role with a strong focus on Observability. Expert-level proficiency with DevOps tools including GitHub, GitHub Actions, Jenkins, Nexus, CloudFormation/Terraform, and CodeQL. Extensive …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
bet365 Group
A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and … availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. … Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user …
stoke-on-trent, midlands, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will … monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and … automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will …
manchester, north west england, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will … monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and … automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
Site Reliability & Platform Engineer to help lead the way. You'll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering … ship faster, safer, and more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms. Applying SRE principles like SLOs, observability, and incident management to drive service reliability. Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows. Enabling teams through … for someone passionate about building robust infrastructure and enabling others to move faster and more securely. You might come from a cloud engineering, SRE, or DevOps background - what matters most is your curiosity, systems thinking, and drive to improve operational efficiency. At Sorted, we are committed to fostering an …
Dundee, Angus, United Kingdom Hybrid / WFH Options
Ivanti
offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response … procedures, application and infrastructure monitoring, and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. … the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing: Researching, maintaining, and contributing to automation of SRE tools and processes. Contributing to solutions toward reducing toil within SRE. Participating in code review and analysis with SRE peers. Composing and reviewing contributions to …
Reigate, Surrey, United Kingdom Hybrid / WFH Options
Willis Towers Watson
Description Summary: We are seeking a Site Reliability Engineer to join our SRE team based in Reigate. The ideal candidate will have excellent communication skills, experience working with multiple stakeholders, and a track record in Azure and Observability platforms. You will be joining Insurance Consulting and Technology (ICT … delivery family to deliver core foundational functionality that will be used by multiple SaaS product offerings across the business. You will be with other Site Reliability and Response teams as well as with the core Applications Teams, whose responsibility is to deliver and manage business critical services that … working arrangements, with presence in the Reigate office up to two days per week. The Role: Collaborate with cross-functional teams to ensure the reliability, availability, and performance of our client-facing services. Maintain and configure observability platforms such as Datadog. Proactive monitoring of production and other environments to …
Bradford, Yorkshire, United Kingdom Hybrid / WFH Options
Freemans Grattan Holdings (fgh)
our customer journey. Working collaboratively with a team of transformation experts you will have the flexibility to leverage your professional experience to solve computer engineering issues across a variety of technical areas, dependent on where your interests lie. Innovation is key as we look for new ideas which will … in a DevOps, or Site Reliability Engineering building high-traffic, high availability systems. Experience with site reliability engineering (SRE) principles and monitoring tools, including New Relic. Experience in website performance monitoring and tuning using tools such as Lighthouse and the ability to troubleshoot performance …
leeds, west yorkshire, yorkshire and the humber, United Kingdom Hybrid / WFH Options
Fruition Group
Job Title: Senior Site Reliability Engineer (SRE) Location: Leeds (Hybrid - c. 1-2 days per week) Salary: £60,000 - £80,000 + benefits Why Apply? This is a fantastic opportunity for a seasoned Senior Site Reliability Engineer to take a lead role in shaping the infrastructure … most innovative businesses in their market. Working with cutting-edge technology, this role offers high-impact challenges, meaningful collaboration, and excellent career progression. Senior SRE Responsibilities: Manage and optimise cloud infrastructure to ensure scalability, high availability, and security. Design and implement robust CI/CD pipelines for efficient product delivery. … like GitLab CI, Terraform/OpenTofu, Ansible, and scripting languages such as PowerShell or Python. Champion infrastructure best practices and mentor junior team members. Senior SRE Requirements: Extensive experience in SRE or DevOps roles within high-availability, cloud-native environments. Strong expertise with AWS (including EKS, MSK, RDS, VPC design, encryption …
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Fruition Group
Job Title: Senior Site Reliability Engineer (SRE) Location: Central London (Hybrid - c. 1-2 days per week) Salary: £80,000 - £100,000 + benefits Why Apply? This is a fantastic opportunity for a seasoned Senior Site Reliability Engineer to take a lead role in shaping the … most innovative businesses in their market. Working with cutting-edge technology, this role offers high-impact challenges, meaningful collaboration, and excellent career progression. Senior SRE Responsibilities: Manage and optimise cloud infrastructure to ensure scalability, high availability, and security. Design and implement robust CI/CD pipelines for efficient product delivery. … like GitLab CI, Terraform/OpenTofu, Ansible, and scripting languages such as PowerShell or Python. Champion infrastructure best practices and mentor junior team members. Senior SRE Requirements: Extensive experience in SRE or DevOps roles within high-availability, cloud-native environments. Strong expertise with AWS (including EKS, MSK, RDS, VPC design, encryption …
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
N Consulting Limited
Role: SRE Lead Location: Birmingham, UK (Hybrid, 2-3 days WFO) Contract: 3 months (Possible extension) Are you a skilled Site Reliability Engineer (SRE) with experience in maintaining scalable and reliable infrastructure? We're looking for a proactive leader with a passion for automation, incident management, and system … optimization. Key Skills Required: 5+ years of SRE or similar experience. Expertise in Cloud Platforms (SIEM technologies preferred). Proficiency in Python or Bash scripting. Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible). Familiarity with Docker and Kubernetes. Strong problem-solving and collaboration skills. Responsibilities: Design, implement, and manage …
london, south east england, United Kingdom Hybrid / WFH Options
RP International
Site Reliability Engineer | Inside IR35 | Hybrid - 2 Days Onsite London | 6 Month Contract Our client, a multinational and respected consultancy, is hiring for a Lead Site Reliability Engineer with expertise in AWS and DevOps Tools for a new project in the Public Sector. Technical Skills/ …
edinburgh, central scotland, United Kingdom Hybrid / WFH Options
Provn
Platform Engineer/Site Reliability Engineer Location: Edinburgh (Hybrid/Onsite) Salary: Up to £90k + Benefits Provn are currently engaged on behalf of our client, a forward-thinking technology company based in Edinburgh, known … for delivering innovative, scalable solutions across multiple sectors. As part of their continued growth, they are looking to add a skilled Platform Engineer/SRE to their engineering function. This role offers a fantastic opportunity to work in a collaborative, agile environment, helping to shape the reliability, scalability … CloudWatch). Proven ability to troubleshoot and resolve complex infrastructure issues. Experience working in cross-functional engineering teams, ideally in a DevOps or SRE capacity. Strong scripting skills (e.g., Python, Bash) are beneficial. What the company can offer: Outstanding growth & development opportunities. The chance to contribute to cloud-native …
crawley, west sussex, south east england, United Kingdom Hybrid / WFH Options
James Chase
Are you an Azure DevOps/SRE looking for your next opportunity? Are you passionate about ensuring application reliability and performance? Do you thrive in a collaborative, high-impact environment? If yes, this could be your next big opportunity! Our client, a leading provider of financial services are looking … a permanent basis. Responsibilities: Managing incidents and post-mortems for on-premises and cloud applications. Monitoring performance using modern tools and implementing automation. Driving SRE and DevOps best practices. Supporting releases with minimal downtime. Key Skills & Experience: Experience in SRE, IT operations, software development, or DevOps. Familiarity with CI/… KQL, and incident management. Hands-on experience with YAML pipelines. Experience with Bicep, SolarWinds, Terraform and PowerShell. Want to be part of a growing SRE team driving automation and reliability? Click Apply now or send your CV to chinmaye.ramnath@james-chase.com *This role is hybrid working with one day …
brimington, midlands, United Kingdom Hybrid / WFH Options
Ampstek
Hi Professionals, Hope you are doing great. Our client is looking for multiple experienced SRE Engineer consultants to join a high-growth organization. If you are interested, kindly share your CV and let me know your rate expectation. Role: SRE Engineer Location: Birmingham UK (Hybrid) Duration: Contract Experience: • Experienced … SRE engineer (3 years +) • Experience in security applications particularly SIEM • Preferred experience in Elastic based systems • Preferred experience in SIEM technologies • Knowledge of Infrastructure Monitoring tools Skill Sets: • Bachelor’s degree in Computer Science, Engineering, or a related field. • Proven experience as a Site Reliability Engineer … be based in the UK Accountabilities: • Design, implement, and maintain scalable and reliable infrastructure. • Monitor system performance, identify issues, and implement solutions to improve reliability and performance. • Collaborate with development teams to ensure new features and services are reliable and scalable. • Automate repetitive tasks to improve efficiency and reduce …
london, south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
leeds, west yorkshire, yorkshire and the humber, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
greater manchester, north west england, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
london (west end), south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …
bolton, greater manchester, north west england, United Kingdom Hybrid / WFH Options
Future Talent Group
Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First Salary - £80,000/£85,000 + Bonus Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech … market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability … teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go …