Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
excellence. Develop and implement strategic plans to enhance the reliability, scalability, and efficiency of our infrastructure. Collaborate with cross-functional teams to align SRE initiatives with broader organizational goals. Establish and maintain SLIs, SLOs, and SLAs for critical systems and services. Drive the adoption of best practices in automation … and management solution that helps organizations harness AI's potential while ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA, SLO, and SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. … Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal SRE to shape and implement the SRE strategic plan. Lead the SRE team in translating strategy into actionable plans, coordinating these through the SCRUM process. Address wellbeing and performance concerns, fostering a positive and productive team environment. Work with …
Cambridge, Cambridgeshire, East Anglia, United Kingdom
RedTech Recruitment
game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for someone to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge. Salary: £32,000 - £60,000 + excellent benefits (£32,000 for a new Graduate). Requirements … problem solving identifying the root causes of issues. Good logical reasoning. Responsibilities for Site Reliability Engineer (Graduate Considered): Working within the SRE team, you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will be diagnosing issues … been removed by the job-board, full details for contact are available on our website). Keywords: Site Reliability Engineer/SRE/DevOps/Software Engineering/Software Development/Engineering/Physics/Astrophysics/Python/Computer science/Cloud/Mathematics/AWS
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Site Reliability Engineer. When registering to this job board you will be redirected to the online application form. Please ensure that this … is completed in full in order that your application can be reviewed. We are looking for an experienced Site Reliability Engineer (SRE) to join our Technical Operations team within Microlise. Your key responsibilities would include implementing and supporting the Microlise infrastructure. This will involve bringing automation and … TechOps experience, especially from an Infrastructure as Code approach. Familiarity with development technologies like C# and SQL, Git. In-depth knowledge and understanding of SRE practices and infrastructure application monitoring frameworks. Understanding of diverse monitoring requirements and tools. An enthusiasm and ability to learn new technologies and approaches. Excellent investigation and …
Lead Cloud Infrastructure and Site Reliability Engineer. Brand: HSBC. Area of Interest: Technology. Location: Birmingham, GB, B1 1HQ. Work style: Office Worker. Date: 24 Apr 2025. Join a digital-first bank that's powered by people. Our technology team builds innovative digital solutions rapidly and at scale …/Infrastructure Security. Your work will provide assurance of the effectiveness of security controls to Business Risk Owners. The Lead Cybersecurity Analytics Cloud Infrastructure & Site Reliability Engineer will be part of the CSA Platform & Data Engineering Team, joining a global team of data technology professionals to deliver … Monitoring, Availability, Resiliency). To be successful in this role, you should meet the following requirements: Strong understanding of Site Reliability Engineering (SRE) principles and hands-on experience with Azure DevOps. Proficient in scripting (Bash, PowerShell, Azure CLI), coding (Python, C#, Java), and querying (SQL, Kusto Query Language …
Lead Site Reliability Engineer. Are you ready to take your career to the next level in a role that’s critical to the reliability, scalability, and performance of cutting-edge systems? We’re on the lookout for a Lead Site Reliability Engineer to … Contribute to quality systems through deviation management, CAPA follow-up, and root cause investigations. What We’re Looking For: 5+ years of experience in Site Reliability Engineering or a related field. Hands-on experience with Biosafety and GMP environments. Strong foundation in Lean Six Sigma principles. Proven problem …
Reading, England, United Kingdom Hybrid / WFH Options
People Source Consulting trading as Experis
Site Reliability Engineer - DevOps Engineer. 18 Month Contract PAYE - Fully Remote/or Hybrid based in Midlands if preferred. The role: We are working with one of the finest gaming studios in the industry and are on the lookout for an … exceptional Site Reliability Engineer who can bring their expertise and unique thinking to help make their team even stronger! As an SRE, the main purpose is solving for scale through collaboration and automation, bringing engineering principles to infrastructure and operational problems. Work closely with the different teams …
facilitate effective job matching and career development, not just for our users but also for our own team members. We are looking for a Site Reliability Engineer Lead to ensure our systems are reliable, scalable, and efficient. As the Site Reliability Engineer Lead, you … maintaining the health and performance of our platforms while also leading a talented team of engineers. You will champion and coach best practices in reliability and operational excellence to deliver an exceptional experience for our users. Key Responsibilities: Minimising downtime to products & services and ensuring the platform is stable … availability and performance. Work with senior stakeholders to mature the concept of Site Reliability within the CVL organisation. Lead and mentor the SRE function, fostering a culture of collaboration, innovation, and excellence. Creating a bridge between Development and support teams by applying an ‘as-a-service’ mindset to …
The Site Reliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our platform … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failure scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities: Write high-quality …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
bet365 Group
A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration … is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands …
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor … the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation … for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure …
Stoke-On-Trent, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor … the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation … for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure …
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
N Consulting Limited
Role: SRE Lead. Location: Birmingham, UK (Hybrid, 2-3 days WFO). Contract: 3 months (Possible extension). Are you a skilled Site Reliability Engineer (SRE) with experience in maintaining scalable and reliable infrastructure? We're looking for a proactive leader with a passion for automation, incident management, and … system optimization. Key Skills Required: 5+ years of SRE or similar experience. Expertise in Cloud Platforms (SIEM technologies preferred). Proficiency in Python or Bash scripting. Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible). Familiarity with Docker and Kubernetes. Strong problem-solving and collaboration skills. Responsibilities: Design, implement, and …
Dundee, Angus, United Kingdom Hybrid / WFH Options
Ivanti
offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response … procedures, application and infrastructure monitoring, and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This … the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing: Researching, maintaining, and contributing to automation of SRE tools and processes. Contributing to solutions toward reducing toil within SRE. Participating in code review and analysis with SRE peers. Composing and reviewing contributions to …
Site Reliability Engineer (SRE). Head Resourcing is pleased to be working with one of the UK's leading retail banks who are looking for an experienced Cloud SRE to join our engineering team and help drive reliability, scalability, and automation across our cloud-based products and … services on Google Cloud Platform (GCP). This role is all about embedding SRE best practices, improving platform resilience, and troubleshooting service issues with an engineering-first approach, using code and automation rather than manual work. Key Skills & Experience: Background in software engineering or telemetry, ideally with SRE experience. Hands … a mission to transform our technology landscape, investing in automation, innovation, and engineering talent. If you want to help shape the future of cloud reliability, we’d love to hear from you.
London, South East England, United Kingdom Hybrid / WFH Options
RP International
Site Reliability Engineer | Inside IR35 | Hybrid - 2 Days Onsite London | 6 Month Contract. Our client, a multinational and respected consultancy, is hiring for a Lead Site Reliability Engineer with expertise in AWS and DevOps Tools for a new project in the Public Sector. Technical …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
Site Reliability & Platform Engineer to help lead the way. You'll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices … ship faster, safer, and more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms. Applying SRE principles like SLOs, observability, and incident management to drive service reliability. Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows. Enabling teams through … opportunity for someone passionate about building robust infrastructure and enabling others to move faster and more securely. You might come from a cloud engineering, SRE, or DevOps background - what matters most is your curiosity, systems thinking, and drive to improve operational efficiency. At Sorted, we are committed to fostering an …
enabler of Capital One's ambitions. We are keen to add a Senior Site Reliability Engineering Manager (SSREM) to our Nottingham-based SRE organisation whose primary focus is to provide effective leadership as we evolve and mature site reliability practices for the benefit of our cloud … applications and their customers. The successful candidate will be a leader of leaders with custodianship of application services across 5+ SRE teams. We're looking for an experienced professional whose technical background allows effective challenge and support of teams managing primarily Java-based applications running in a dynamic IaaC AWS … outcomes in the pursuit of business, functional and personal goals. The successful applicant will lead by example, build strong and valuable relationships within the SRE org, wider tech and business stakeholders. They have the ability to face ambiguity and understand how to make sense of complexity, importantly being able to …
Services, Azure Functions, Azure Logic Apps, Azure SQL, Azure Storage, Application Insights, Azure Redis, VNets and Azure App Gateway. 2+ years of experience with Reliability concepts to ensure high performance and high service availability, able to define, implement, and improve business performance SLOs. 2+ years of experience with …/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and retrospective analysis. 2+ years in hands-on technical roles (such as site reliability engineer, software engineer, DevOps engineer, infrastructure engineer). Experience with infrastructure management across multiple cloud and on-premise … less experienced engineers. Production environments with on-call rotations. Advocacy: Train and mentor engineering teams on modern observability practices and techniques. Define and socialize SRE culture, best practices, architectural and security standards. Assess and raise risks across the organization. Partnership with: Internal engineering, architecture and operations teams to ensure alignment.
City Of Bristol, England, United Kingdom Hybrid / WFH Options
Gravitas Recruitment Group (Global) Ltd
products and services within the GCP platform. Meaning the next generation of services that form this Financial Services company's vision for 2025! Role - Lead Site Reliability Engineer. Salary - £90,440 - £106,400. Location - London – Hybrid/Flexible working. Essential Skills: · Experience working with GCP products (or extensive … Jenkins, or alternatives such as Azure DevOps; You will partner with service teams to drive the adoption of Site Reliability Engineering (SRE) best practices, ensuring these principles are integrated effectively within our microservices. Collaborate with infrastructure engineers to guarantee the resilience, scalability, and overall performance of the …
A prestigious, technology-driven hedge fund is seeking a highly skilled Site Reliability Engineer (SRE) to join their global infrastructure team. This is a unique opportunity to work in a high-performance, low-latency trading environment where technology is at the heart of the firm’s competitive … critical role in ensuring the performance, reliability, and scalability of the systems that power the fund’s trading and research platforms. As an SRE, you will work closely with software engineers and investment teams to build automation-first solutions that support the firm’s most advanced strategies. Key Responsibilities … across the business. Design and implement automation to eliminate manual tasks and reduce operational risk. Collaborate with software and investment teams to embed the SRE mindset early in the development lifecycle. Ideal Candidate: SRE with experience working with data systems. Ability to program (structured, OOP, and TDD) using one or …
We are looking for an experienced Site Reliability Engineer (SRE) to join our Technical Operations team within Microlise. Your key responsibilities will include implementing and supporting the Microlise infrastructure. This involves bringing automation and observability to the core infrastructure by applying development principles. Do you have experience … we are looking for: Experience in TechOps, especially with Infrastructure as Code. Familiarity with development technologies like C# and SQL, Git. Deep understanding of SRE practices and infrastructure monitoring frameworks. Knowledge of diverse monitoring tools and requirements. Enthusiasm and ability to learn new technologies. Excellent investigation and problem-solving skills …
re Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred. Experienced in production environments running … troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues. Familiarity working in an Agile environment. True understanding of Site Reliability Engineering. Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS …
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Sage City
Job Description: We are looking for a Site Reliability Engineer to join our SRE Enablement team, a specialised function within Cloud Operations focused on building reusable infrastructure, automation, and tools that enable CloudOps and Engineering teams to operate more efficiently. You will have the opportunity to be … a key driver for SRE adoption within Sage, taking the helm in developing scalable frameworks to improve developer experience, remove toil and ultimately focus on embedding SRE best practices within the wider business. If you have experience working with Terraform and modern CI/CD workflows this could be the … also engage with broader teams to help implement these new approaches. You will have oversight of the entirety of Sage's product suite and SRE teams as you work closely with them to build tools to make them more successful. Please note this is a hybrid role - you will be …