Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
excellence Develop and implement strategic plans to enhance the reliability, scalability, and efficiency of our infrastructure Collaborate with cross-functional teams to align SRE initiatives with broader organizational goals Establish and maintain SLIs, SLOs, and SLAs for critical systems and services Drive the adoption of best practices in automation … and management solution that helps organizations harness AI's potential while ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA SLO SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. … Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal SRE to shape and implement the SRE strategic plan. Lead the SRE team in translating strategy into actionable plans, coordinating these through the SCRUM process. Address wellbeing and performance concerns, fostering a positive and productive team environment. Work with More ❯
Cambridge, Cambridgeshire, East Anglia, United Kingdom
RedTech Recruitment
game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for someone to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge Salary: £32,000 £60,000 + excellent benefits (£32,000 for a new Graduate) Requirements … problem solving identifying the root causes of issues. Good logical reasoning Responsibilities for SiteReliabilityEngineer Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will be diagnosing issues … been removed by the job-board, full details for contact are available on our website). Keywords- SiteReliabilityEngineer/SRE/DevOps/Software Engineering/Software Development/Engineering/Physics/Astrophysics/Python/Computer science/Cloud/Mathematics/AWS More ❯
Staines-Upon-Thames, England, United Kingdom Hybrid / WFH Options
Anson McCade
Principal SiteReliabilityEngineer Location: London (Hybrid - 1/2 days per week) Salary: 95k - 120k An American software company who were named #1 on Forbes list of most innovative companies is currently hiring for a Principal SiteReliabilityEngineer position. They are expanding … Cloud Infrastructure Engineering team, which is responsible for building and managing a global full stack cloud infrastructure that runs their SaaS product. This Principal SiteReliabilityEngineer position has a wide range of responsibilities, including but not limited to; Building a web configuration management pipeline that is … plenty of help, but to be successful you will need to be capable of digging deep into internals of DevOps tools. The Successful Principal SiteReliabilityEngineer Benefits Package Includes: Base Salary of £95,000-£120,000 (Depending on experience) 15% Uncapped Bonus Up to More ❯
enabler of Capital One's ambitions. We are keen to add a Senior SiteReliability Engineering Manager (SSREM) to our Nottingham based SRE organisation whose primary focus is to provide effective leadership as we evolve and mature sitereliability practices for the benefit of our cloud … applications and their customers. The successful candidate will be a leader of leaders with custodianship of application services across 5+ SRE teams. We're looking for an experienced professional whose technical background allows effective challenge and support of teams managing primarily Java based applications running in a dynamic IaaC AWS … outcomes in the pursuit of business, functional and personal goals. The successful application will lead by example, build strong and valuable relationships within the SRE org, wider tech and business stakeholders. They have the ability to face ambiguity and understand how to make sense of complexity, importantly being able to More ❯
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
SiteReliabilityEngineer When registering to this job board you will be redirected to the online application form. Please ensure that this … is completed in full in order that your application can be reviewed. We are looking for an experienced SiteReliabilityEngineer (SRE) to join our Technical Operations team within Microlise. Your key responsibilities would include implementing and supporting the Microlise infrastructure. This will involve bringing automation and … TechOps experience, especially from an Infrastructure as Code approach Familiarity with development technologies like C# and SQL, Git In-depth knowledge and understanding of SRE practicesand infrastructure application monitoring frameworks Understanding of diverse monitoring requirements and tools An enthusiasm and ability to learn new technologies and approaches Excellent investigation and More ❯
SiteReliabilityEngineer When registering to this job board you will be redirected to the online application form. Please ensure that this … is completed in full in order that your application can be reviewed. We are looking for an experienced SiteReliabilityEngineer (SRE) to join our Technical Operations team within Microlise. Your key responsibilities would include implementing and supporting the Microlise infrastructure. This will involve bringing automation and … TechOps experience, especially from an Infrastructure as Code approach Familiarity with development technologies like C# and SQL, Git In-depth knowledge and understanding of SRE practicesand infrastructure application monitoring frameworks Understanding of diverse monitoring requirements and tools An enthusiasm and ability to learn new technologies and approaches Excellent investigation and More ❯
SiteReliabilityEngineer When registering to this job board you will be redirected to the online application form. Please ensure that this … is completed in full in order that your application can be reviewed. We are looking for an experienced SiteReliabilityEngineer (SRE) to join our Technical Operations team within Microlise. Your key responsibilities would include implementing and supporting the Microlise infrastructure. This will involve bringing automation and … TechOps experience, especially from an Infrastructure as Code approach Familiarity with development technologies like C# and SQL, Git In-depth knowledge and understanding of SRE practicesand infrastructure application monitoring frameworks Understanding of diverse monitoring requirements and tools An enthusiasm and ability to learn new technologies and approaches Excellent investigation and More ❯
Lead Cloud Infrastructure and SiteReliabilityEngineer Brand: HSBC Area of Interest: Technology Location: Birmingham, GB, B1 1HQ Work style: Office Worker Date: 24 Apr 2025 Join a digital-first bank that's powered by people. Our technology team builds innovative digital solutions rapidly and at scale …/Infrastructure Security. Your work will provide assurance of the effectiveness of security controls to Business Risk Owners. The Lead Cybersecurity Analytics Cloud Infrastructure & SiteReliabilityEngineer will be part of the CSA Platform & Data Engineering Team, joining a global team of data technology professionals to deliver … Monitoring, Availability, Resiliency). To be successful in this role, you should meet the following requirements: Strong understanding of SiteReliability Engineering (SRE) principles and hands-on experience with Azure DevOps. Proficient in scripting (Bash, PowerShell, Azure CLI), coding (Python, C#, Java), and querying (SQL, Kusto Query Language More ❯
Lead SiteReliabilityEngineer Are you ready to take your career to the next level in a role that’s critical to the reliability, scalability, and performance of cutting-edge systems? We’re on the lookout for a Lead SiteReliabilityEngineer to … Contribute to quality systems through deviation management, CAPA follow-up, and root cause investigations. What We’re Looking For: 5+ years of experience in SiteReliability Engineering or a related field. Hands-on experience with Biosafety and GMP environments. Strong foundation in Lean Six Sigma principles. Proven problem More ❯
Reading, England, United Kingdom Hybrid / WFH Options
People Source Consulting trading as Experis
SiteReliabilityEngineer - DevOps Engineer 18 Month Contract PAYE - Fully Remote/or Hybrid based in Midlands if preferred. The role We are working with one of the finest gaming studios in the industry and are on the lookout for an … exceptional SiteReliabilityEngineer who can bring their expertise and unique thinking to help make their team even stronger! As an SRE the main purpose is solving for scale through collaboration and automation, bringing engineering principles to infrastructure and operational problems. Work closely with the different teams More ❯
facilitate effective job matching and career development, not just for our users but also for our own team members. We are looking for a SiteReliabilityEngineer Lead to ensure our systems are reliable, scalable, and efficient. As the SiteReliabilityEngineer Lead, you … maintaining the health and performance of our platforms while also leading a talented team of engineers. You will champion and coach best practices in reliability and operational excellence to deliver an exceptional experience for our users. Key Responsibilities Minimising downtime to products & services and ensuring the platform is stable … availability and performance. Work with senior stakeholders to mature the concept of SiteReliability within the CVL organisation. Lead and mentor the SRE function, fostering a culture of collaboration, innovation, and excellence. Creating a bridge between Development and support teams by applying an ‘as-a-service' mindset to More ❯
Hi Professionals Hope you are doing great Our client is looking for multiple SREEngineer experienced consultants to join a high-growth organization. If you are interested kindly, please share your CV and let me know your rate expectation. Role: SREEngineer Location: Birmingham UK (Hybrid) Duration: Contract … Experience: • Experienced SREengineer (3 years +) • Experience in security applications particularly SIEM • Preferred experience in Elastic based systems • Preferred experience in SIEM technologies • Knowledge of Infrastructure Monitoring tools Skill Sets: • Bachelor’s degree in Computer Science, Engineering, or a related field. • Proven experience as a SiteReliabilityEngineer or similar role. • Strong knowledge of cloud platforms (SIEM technologies • Proficiency in scripting languages (e.g., Python, Bash). • Experience with infrastructure as code (e.g., Terraform, Ansible). • Familiarity with containerization and orchestration (e.g., Docker, Kubernetes). • Excellent problem-solving skills and attention to detail. • Strong communication and More ❯
The SiteReliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our platform … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failures scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities Write high-quality More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
bet365 Group
A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration … is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor … the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation … for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure More ❯
Stoke-On-Trent, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor … the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation … for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure More ❯
Birmingham, Staffordshire, United Kingdom Hybrid / WFH Options
N Consulting Limited
Role: SRE Lead Location: Birmingham, UK (Hybrid, 2-3 days WFO) Contract: 3 months (Possible extension ) Are you a skilled SiteReliabilityEngineer (SRE) with experience in maintaining scalable and reliable infrastructure? We're looking for a proactive leader with a passion for automation, incident management, and … system optimization. Key Skills Required: 5+ years of SRE or similar experience Expertise in Cloud Platforms (SIEM technologies preferred) Proficiency in Python or Bash scripting Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible) Familiarity with Docker and Kubernetes Strong problem-solving and collaboration skills Responsibilities: Design, implement, and More ❯
Reigate, Surrey, United Kingdom Hybrid / WFH Options
Willis Towers Watson
Description Summary : We are seeking a SiteReliabilityEngineer to join our SRE team based in Reigate. The ideal candidate will have excellent communication skills, experience working with multiple stakeholders, and a track record in Azure and Observability platforms. You will be joining Insurance Consulting and Technology … delivery family to deliver core foundational functionality that will be used by multiple SaaS product offerings across the business. You will be with other SiteReliability and Response teams as well as with the core Applications Teams, whose responsibility is to deliver and manage business critical services that … working arrangements, with presence in the Reigate office up to two days per week. The Role: Collaborate with cross-functional teams to ensure the reliability, availability, and performance of our client-facing services Maintain and configure observability platforms such as Datadog Proactive monitoring of production and other environments to More ❯
Dundee, Angus, United Kingdom Hybrid / WFH Options
Ivanti
offerings. We are responsible for the reliability, deployment, and operation of the Ivanti Cloud product portfolio. We are seeking individuals eager to drive SRE maturity through the research and development of internal tooling, operational enhancements, and deployment pipelines. Ivanti SRE takes a holistic view of operational procedures, incident response … procedures, application and infrastructure monitoring, and process automation. Ivanti SRE is a blend of infrastructure, networking, automation, development, and application administration. This is a hands-on technical position. The ideal candidate will have a software engineering background and strong experience with continuous deployment, SaaS delivery, and production incident response. This … the company's growth trajectory through continuous innovation and customer-centric solutions. What You Will Be Doing Researching, maintaining, and contributing to automation of SRE tools and processes Contributing to solutions toward reducing toil within SRE Participating in code review and analysis with SRE peers Composing and reviewing contributions to More ❯
SiteReliabilityEngineer (SRE) Head Resourcing is pleased to be working with one of the UK's leading retail banks who are looking for an experienced Cloud SRE to join our engineering team and help drive reliability, scalability, and automation across our cloud-based products and … services on Google Cloud Platform (GCP). This role is all about embedding SRE best practices, improving platform resilience, and troubleshooting service issues with an engineering-first approach, using code and automation rather than manual work. Key Skills & Experience: Background in software engineering or telemetry, ideally with SRE experience. Hands … a mission to transform our technology landscape, investing in automation, innovation, and engineering talent. If you want to help shape the future of cloud reliability, we’d love to hear from you. More ❯
london, south east england, United Kingdom Hybrid / WFH Options
RP International
SiteReliabilityEngineer | Inside IR35 | Hybrid - 2 Days Onsite London | 6 Month Contract Our client a multinational and respected consultancy is hiring for a Lead SiteReliabilityEngineer with expertise in AWS and DevOps Tools for a new project in the Public Sector. Technical More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
SiteReliability & Platform Engineer to help lead the way. You'll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices … ship faster, safer, and more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through … opportunity for someone passionate about building robust infrastructure and enabling others to move faster and more securely. You might come from a cloud engineering, SRE, or DevOps background - what matters most is your curiosity, systems thinking, and drive to improve operational efficiency. At Sorted, we are committed to fostering an More ❯
enabler of Capital One's ambitions. We are keen to add a Senior SiteReliability Engineering Manager (SSREM) to our Nottingham based SRE organisation whose primary focus is to provide effective leadership as we evolve and mature sitereliability practices for the benefit of our cloud … applications and their customers. The successful candidate will be a leader of leaders with custodianship of application services across 5+ SRE teams. We're looking for an experienced professional whose technical background allows effective challenge and support of teams managing primarily Java based applications running in a dynamic IaaC AWS … outcomes in the pursuit of business, functional and personal goals. The successful application will lead by example, build strong and valuable relationships within the SRE org, wider tech and business stakeholders. They have the ability to face ambiguity and understand how to make sense of complexity, importantly being able to More ❯
Services, Azure Functions, Azure Logic Apps, Azure SQL, Azure Storage, Application Insights, Azure Redis, VNets and Azure App Gateway. 2+ years of experience with Reliability concepts to ensure high performance and high service availability, able to define implement and improve business performance SLO's. 2+ years of experience with …/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and retrospective analysis. 2+ or more years in hands-on technical roles (such as sitereliabilityengineer, software engineer, DevOps engineer, infrastructure engineer). Experience with infrastructure management across multiple cloud and on-premise … less experienced engineers. Production environments with on-call rotations. Advocacy: Train and mentor engineering teams on modern observability practices and techniques. Define and socialize SRE culture, best practices, architectural and security standards. Assess and raise risks across the organization. Partnership with: Internal engineering, architecture and operations teams to ensure alignment. More ❯
A prestigious, technology-driven hedge fund is seeking a highly skilled SiteReliabilityEngineer (SRE) to join their global infrastructure team. This is a unique opportunity to work in a high-performance, low-latency trading environment where technology is at the heart of the firm’s competitive … critical role in ensuring the performance, reliability, and scalability of the systems that power the fund’s trading and research platforms. As an SRE, you will work closely with software engineers and investment teams to build automation-first solutions that support the firm’s most advanced strategies. Key Responsibilities … across the business. Design and implement automation to eliminate manual tasks and reduce operational risk. Collaborate with software and investment teams to embed the SRE mindset early in the development lifecycle. Ideal Candidate: SRE with experience working with data systems Ability to program (structured, OOP, and TDD) using one or More ❯