1 to 25 of 114 Remote/Hybrid Permanent Site Reliability Engineering Jobs

Senior SRE Lead

Hiring Organisation: Albany Beck
Location: London Area, United Kingdom

that is passionate about capability build, technical excellence, and delivering meaningful change within complex enterprise environments. Role Overview Albany Beck is seeking a Senior SRE Lead/Observability SME to lead the establishment of a new enterprise Site Reliability Engineering (SRE) capability, with a primary focus … stability, incident response maturity, and end-to-end visibility across systems. This role is best suited to someone who has helped design or scale SRE and observability capabilities in large, distributed, and regulated environments. Key Responsibilities Lead the design, build, and rollout of an enterprise-wide observability capability Define observability ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation: Profile 29
Location: United Kingdom
Employment Type: Permanent, Work From Home
Salary: £65,000

Site Reliability Engineer (Security Cleared) Salary to £60k + Company Options Scheme Preference for hybrid working between your home, their offices (London Vauxhall) & client sites however fully remote working may be considered for the ideal candidate. NB: Please only apply if you already have UK Security Clearance … fast and looking for bright, dynamic people to help build their business. Role They're looking for a (Security Cleared) Site Reliability Engineer (SRE) to join their growing platform and delivery teams. You'll help design, build, and operate reliable, secure, and performant infrastructure that underpins critical public-sector services. ...

Site Reliability Engineer (Security Cleared)

Hiring Organisation: Profile 29
Location: South East London, London, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £65,000

Site Reliability Engineer

Hiring Organisation: Anson McCade
Location: Gloucester, England, United Kingdom

teams to embed best practices in system design and architecture Diagnosing and resolving complex incidents across the full technology stack Contributing to a broader SRE/DevOps community, sharing knowledge and improving standards Ideal Background Experience in software engineering, ideally with Java and web technologies (JavaScript, HTML) Strong understanding … engineering culture with a focus on continuous improvement and innovation Who Should Apply Engineers who want to move beyond traditional operations into true SRE Candidates who enjoy solving complex reliability and scalability challenges Individuals comfortable working in secure, high-trust environments People who value impact, ownership, and engineering ...

Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Your Role in our Mission: We are seeking a Site Reliability Engineer to help us transform our existing operational workloads to an SRE approach. You will embed with our Product Engineering teams to drive high availability, reliability, and uptime. In this role, you will … work from our Dean Street office two days per week. What You’ll be Doing: Integrating tightly with our Product Engineering teams Following SRE practices and maintaining high standards of compliance Implementing a new standard of observability utilising SLI/SLO/Error Budgets Continually evolving our observability platforms ...

Azure Site Reliability Engineer (Remote)

Hiring Organisation: Revybe IT Recruitment Ltd
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £40,000 - £55,000 per annum

that’s scaling its Platform & Reliability capabilities across a modern Azure cloud environment. They’re looking for a Site Reliability Engineer (SRE) to help improve system resilience, drive automation, and ensure their platforms are highly available, observable, and performant at scale. This is a hands-on role … enhance system reliability and scalability Supporting production systems and participating in incident management What we’re looking for: Experience in a DevOps, SRE, or Platform Engineering role Strong Azure experience (or comparable cloud exposure with Azure focus) Solid understanding of reliability engineering principles Hands-on with ...

Site Reliability Engineer

Hiring Organisation: VIQU IT
Location: United Kingdom, Whitechapel, Greater London
Employment Type: Permanent
Salary: £40,000 - £50,000 per annum

Senior Site Reliability Engineer (AWS/CDK/TypeScript) Remote First – Occasional travel to Leeds £40,000 - £50,000 + benefits No Sponsorship Available VIQU have partnered with a major UK technology-led organisation undergoing a significant transformation following a large-scale business merger. As part … joining a collaborative engineering environment with the opportunity to influence platform standards, improve operational resilience and support modern DevOps and SRE practices across the business. Key responsibilities: Build, maintain and improve scalable AWS infrastructure. Develop and manage Infrastructure as Code using AWS CDK. Support CI/CD pipelines ...

Site Reliability / Software Engineer - SC Cleared

Hiring Organisation: Searchability NS&D
Location: Gloucestershire, England, United Kingdom
Employment Type: Full-Time
Salary: £45,000 - £65,000 per annum

Site Reliability/Software Engineer (SC Cleared) Location: Gloucestershire SC Clearance required to start with the opportunity to be sponsored through DV Clearance after joining Salary: Up to £65,000 + Clearance Bonus To apply, email: Overview An exciting opportunity has arisen for a technically versatile engineer … Agile teams Desirable Skills Exposure to infrastructure automation tools Experience with microservices architectures Familiarity with MongoDB, Elasticsearch or similar technologies Knowledge of DevOps and SRE best practices Understanding of operational support within secure environments Experience improving system observability and performance Additional Information Active SC Clearance is required for this role ...

Data Reliability Engineer II (Tue - Sat)

Hiring Organisation: Jobleads-UK
Location: Belfast, Northern Ireland, United Kingdom

Data Reliability Engineer II (dRE) Role Overview: As a Data Reliability Engineer II, you will play a crucial, hands-on role in our global DRE team ensuring our critical database systems are reliable, fast, and scalable. Moving away from traditional, reactive database administration, our mission is to proactively … version control (Git) and deployment tools (like Argo or general CI/CD pipelines). Methodologies: Understanding of Site Reliability Engineering (SRE) practices. Preferred Experience: Exposure to Change Data Capture (CDC) tools like Striim. Certifications: Google Associate Cloud Engineer or actively studying for GCP Professional Database Engineer. ...

Site Reliability Engineer (SRE)

Hiring Organisation: Reading Industrial Pertemps
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £50,000 per annum

Site Reliability Engineer (SRE) Salary: £50,000 per annum Reporting to: Head of Technology Location: London-based | Hybrid working (2–8 days per month in the office) We’re looking for an SRE with 2–3 years of experience in DevOps, Platform Engineering, or SRE … processes and improve platform reliability Contribute to scalable, maintainable, and secure infrastructure practices What We’re Looking For 2–3 years in DevOps, SRE, or Platform Engineering Strong Linux troubleshooting skills Experience with Terraform and infrastructure-as-code CI/CD pipeline experience Strong Python and/ ...

Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Cambridge, England, United Kingdom

seeking a Site Reliability Engineer to maintain and develop our cloud infrastructure and monitoring systems Key features Location: Cambridge Fantastic opportunity to help the business develop and thrive Full time hybrid working The opportunity We are seeking a Site Reliability Engineer to maintain and develop … Site Reliability Engineer? Minimum Bachelor 2:1 degree in computer science or a related field 2+ years experience in a professional DevOps, SRE, Platform Engineering or similar role Self-motivated with strong problem-solving and analytical skills Experience using and configuring monitoring tools, ideally Grafana and Prometheus ...

Site Reliability Engineer (Kubernetes / Multi-Cloud) UK Based

Hiring Organisation: Jobleads-UK
Location: Hereford, England, United Kingdom

looking for a Site Reliability Engineer (SRE) to join an established and growing SRE team supporting Kubernetes-based platforms running across Azure and AWS. This role focuses on maintaining reliable, scalable, and observable systems, working closely with engineering teams to ensure services run smoothly in production. … Karpenter) Service mesh exposure Personal Attributes Problem‐solving mindset Willingness to learn Proactive and dependable Qualifications 2–4 years of experience in cloud/SRE/platform roles Location – Hybrid (Hereford based) or Remote Employment Type – Full Time Residency – You must have been Resident in the UK for 5 years ...

Senior Azure Platform Engineer

Hiring Organisation: Talent Locker
Location: Farnborough, Hampshire, South East, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £75,000

Overview Are you a highly skilled Senior Azure Platform Engineer looking to take the next step in your career? Join a growing and experienced engineering team delivering secure, scalable cloud platforms within a highly regulated environment. This is an excellent opportunity to play a key role in designing … Skills & Experience Degree in Computer Science, Engineering, or equivalent experience 5+ years' experience in: Platform Engineering, Site Reliability Engineering (SRE), Cloud/Platform Integration Strong Azure expertise, ideally with: AZ-104 (Azure Administrator) AZ-305 (Azure Solutions Architect) Deep knowledge of Azure networking: Virtual ...

Data Reliability Engineer II JBLE1 NI

Hiring Organisation: CME Technology Support Services Ltd
Location: Belfast, UK

Data Reliability Engineer II (dRE) Role Overview: A crucial role in CME's Cloud transformation, the dRE II will be aligned to data product pods ensuring that our data infrastructure is reliable, scalable, and efficient as the GCP data footprint expands rapidly. Accountabilities: Automate data tasks on Google Cloud … Experience as a Site Reliability Engineer or a similar role would be beneficial. Methodologies: Understanding of Site Reliability Engineering (SRE) practices. Data Technologies: Knowledge of data technologies such as relational databases, data warehousing, big data platforms (e.g., Hadoop), data streaming (e.g., Kafka), and cloud services ...

Azure Site Reliability Engineer

Hiring Organisation: Context
Location: Manchester, North West, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £65,000

depending on experience. We are supporting a highly regarded Managed Services Provider in the search for an experienced Azure Site Reliability Engineer (SRE) to join their growing cloud engineering team. This organisation has built an excellent reputation in the market and works with a broad client base … cause analysis Driving automation using Infrastructure as Code Monitoring system performance and identifying improvements Working closely with engineering teams to embed DevOps and SRE best practices Skills & Experience Proven experience as an Azure SRE, DevOps Engineer, or Cloud Engineer Strong knowledge of Azure services across compute, networking, storage, databases ...

Principal AI Technical Architect

Hiring Organisation: Jobleads-UK
Location: Bristol, England, United Kingdom

into secure, scalable architectures, then building, deploying, and running what you design. This role suits someone who enjoys the full lifecycle: rapid prototyping, strong engineering fundamentals, infrastructure‐as‐code, and the operational discipline that comes with “you build it, you run it.” You’ll help set the technical patterns … Logiq uses for AI delivery, platform engineering, observability, and secure‐by‐default cloud implementations, while leading small engineering teams and supporting clients through delivery and change. Expect variety: one week you may be shaping a reference architecture and delivery plan; the next you’re implementing CI/ ...

Staff Cloud Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Compute platform (large-scale, multi-tenant GPU fleets and scheduling systems driving model training and inference at scale). This is a founding Cloud SRE role. You won’t inherit a mature SRE function, you’ll help create it. You will define the frameworks, automation, and operational standards that ensure … Cloud Site Reliability Engineer at Wayve, we’re looking for the following skills and experience. Essential skills Proven experience in an SRE, Production Engineer, or Cloud Reliability role supporting large-scale cloud systems. Strong Kubernetes experience, including operating production clusters. Hands‐on experience running production workloads ...

Azure Site Reliability Engineer

Hiring Organisation: Context
Location: Manchester, United Kingdom
Employment Type: Permanent
Salary: £65,000 per annum

Azure Site Reliability Engineer Remote based. Paying between … depending on experience. We are supporting a highly regarded Managed Services Provider in the search for an experienced Azure Site Reliability Engineer (SRE) to join their growing cloud engineering team ...

Strategic Initiatives Program Manager – Vice President

Hiring Organisation: Jobleads-UK
Location: Belfast, Northern Ireland, United Kingdom

enterprise resilience. Responsibilities include: Lead and govern the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days. Drive the implementation and oversight of Data Recovery testing, ensuring applications can recover critical data from backup solutions … capabilities to provide transparency into program progress and application resiliency posture. Key Qualifications Experience in software engineering, site reliability engineering (SRE), or technology risk and controls. Experience in a program or project management role, delivering complex, cross-functional technology initiatives. Proven expertise in analyzing complex application ...

Strategic Initiatives Program Manager – Vice President

Hiring Organisation: Jobleads-UK
Location: Belfast, Northern Ireland, United Kingdom

Enhanced Testing and Recovery: Lead and govern the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days. Drive the implementation and oversight of Data Recovery testing, ensuring applications can recover critical data from backup solutions … capabilities to provide transparency into program progress and application resiliency posture. Key Qualifications: Experience in software engineering, site reliability engineering (SRE), or technology risk and controls. Experience in a program or project management role, delivering complex, cross-functional technology initiatives. Proven expertise in analyzing complex application ...

SC/DV Site Reliability Engineer

Hiring Organisation: IO Associates
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Permanent
Salary: £90,000

Role: Site Reliability Engineer Location: Bristol (Hybrid; 3 days per week onsite) Salary: £60,000 to £90,000 per annum (Dependent on Experience) Clearance: Must hold active SC or DV clearance, or be fully eligible to undergo the vetting process This is a highly collaborative, hands-on role … value a problem-solving mindset over a perfect checklist. If you have a strong foundation in the following, we want to speak with you: SRE/DevOps Background: Proven experience in reliability engineering or infrastructure-focused roles. Cloud Expertise: Proficiency in AWS, Azure, or GCP. IaC & Config Management ...

Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Site Reliability Engineer - UK Optum is a global organisation that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need … communities. Use your talents to improve the health outcomes of millions of people and discover the meaning behind: Caring. Connecting. Growing together. The Site Reliability Engineer is a member of Cloud Operations Automation team and responsible for the reliability, security and efficiency of Change Healthcare ...

SRE Technical Lead

Hiring Organisation: F5 consultants
Location: Berkshire, South East, United Kingdom
Employment Type: Permanent, Work From Home

SRE Technical Lead Location: Wokingham - Hybrid working (2-3 days onsite) Salary: Up to £120,000 + 10% bonus Clearance: Active SC clearance required We have an exciting opportunity for an SRE Technical Lead/Manager to join a major UK critical infrastructure programme delivering large-scale cloud-native transformation … enterprise scale. In this role, you'll take ownership of SRE strategy and platform reliability across complex Kubernetes and OpenShift environments, helping shape engineering standards, operational maturity, and long-term platform stability. You'll work within a modern cloud-native environment leveraging Kubernetes, OpenShift, GitOps, service mesh, observability ...

Staff Software Engineer, AI Reliability Engineering

Hiring Organisation: Jobleads-UK
Location: England, United Kingdom

About The Role Claude has your back. AIRE has Claude's. Help us keep Claude reliable for everyone who depends on it. AIRE (AI Reliability Engineering) partners with teams across Anthropic to improve reliability across our most critical serving paths -- every hop from the SDK through … from people who've built product stacks, scaled databases, run massive distributed systems, and everything in between. Strong candidates may also Have been an SRE, Production Engineer, or in similar reliability-focused roles on large scale systems Have experience operating large-scale model serving or training infrastructure (>1000 GPUs ...

Principal Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Manchester, England, United Kingdom

with cloud native technology. This role is ideal for someone who thrives on solving complex technical challenges, enabling high performing teams, and driving modern SRE and DevOps practices at scale based in the heart of Manchester. WHAT TO EXPECT You’ll lead and empower the SRE function while remaining technically … coaching, mentoring, and improving engineering teams and processes Solid understanding of cloud native development, CI/CD tooling, and modern DevOps/SRE practices Strong communication skills and experience operating enterprise scale production systems BENEFITS This role is rewarding in more ways than one. On top of our core ...