Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
SITERELIABILITYENGINEER £40k salary Join a growing, technology-driven business operating at scale within the online gaming and sports sector. Opportunity to shape the SRE strategy. ABOUT THE CLIENT Our client is a fast-growing digital technology company at the forefront of delivering high-availability platforms for the sports and gaming industry. They pride themselves on … Engineer to strengthen their engineering function and help evolve their observability and automation capabilities. THE BENEFITS Hybrid working model (office and remote) Opportunity to define and lead SRE strategy within a collaborative culture Exposure to modern cloud-native and containerised environments THE SITERELIABILITYENGINEER ROLE: As a SiteReliabilityEngineer, you … testing, system tuning and incident management to ensure smooth operation during critical events. SITERELIABILITYENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and More ❯
Wigan, Greater Manchester, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
SITERELIABILITYENGINEER £40k salary Join a growing, technology-driven business operating at scale within the online gaming and sports sector. Opportunity to shape the SRE strategy. ABOUT THE CLIENT Our client is a fast-growing digital technology company at the forefront of delivering high-availability platforms for the sports and gaming industry. They pride themselves on … Engineer to strengthen their engineering function and help evolve their observability and automation capabilities. THE BENEFITS Hybrid working model (office and remote) Opportunity to define and lead SRE strategy within a collaborative culture Exposure to modern cloud-native and containerised environments THE SITERELIABILITYENGINEER ROLE: As a SiteReliabilityEngineer, you … testing, system tuning and incident management to ensure smooth operation during critical events. SITERELIABILITYENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and More ❯
SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £85,000 per annum + benefits Start Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliabilityEngineer (SRE … on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote THIS IS AN AZURE FOCUSED ROLE, IF YOU APPLY AND DO NOT WORK EITHER SOLEY OR MAINLY ON AZURE YOU WILL NOT BE CONSIDERED. Location: Remote (occasional travel to Nottinghamshire HQ) Salary: Up to £95,000 per annum + benefits Start … Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a SiteReliabilityEngineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and … Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you. SiteReliabilityEngineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote More ❯
SiteReliabilityEngineer (SRE) Central London (Hybrid 3 days per week in the office) £65,000 £75,000 per annum + Excellent Benefits Were working with an innovative software company thats scaling its platform to support rapid customer growth and product expansion. Theyre looking for a SiteReliabilityEngineer (SRE) to join their platform … performance into the software lifecycle. Managing and evolving CI/CD pipelines to ensure smooth deployments and rollbacks. Contributing to incident response , post-mortems, and reliability improvements. Championing SRE principles such as error budgets, SLIs/SLOs, and automation-first thinking. What Were Looking For Strong experience running cloud infrastructure (AWS preferred) in production. Proven background in Kubernetes operations … native engineering culture. Influence how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | SiteReliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the More ❯
Mid-Level SiteReliabilityEngineer (SRE) Are you an experienced SiteReliabilityEngineer with a passion for building reliable, scalable systems that empower innovation? Our client is looking for a skilled Mid-Level SRE to join our growing technology team. In this role, you’ll help ensure our infrastructure is stable, secure, and efficient … supporting the applications that drive support our clients. The Role We are seeking a mid-level SiteReliabilityEngineer (SRE) to join our technology team, helping to ensure the smooth operation and reliability of our infrastructure. You’ll play a vital role in maintaining uptime, managing deployments, and supporting other team members. This is a hands … performance, and availability of production systems. Perform regular updates, patching, and maintenance across environments. Manage infrastructure provisioning using Terraform, Ansible, and AWS. Collaborate & Support Work closely with the junior SRE to develop their practical experience and technical confidence. Partner with developers, data scientists, and business users to resolve technical issues. Automate & Optimise Contribute to configuration management and automation improvements. Identify More ❯
SiteReliabilityEngineer | Bristol (3 days onsite, 2 days remote) | £65,000–£95,000 DOE Join a Team That Builds the Backbone of Secure, High-Performance Systems In 2019, a group of engineers solving complex cross-domain challenges inside government organisations decided to take things further — and TwinStream was born. Our mission? To deliver technical excellence, operational … reliability, and exceptional … client service for some of the UK’s most high-profile government projects. Now, we’re growing — and we’re looking for a SiteReliabilityEngineer (SRE) who’s ready to shape the next generation of resilient, high-impact infrastructure. Key Responsibilities of the SiteReliabilityEngineer: As a TwinStream SRE, you’ll sit More ❯
SiteReliabilityEngineer Bristol (3 days onsite, 2 days remote) £65,000 £95,000 DOE Join a Team That Builds the Backbone of Secure, High-Performance Systems In 2019, a group of engineers solving complex cross-domain challenges inside government organisations decided to take things further and TwinStream was born. Our mission? To deliver technical excellence, operational … reliability, and exceptional … client service for some of the UK s most high-profile government projects. Now, we re growing and we re looking for a SiteReliabilityEngineer (SRE) who s ready to shape the next generation of resilient, high-impact infrastructure. Key Responsibilities of the SiteReliabilityEngineer: As a TwinStream SRE, you ll sit More ❯
SiteReliabilityEngineer | Bristol (3 days onsite, 2 days remote) | £65,000–£95,000 DOE Join a Team That Builds the Backbone of Secure, High-Performance Systems In 2019, a group of engineers solving complex cross-domain challenges inside government organisations decided to take things further — and TwinStream was born. Our mission? To deliver technical excellence, operational … reliability, and exceptional … client service for some of the UK's most high-profile government projects. Now, we're growing — and we're looking for a SiteReliabilityEngineer (SRE) who's ready to shape the next generation of resilient, high-impact infrastructure. Key Responsibilities of the SiteReliabilityEngineer: As a TwinStream SRE, you'll sit More ❯
SiteReliabilityEngineer (SRE) - eDV Cleared Location: London (On-site) Salary: Up to £75,000 + Clearance Bonus + Company Bonus Clearance: eDV (Enhanced Developed Vetting) required Are you an experienced SiteReliabilityEngineer (SRE) with active eDV Clearance Do you want to work on mission-critical systems that directly support UK National … brightest minds in the industry, ensuring the reliability, scalability and performance of complex, high-assurance systems that protect the nation. The Role: As a key member of the SRE team, you'll design, build and maintain reliable infrastructure and automation solutions to keep vital services running smoothly. You'll drive continuous improvement across monitoring, deployment, and incident response for … performance bonus . Opportunity to work on high-impact, national security projects . Career development within one of the UK's most respected secure consultancies. If you're an SRE with eDV clearance looking to make a real impact in a secure and rewarding environment, we'd love to hear from you. Apply now or reach out directly to Dominic More ❯
ReliabilityEngineer where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. As a Microsoft SQL Database SiteReliabilityEngineer ( SRE) at Barclays, you will assume a key technical role. You will assist in shaping the direction of our database administration, ensuring our technological approaches are innovative and aligned with the … Bank's business goals. You will contribute high-impact projects to completion, collaborate with management, and implement SRE practices using software engineering and database administration to address infrastructure and operational challenges at scale. As part of the Database SRE team, you will be data-driven and work to eliminate TOIL through simplification, automation, and observability, thereby enhancing the reliability … have experience with: Technical specialisation with MS SQL expertise on version - SQL for complex database related issues from availability, to tuning to architecture on enterprise scale. Contribute shaping, designing SRE practice for MSSQL offering, delivering through SRE team. Serve as the technical escalation for complex database related issues, providing expert solutions. Assist establishment and evolution of the SRE function and More ❯
SiteReliabilityEngineer/SRE/DevOps/AWS/IaC/Manchester/Permanent/Remote/£50,000 - £60,000pa Vivo Talent is proud to be partnering with a market-leading software organisation to recruit a talented SiteReliabilityEngineer (SRE) to join their growing team. This is a fantastic opportunity to … play a pivotal role in designing and maintaining reliable, scalable infrastructure that keeps the business running smoothly and enables innovation at scale. As the SRE, you'll take ownership of ensuring systems remain stable, efficient, and secure - while also having the chance to mentor junior team members and help shape the foundations of a growing engineering function. If you're … Optimise: Use tools like Terraform, Ansible and AWS to manage infrastructure and enhance automation. Collaborate & Support: Work hand-in-hand with cross-functional teams and help develop the junior SRE through mentoring and knowledge sharing. Monitor & Troubleshoot: Strengthen monitoring systems (moving from Nagios to Datadog) and take ownership of incident management. What You'll Bring Solid experience in SRE or More ❯
the future of AI. Together, we can make a meaningful impact. See more about our culture on Role Summary We are seeking highly experienced SiteReliability Engineers (SRE) to shape the reliability, scalability and performance of our platform and customer facing applications. You will work closely with our software engineers and research teams to ensure our systems … meet and exceed our internal and external customers' expectations. Location: Paris or London Reporting line: Head of Engineering What you will do As a SiteReliabilityEngineer, you balance the day to day operations on production systems with long term software engineering improvements to reduce operational toil and foster the reliability, availability, and performance of these … source projects, research publications, blog articles and conferences About you Master's degree in Computer Science, Engineering or a related field 7+ years of experience in a DevOps/SRE role Strong experience with cloud computing and highly available distributed systems Exposure to sitereliability issues in critical environments (issue root cause analysis, in production troubleshooting, on call More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
DWP Digital
Senior SiteReliabilityEngineer Pay up to £78,517 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital … with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior SiteReliabilityEngineer, within one of our SRE teams at the heart of Digital Transformation. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person … Demonstrable experience of developing cloud based and supporting cloud-based applications in AWS & Azure. Incident Resolution: Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery. Reliability Engineering: Expertise in reliability engineering, including capacity and performance management through effective monitoring, logging, and alerting. Leadership: Demonstrated ability to engage with stakeholders at all levels, providing valuable More ❯
DV-Cleared SiteReliabilityEngineer - Contract - Cheltenham - On-Site Hays Technology is recruiting an SiteReliabilityEngineer for a secure, on-site contract role in Cheltenham. This position is outside IR35 and requires active DV clearance. Rate: £550-£650/day Location: Cheltenham (3/4 days/week on-siteMore ❯
bristol, south west england, united kingdom Hybrid/Remote Options
TwinStream
organisations TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Salary: £65,000 - £95,000 DOE Location: 3 days onsite in Bristol, 2 days from home Security Clearance: Eligible for SC and/… for these services continues to grow in both scope and scale. We are seeking an experienced SiteReliabilityEngineer to help satisfy that demand. As an SRE you will be responsible for ensuring the availability, performance and cost effectiveness of these services. You will be working with multiple feature development teams and the BAU/Support team … to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks. Key Responsibilities of the SiteReliabilityEngineer: Collaborate with Software Engineers to improve reliability and performance in their subsystems Partner with System Administrators in automating toil and eliminating alerts More ❯
Manchester, Lancashire, England, United Kingdom Hybrid/Remote Options
Lorien
Junior SiteReliabilityEngineer Hybrid - Manchester x2 days a week Salary up to £45,000 + Bonus The Company: Lorien Global are supporting a growing business based in Manchester City Centre as they expand their Support Services team. With an exciting pipeline of work ahead, they're looking to hire an experienced Junior SiteReliabilityEngineer to play a key role in supporting and improving their online platforms. The Role: You'll act as a technical escalation point, handling complex support queries from the Service Desk and resolving advanced issues across both Windows and Linux environments. From diagnosing system faults to working closely with Infrastructure and Development teams when escalations are required, your More ❯
About the Role Are you passionate about building resilient systems and eliminating operational toil through automation? We’re looking for a SiteReliabilityEngineer (SRE) to join our high-impact team and help shape the future of our digital infrastructure. As an SRE, you’ll blend software engineering with systems engineering to ensure the reliability, availability … tools and scripts to reduce manual operations and improve system resilience. Optimize system performance and resource usage, identifying and resolving bottlenecks. Collaborate with development and product teams to integrate SRE best practices into the software lifecycle. Contribute to the evolution of our SLIs, SLOs, and error budgets to drive reliability metrics. Stay current with industry trends and contribute to … our internal engineering communities. What You Bring Proven experience as an SRE, DevOps Engineer, or Systems Engineer in a complex, high-availability environment. Deep expertise in Microsoft SQL Server (2016–2022), including performance tuning, high availability, and architecture. Strong scripting skills (e.g., PowerShell) and experience with automation/configuration tools like Ansible or Chef. Familiarity with observability tools More ❯
Role- Senior SiteReliabilityEngineer (SRE) Location - London (full onsite- 5 days every week) Perm up to 80K gross Minimum 12+ year profile are required PFB updated JD Core Competency, • Datadog, Splunk, Dynatrace, Grafana, Prometheus, Thousand Eyes, Gremlin etc. • Efficiency in creating Dashboard for Infra/APM/E2E workflows. • Monitoring, logging, Alerting and Error budget , 99.99 More ❯
our customer's systems are built and maintained. This role blends operational product support with software engineering to create applications to understand the overall health of our systems. The SRE team sits within a wider programme at the core of the customer mission. The role holder As an SRE, fundamentally you will be doing work that has historically been done … engineering expertise to substitute automation for human labour, with the objective of limiting traditional manual operations work (incident tickets, on-call etc.) to no more than half of the SRE team's time (and aiming for considerably less). You will have an enthusiasm to learn and experiment, to develop tools to understand application health and improve their reliability … enable them to be scalable and resilient to failure, and how to get the best out of the infrastructure they are deployed to. Participating in the wider DevOps/SRE community within the organisation. Competencies It is desirable for you to have experience in the areas below. However more valued for this role is that you have excitement and enthusiasm More ❯
application, please feel free to note which pronouns you use (For example: she/her, he/him, they/them, etc). We are looking for an experienced engineer with strong Linux and system-level expertise who can operate autonomously in complex production environments. You must be able to independently troubleshoot incidents, lead … and support post-incident service recovery, and drive improvements to overall system stability, performance, and observability. We are looking for a hands-on SiteReliabilityEngineer (SRE) with a strong background in Linux infrastructure and third-party system operations. This role focuses on managing and optimizing large-scale environments (5,000+ hosts) running technologies like Kafka, Redis … and Kubernetes. The position does not involve application development but requires deep operational expertise and solid troubleshooting skills. Qualifications 5+ years of experience in Linux system administration or SRE roles Proven experience managing large-scale infrastructure environments Strong troubleshooting and performance tuning skills at the infrastructure level Basic scripting/automation experience (Bash, Python) Familiarity with IaC tools (e.g., Ansible More ❯
With a strong culture of collaboration and technical excellence, the organisation continues to push the boundaries of low-latency infrastructure and reliable system design. The team is hiring a SiteReliabilityEngineer (London) to build, monitor, and optimise mission-critical trading systems. The role will focus on automation, system scalability, and incident response to maintain maximum uptime … support new trading systems, continuously improving the infrastructure. Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: 1-5 years working experience The right candidate will come from a financial services background … Solid experience with Linux Systems administration and troubleshooting. Hands-on experience with Kubernetes for container orchestration. Proficient in Python scripting for automation and system management. A mindset focused on sitereliability and performance. Strong troubleshooting skills and a proactive approach to problem-solving. Benefits: Lucrative bonus scheme Salary: Up to £90,000 base salary More ❯
SiteReliabilityEngineer Bristol (3 days onsite, 2 days remote) £65,000-£95,000 DOE Join a Team That Builds the Backbone of Secure, High-Performance Systems In 2019, a group of engineers solving complex cross-domain challenges inside government organisations decided to take things further - and TwinStream was born click apply for full job details More ❯
We’re building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior SiteReliabilityEngineer, you’ll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack. This isn’t just about keeping the lights on. It’s … about rethinking how systems behave under pressure, how teams respond to incidents, and how automation can unlock new levels of performance and efficiency. What You’ll Do; Engineer for Resilience: Design and implement systems that are fault-tolerant, self-healing, and built for scale. Automate Everything: Build tools and scripts to eliminate manual toil, streamline operations, and accelerate recovery. … analysis and strategic fixes. Drive Performance: Monitor, tune, and optimize systems to ensure peak performance across platforms. Partner Across Teams: Collaborate with product, infrastructure, and development teams to embed SRE principles into the software lifecycle. Influence Culture: Advocate for reliability-first thinking, mentor engineers, and help shape a culture of technical excellence. What You Bring; Strong coding skills in More ❯
team. Thus, we are building the firm around exceptional talent. Position Overview The Junior SiteReliability & Network Engineer will be working alongside the head of the SRE team to ensure the reliability, scalability, and performance of our trading platform. This role will blend traditional system administration with software development and require expertise in cloud network engineering … particularly with AWS. The ideal candidate will understand network routing and networking stacks. Key Responsibilities System Reliability and Performance Monitor, maintain, and optimize the performance, availability, and scalability of our trading platform. Respond to and resolve system incidents, ensuring minimal downtime and swift recovery. Collaborate with the development team to design and implement system enhancements. Infrastructure Management Under the … implement improvements to our infrastructure and development processes. Participate in post-mortem reviews to identify areas of improvement after incidents. Qualifications Minimum of 2 years of experience in a SiteReliability Engineering, DevOps, or similar role. Strong experience with hybrid cloud and on-prem system management. Expertise in cloud network engineering, particularly with AWS. Proficiency in containerisation platforms More ❯