Salford, Manchester, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
SiteReliabilityEngineer (SRE) for GCP Analytics Platform page is loaded SiteReliabilityEngineer (SRE) for GCP Analytics Platform Apply locations Manchester time type Full time posted on Posted 2 Days Ago time left to apply End Date: August 14, 2025 (12 days left to apply) job requisition id 139740 End Date Wednesday … Range £47,790 - £53,100 We support flexible working - click here for more information on flexible working options Flexible Working Options Hybrid Working, Job Share Job Description Summary An SRE will focus on monitoring and improving the SLO of their cloud infrastructure services whilst working under the guidance of senior SRE colleagues. Job Description JOB TITLE: SiteReliability … which involves spending at least two days per week, or 40% of our time, at our Manchester office. About this opportunity As a SiteReliabilityEngineer (SRE) within the Data & Platform Enablement Lab, you'll play a pivotal role in shaping and supporting a best-in-class analytics platform on Google Cloud within Lloyds Banking. Our mission More ❯
they are already renowned as having game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for a Graduate or experienced SRE professional to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge Salary: £32,000 - £70,000 per annum + excellent benefits including private healthcare (could … be more available for an experienced SRE) Requirements for SiteReliabilityEngineer - Graduate Considered: Excellent academics including 2.1 or 1st class honours degree from a leading international University in a STEM subject A minimum of AAB at A-Level or international equivalent if applying at Graduate level Any experience working an incident response or technical support environment … knowledge this role will not lead to a role in the R&D/Software teams Responsibilities for SiteReliabilityEngineer - Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will diagnose issues within complex systems, identify root causes More ❯
Cambridge, Cambridgeshire, East Anglia, United Kingdom
RedTech Recruitment
are already renowned as having game-changing technology within their industry, with exciting scope for expansion into further industries. This role is looking for someone to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge Salary: £32,000 £60,000 + excellent benefits (£32,000 for a new Graduate) Requirements for SiteReliability … a role involving lots of problem solving identifying the root causes of issues. Good logical reasoning Responsibilities for SiteReliabilityEngineer Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will be diagnosing issues within complex systems and identifying … if this email address has been removed by the job-board, full details for contact are available on our website). Keywords- SiteReliabilityEngineer/SRE/DevOps/Software Engineering/Software Development/Engineering/Physics/Astrophysics/Python/Computer science/Cloud/Mathematics/AWS/Azure/GCP/ More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
SiteReliabilityEngineer (SRE) - Market leading company - Milton Keynes (Tech stack: .Net, C#, ASP.Net Core, SQL Server, PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes) Help shape the tech future of UK market leader! Backed by a major financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at … scale. We're hiring a Lead SiteReliabilityEngineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and resilience. Tech stack you'll work with: .NET, C#, ASP.NET Core, SQL … Server, PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes We want to hear from you if: As a SiteReliabilityEngineer (SRE) you've delivered scalable systems using .NET, C#, and ASP.NET Core , with real-world experience managing production workloads You've automated operations using PowerShell, Azure CLI, and Bash to reduce toil and More ❯
Engineer - SiteReliability Engineering page is loaded Engineer - SiteReliability Engineering Apply locations USA-St. Louis-795 Office Pkwy time type Full time posted on Posted 11 Days Ago job requisition id R Our Team We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in SiteReliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at LSEG. Main responsibilities We are looking for people with a passion to learn, and who bring a continuous improvement mentality to our team! SREs maintain … core of our team's purpose. Write automation to scale systems sustainably, prevent service issues, or when they occur, quickly recover service. Partner with development teams to improve system reliability, observability, and release velocity. Participate in on-call rotations, incident response, postmortems, and root cause analysis and resolution. Be a vocal advocate of strong/sound engineering practices that More ❯
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid / WFH Options
McGregor Boyall
Site … ReliabilityEngineer | UK Remote | 6months | £530 p/d outside ir35 One of our public sector clients is seeking a skilled SiteReliabilityEngineer (SRE) to support and enhance their modern digital platform as it transitions from on-premise to cloud-native environments. You'll work within a highly collaborative, agile Site Resilience team … focused on building reliable, secure, and scalable infrastructure and services. SiteReliabilityEngineer - Key Responsibilities: Administer and optimise RHEL 7/8/9 and Red Hat Satellite Automate OS and application deployment using Ansible and Infrastructure as Code (IaC) principles Support Oracle 19c on Oracle Linux with KVM and CommVault integration Maintain observability stacks (Prometheus, Grafana More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
VIQU IT
Lead SiteReliabilityEngineer Hybrid/Remote – Once a month requirement in Leeds. Up to £80,000 per annum plus car allowance plus bonus. VIQU have partnered with a leading company within the supply chain industry who are seeking a Lead SiteReliabilityEngineer (AWS) to join and mentor their growing team. This position … organisations cloud infrastructure. This role is mostly remote, with monthly travel required to Leeds. Responsibilities of the Lead SiteReliabilityEngineer: Lead a team of four SRE’s, helping to maintain the stability of cloud platforms. Take on hands on technical responsibilities within AWS, utilising a range of cloud technologies (CI/CD, Container Orchestration, IaaS, Scripting … Lead SiteReliabilityEngineer: Must have at least a years’ experience in managing technical teams, and over five years of experience in a hands on, technical SRE/Dev Ops Engineer role. Experience with CI/CD tools (Jenkins and Concourse CI ideally). Must hold experience within AWS and hold relevant AWS certifications (SA1, DOP More ❯
West Midlands, England, United Kingdom Hybrid / WFH Options
MYO Talent
SiteReliabilityEngineer/SRE/Dynatrace/Observational Monitoring Tools/Automation/Grafana Labs/InfluxDB tools/Software/Network/Remote based/6 month contract/£500 – 650 per day Inside IR35. One of our leading clients is looking to recruit a SiteReliabilityEngineer (SRE) with strong Dynatrace … experience. Location – remote Duration – 6 months Day rate – £500 – 650 per day Experience: Must have experience working as a SRE/SiteReliabilityEngineer Must have strong Dynatrace experience Strong reliability, performance, and availability of systems, leveraging Dynatrace for monitoring and troubleshooting Dynatrace delivery, support and implementation Installation and Configuration, Performance Analysis, Incident Response, Automation Experience More ❯
Solihull, West Midlands, United Kingdom Hybrid / WFH Options
MYO Talent
SiteReliabilityEngineer/SRE/Dynatrace/Observational Monitoring Tools/Automation/Grafana Labs/InfluxDB tools/Software/Network/Remote based/6 month contract/£500 650 per day Inside IR35. One of our leading clients is looking to recruit a SiteReliabilityEngineer (SRE) with strong Dynatrace … experience. Location remote Duration 6 months Day rate £500 650 per day Experience: Must have experience working as a SRE/SiteReliabilityEngineer Must have strong Dynatrace experience Strong reliability, performance, and availability of systems, leveraging Dynatrace for monitoring and troubleshooting Dynatrace delivery, support and implementation Installation and Configuration, Performance Analysis, Incident Response, Automation Experience More ❯
Graduate DevOps Engineer/SRE All top graduates with tech-related degrees should read this! If you have a passion for building things, love constantly solving interesting challenges and also enjoy some coding as well, then we would encourage you to explore a career in DevOps & SiteReliability Engineering (if you're not already!). The demand … for this skill set is high, the role is interesting and varied and it is quite rare to see entry-level DevOps or SRE positions advertised. If you're already an experienced DevOps Engineer or SiteReliabilityEngineer we also really want to hear from you, as we are excited to be able to offer this … days a week in office) Salary: £35,000 - £70,000 per annum + excellent benefits (£35,000 for a new Gradaute, more DOE experience) Requirements for Graduate DevOpsEngineer/SRE: This company hires some of the very brightest engineers and is looking for a 2.1 or 1st class honours degree from a leading international University in a STEM subject Minimum More ❯
JOB TITLE: SiteReliabilityEngineer (GCP Analytics Platform) SALARY: £70,929 - £78,810 LOCATION(S): Manchester HOURS: Full-time - 35 hours per week WORKING PATTERN: Our work style is hybrid … which involves spending at least two days per week, or 40% of our time, at our Manchester office. About this opportunity As a SiteReliabilityEngineer (SRE) within the Data & Platform Enablement Lab, you'll play a pivotal role in shaping and supporting a best-in-class analytics platform on Google Cloud within Lloyds Banking. Our mission … years of hands-on experience working with Google Cloud products, particularly in the context of analytics platforms or large-scale infrastructure. Strong understanding of SiteReliability Engineering (SRE) principles, including SLIs/SLOs, error budgets, and incident response. Experience with infrastructure as code (e.g., Terraform, Deployment Manager) and CI/CD pipelines. Proficiency in monitoring, logging, and observability More ❯
SiteReliabilityEngineer - Core & Security (f/m/d) Posted On 04/28/2025 Job Information Number of Positions 1 Assigned Recruiter(s) Yann Provost Hiring Manager Yann Provost Technology Work Experience 4-5 years City Lausanne, Switzerland or remote in EU/UK State/Province Vaud (fr) 1006 Job Description Exoscale is … Exoscale strives to create an environment with great working conditions and welcomes diverse applicants. As part of its ongoing efforts to grow its infrastructure footprint Exoscale is hiring a SiteReliability Engineer. The sitereliabilityengineer plays a critical role in ensuring constant availability of the Exoscale platform. The engineering team at Exoscale works on … all aspects from designing & developing products, to their operation and support. With an expanding customer base and new products to further advance Exoscale's product portfolio, sitereliability engineers build and maintain a wide range of technologies. As users of Exoscale itself, sitereliability engineers also take active part in improving products. This position focuses on More ❯
SiteReliabilityEngineer - Data Infrastructure, AD/ADAS London/Product & Technology - AD/ADAS/Employee/hybrid Woven by Toyota is enabling Toyota's once-in-a-century transformation into a mobility company. Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current state of mobility through … development. The right candidate will have excellent communication skills, solid coding skills, expertise in building scalable, reliable, highly available and fault-tolerant systems, broad knowledge of software engineering and sitereliability engineering in areas such as Large-Scale Data and Compute Infrastructure, Stream Processing, Kubernetes, High-Performance Networking, Observability and Infrastructure Automation. RESPONSIBILITIES Set the technology strategy for … maintain, optimize and support large scale, multi-region, multi-cloud compute and storage infrastructure powering our data platform and mission critical services. Work with fellow Data Infrastructure engineers and SiteReliability engineers to ensure our systems are scalable, reliable, fault-tolerant, highly available, highly performant, and observable. Manage incidents, triage product or system issues and debug/track More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Capital One (Europe) plc
Nottinghamshire Senior Software Development Engineer - SiteReliability About the Role We're looking for a Senior Engineer to join our SiteReliability Engineering (SRE) team. This role is ideal for a skilled Java engineer with a passion for understanding how complex systems work, analysing performance, and applying engineering solutions to make them more … efficient, stable, and scalable. You'll lead on planning and implementing key SRE initiatives, optimise and automate how our systems operate, and improve observability through better monitoring and logging. You'll also work closely with your peers to drive consistency and high standards across SRE and the wider engineering community, so a real enthusiasm for influencing others and leading by … opportunities to reduce operational overheads through observability and service automation. Drive engineering best practice (e.g., Operational Excellence, Security, Quality, Resilience etc.) and set standards across the team and wider SRE community. Innovate within your team and contribute within your technical domain. Deliver key pieces of intent from inception through to design and hands-on delivery, in collaboration with your SREM. More ❯
Senior SiteReliabilityEngineer page is loaded Senior SiteReliabilityEngineer Apply remote type Remote locations Remote - United Kingdom time type Full time posted on Posted Yesterday job requisition id JR- Job Description … The rapid adoption of advanced software in vehicles marks a new era for automakers and consumers, bringing both advantages and challenges. As part of SiteReliability Engineering (SRE) at General motors, you'll join a dedicated team focused on enhancing the reliability, efficiency, and scalability of our distributed systems. We leverage engineering principles to manage operations effectively … engineering and systems engineering skills to keep our services resilient, robust, and scalable. This role is for a hands-on position as an Individual Contributor (IC). As an SRE IC, you will focus on enhancing the reliability, efficiency, and performance of our services. You'll work closely with other engineers to develop automated solutions, respond to incidents, and More ❯
Luupli started internal testing since June 2024 and getting ready for a commercial BETA testing from December 2024, with the hope of launching fully summer of 2025 Job Title: SiteReliability Platform Engineer About Luupli: Luupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be … up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking a talented and experienced SiteReliabilityEngineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure and services, primarily hosted … to proactively identify performance bottlenecks, system outages, and other potential issues. - Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents. 3. Reliability and Performance Optimization: - Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning. - Identify opportunities to automate manual processes and improve system resilience. More ❯
Fancy being our next SRE Superstar? SiteReliabilityEngineer (SRE) Sunderland (Hybrid) Full-time Alright, listen up! Here at Tombola, we're not just about bingo - we're about brilliant tech, seamless experiences, and keeping millions of players happy. And to do that, we need a SiteReliabilityEngineer who's as excited about … working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with unbeatable stability. What you'll be getting up to: System Reliability & Availability Hero: You'll be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs . You'll also be … tech and better ways of doing things, constantly pushing us to improve system reliability, performance, and efficiency. Sound like a bit of you? If you're an experienced SRE with a passion for building reliable, scalable, and efficient systems, and you love working in a fun, collaborative environment, then we want to hear from you! Ready to join the More ❯
Senior SiteReliabilityEngineer London - Hybrid £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for SiteReliabilityEngineer to join a forward-thinking and high-growth technology company offering a Hybrid work environment, a great benefits, and opportunities for further progression … With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate … strategies and conduct chaos engineering experiments Monitor and maintain Kafka clusters for performance and reliability Respond to and resolve application-level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
Senior SiteReliabilityEngineer London - Hybrid £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for SiteReliabilityEngineer to join a forward-thinking and high-growth technology company offering a Hybrid work environment, a great benefits, and opportunities for further progression … With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate … strategies and conduct chaos engineering experiments *Monitor and maintain Kafka clusters for performance and reliability *Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with Kafka and observability tools like Datadog or Grafana *Able to troubleshoot issues across infrastructure and More ❯
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
Senior SiteReliabilityEngineer - Reuters The Reuters Professional DevOps team is a global squad with members from over five countries. Our work reflects on which is a source of real-time, nonpartisan information on world events, trends and culture. The DevOps team takes a factory approach to infrastructure, by designing and developing repeatable cloud-native patterns and … spans multiple regions, auto-scales, mitigates attacks and helps our customers stay informed wherever they are. Intrigued by a challenge? Reuters Professional DevOps Team is looking for an experienced engineer, who's passionate about automation and scalability to work from our London Office . About the Role: As a Senior SiteReliabilityEngineer at Reuters , you … forward until the adoption of chosen solutions reaches a 100% Communicate clearly, frequently, and take pleasure in simplifying technical concepts for non-technical audiences About You: As our Senior SiteReliabilityEngineer, you are likely to have: Essential Skills & Experience Comfortable with various flavors of (U L)inux and ready to discuss implementations of reg(ex ular More ❯
SiteReliabilityEngineer - Microsoft Admin (Windows Server, IIS, MS SQL Server) Team Summary The ReliabilityEngineer (SRE) is a member of a cross-functional Operations & Infrastructure team responsible for running our Visa Spend Clarity for Enterprises production infrastructure and ensuring the highest levels of availability, performance, and operational excellence. What a SiteReliabilityEngineer does at Visa: The SRE is responsible for finding the right way to run robust applications in our environments. In this role, you will balance engineering improvements, systems operations, and contributions to strategic initiatives. You will work closely with all members of the Technology Group to improve the reliability, availability, performance, monitoring, and operations of Visa More ❯
Lead SiteReliabilityEngineer (Lead SRE) Ready to keep things running smoothly? Join our tombola team! At tombola, we pride ourselves on building our own exceptional games and platforms in-house. That means keeping everything running flawlessly is paramount! We're seeking a Lead SiteReliabilityEngineer (SRE) to join us and help ensure … our critical systems and services are always reliable, available, and performing at their best. What will yo u be doing? As an SRE, you'll be instrumental in implementing automation, monitoring, and incident response strategies to minimize downtime and optimize our operations. You'll collaborate closely with our development, infrastructure, and security teams, balancing exciting new feature delivery with rock … with our broader business objectives. Collaborating with other teams and departments to achieve shared success. Partnering with our People Partner for tech to build robust team management practices. System Reliability and Availability Ensure system uptime: Monitor and maintain the availability and reliability of critical systems and services, meeting all uptime SLAs (Service Level Agreements). Incident management: Quickly More ❯
Bristol, Avon, England, United Kingdom Hybrid / WFH Options
Robert Walters
design, development, and operation of cloud infrastructure and applications on Google Cloud Platform. You will work collaboratively with engineering and infrastructure teams to implement sitereliability engineering (SRE) principles, focusing on system reliability, observability, automation, and operational excellence. This role follows a hybrid working model, requiring attendance at the Bristol office for at least two days per … week or 40% of the working time. Key Responsibilities Promote and embed SRE best practices within engineering teams and microservices environments Partner with infrastructure and DevOps engineers to improve system resilience and performance Troubleshoot complex incidents and implement long-term solutions through code and automation Develop and improve automation pipelines to reduce manual operations and enhance system efficiency Contribute to … multiple strategic digital initiatives and collaborate across engineering domains Essential Skills and Experience Background in software engineering or telemetry, with current focus on SRE Extensive experience with public cloud platforms, particularly Google Cloud (or AWS/Azure) Proven ability to manage Kubernetes clusters in production environments Competence in scripting and development using languages such as Python, Java, Go, Bash, or More ❯
application performance - identifying, and implementing, improvements to application performance and stability. Collaborate with the design and implementation of the desired pipelines and process for deployment to production environment. The SRE will work closely with Platform and Software domains to ensure continuous improvement of performance and stability whilst adhering to standards. Undertake ad-hoc projects and other activities as required. Key … Accountabilities and Activities Contribute to the SRE function including: Drive evolution of the DevOps/GitOps toolchain, promoting improvements to streamline the software delivery process and showing improvements through metrics. Accountable for halting or stopping a project/product if the solution is not technically acceptable. Responsible for producing and maintaining documentation relating to application design, integration processes, testing procedures … to create operational run and playbooks. Integration with Domains including: Collaborating with Domains to plan, design, test and maintain the application. Design patterns for any component or structure under SRE responsibility. Implementation of components such as Monitoring and Logging. Manage the runbook preparations of Domains. Liaise and support other teams on work items including: Developing, refining, and tuning integrations between More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Senior SiteReliability EngineerLondon - Hybrid£80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for SiteReliabilityEngineer to join a forward-thinking and high-growth technology company offering a Hybrid work environment, a great benefits, and opportunities for further progression!This company … performance. With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries.In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems.The ideal candidate will … strategies and conduct chaos engineering experiments*Monitor and maintain Kafka clusters for performance and reliability*Respond to and resolve application-level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering*Strong experience with AWS, EKS/Kubernetes, and Terraform*Familiar with Kafka and observability tools like Datadog or Grafana*Able to troubleshoot issues across infrastructure and More ❯