London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Senior Site Reliability Engineer London - Hybrid £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for a Site Reliability Engineer to join a forward-thinking and high-growth technology company offering a hybrid work environment, great benefits, and opportunities for further progression! This company operates … performance. With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate … and conduct chaos engineering experiments * Monitor and maintain Kafka clusters for performance and reliability * Respond to and resolve application-level production incidents The Person: * 5+ years in SRE, DevOps, or infrastructure engineering * Strong experience with AWS, EKS/Kubernetes, and Terraform * Familiar with Kafka and observability tools like Datadog or Grafana * Able to troubleshoot issues across infrastructure
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
Senior Site Reliability Engineer London - Hybrid £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension Excellent opportunity for a Site Reliability Engineer to join a forward-thinking and high-growth technology company offering a hybrid work environment, great benefits, and opportunities for further progression! This company … With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal … and conduct chaos engineering experiments * Monitor and maintain Kafka clusters for performance and reliability * Respond to and resolve application-level production incidents The Person: * 5+ years in SRE, DevOps, or infrastructure engineering * Strong experience with AWS, EKS/Kubernetes, and Terraform * Familiar with Kafka and observability tools like Datadog or Grafana * Able to troubleshoot issues across infrastructure
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
Vacancy for Snr Site Reliability Engineer (SRE) at Preservica Abingdon/Remote, UK About You You have a proven track record in DevOps and software development, with a passion for creating reliable solutions to deploy software at scale and speed. You are eager to challenge the status quo, learn, and adopt new technologies. Excellent communication skills across all … Our team is small but growing, so self-motivation, organization, and the ability to multitask and prioritize are crucial. The Role Serve as a primary visionary for DevOps/Site Reliability Engineering across the entire technology organization. Eliminate process bottlenecks to enable frictionless, reliable, and high-velocity feature development through automation of Build, Test, Deploy, and Operate
We are seeking an exceptional technology leader to oversee our global Site Reliability Engineering (SRE), DevOps, and Platform Engineering teams. This hands-on engineering leadership role requires someone who can both provide technical vision and build strong stakeholder relationships across the organization. The ideal candidate will bring a combination of deep technical expertise, strategic thinking … Leadership: Serve as a hands-on technical leader who can architect, design, and guide the implementation of highly resilient systems Build a compelling vision and strategic roadmap for our SRE, DevOps, and Platform Engineering functions Establish and evangelize engineering best practices across teams and the wider organization Drive technical innovation while ensuring operational excellence Provide architectural guidance to … capabilities, and constraints Required Skills & Experience: Extensive experience in engineering leadership roles Strong hands-on technical background in cloud platforms, containerization, and modern DevOps practices Demonstrated experience leading SRE, DevOps, or Platform Engineering teams Deep understanding of system architecture, resilience patterns, and high-availability design Experience developing strategic roadmaps and executing technical vision Proven ability to build and
Luupli has been in internal testing since June 2024 and is getting ready for commercial beta testing from December 2024, with the hope of launching fully in summer 2025. Job Title: Site Reliability Platform Engineer About Luupli: Luupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be a … made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure and services, primarily hosted … Terraform, and proficiency in scripting with Python or Bash, we invite you to apply for this exciting opportunity. Role and Responsibilities: 1. Infrastructure Design and Automation: - Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform. - Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and
schools more joyful places to work, as well as learn. About the role We are looking for an enthusiastic and proactive Site Reliability Engineer to join our SRE team and help us ensure we provide world-class resilience and performance across the platform. The remit and focus of the role is to advise on all aspects of site … and backups Conduct assessments of capacity and plan for scaling to meet current and future business needs. Work closely with the Head of Platform Engineering and Head of SRE to strategize and implement scalable solutions. Work closely with the Platform team, feature teams, 2nd line support and other stakeholders to ensure a good level of service is provided … for our customers and embed SRE practices. Key player in the response and troubleshooting of incidents, ensuring rapid resolution and minimising downtime. Participate in blameless postmortems to identify root cause and corrective actions Develop and maintain playbooks and documentation About you Experience in performance monitoring and analysis Capacity planning experience Scripting and automation skills, with experience in relevant technologies. Experience
Fancy being our next SRE Superstar? Site Reliability Engineer (SRE) Sunderland (Hybrid) Full-time Alright, listen up! Here at Tombola, we're not just about bingo - we're about brilliant tech, seamless experiences, and keeping millions of players happy. And to do that, we need a Site Reliability Engineer who's as excited about rock-solid … working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with unbeatable stability. What you'll be getting up to: System Reliability & Availability Hero: You'll be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs. You'll also be … tech and better ways of doing things, constantly pushing us to improve system reliability, performance, and efficiency. Sound like a bit of you? If you're an experienced SRE with a passion for building reliable, scalable, and efficient systems, and you love working in a fun, collaborative environment, then we want to hear from you! Ready to join the
Site Reliability Engineer Fully remote work for UK Citizens based in the UK - Salary to £80k + Benefits We are looking for a Site Reliability Engineer/DevOps Engineer with a background in .NET software development and strong C# skills. The candidate should also have knowledge of DevOps tools like Kubernetes and/or Docker, and … and tune databases to ensure high availability and performance. Collaborate with product engineering teams to design and build observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, with a background in software engineering using .NET and C#. Proficiency in C# and scripting languages like Bash, Python, or PowerShell. Experience … innovation and continuous improvement. This role is part of a large program of change and improvement for a market-leading global SaaS company. If you're seeking an interesting SRE role within a forward-thinking organization, this could be a tremendous career opportunity. Please apply with your CV to find out more.
customer's systems are built and maintained. This role blends operational product support with software engineering to create applications to understand the overall health of our systems. The SRE team sits within a wider programme at the core of the customer mission. The role holder: As an SRE, fundamentally you will be doing work that has historically been done … expertise to substitute automation for human labour, with the objective of limiting traditional manual operations work (incident tickets, on-call etc.) to no more than half of the SRE team's time (and aiming for considerably less). You will have an enthusiasm to learn and experiment, to develop tools to understand application health and improve their reliability … enable them to be scalable and resilient to failure, and how to get the best out of the infrastructure they are deployed to. Participating in the wider DevOps/SRE community within the organisation. Competencies It is desirable for you to have experience in the areas below. However, more valued for this role is that you have excitement and enthusiasm
Senior Site Reliability Engineer - Reuters The Reuters Professional DevOps team is a global squad with members from over five countries. Our work reflects on which is a source of real-time, nonpartisan information on world events, trends and culture. The DevOps team takes a factory approach to infrastructure, by designing and developing repeatable cloud-native patterns and applying … Professional DevOps Team is looking for an experienced engineer, who's passionate about automation and scalability to work from our London Office. About the Role: As a Senior Site Reliability Engineer at Reuters, you will: Work with a global team, responsible for the infrastructure powering and other products Architect, diagram, document and implement highly scalable solutions for … forward until the adoption of chosen solutions reaches 100% Communicate clearly, frequently, and take pleasure in simplifying technical concepts for non-technical audiences About You: As our Senior Site Reliability Engineer, you are likely to have: Essential Skills & Experience Comfortable with various flavors of (U|L)inux and ready to discuss implementations of reg(ex|ular expressions
West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
VIQU IT Recruitment
Site Reliability Engineer with a strong focus on leadership and team management. Around 70% of this role is about building, mentoring and directing a high-performing SRE team, setting strategy and driving operational excellence. The remaining 30% will be hands-on involvement in AWS-based platforms, automation and performance tuning. Key Responsibilities Lead and develop a team … of SRE engineers, setting priorities, providing coaching and creating a culture of reliability and continuous improvement Define and own SRE strategy, standards and ways of working across the organisation Collaborate with engineering, operations and product teams to ensure seamless delivery and robust systems Oversee system reliability, availability and performance across large, business-critical platforms Provide technical guidance … GitLab, Concourse) and ensure AWS platforms meet operational best practice Produce regular reporting and communicate clearly with senior stakeholders Key Requirements Strong experience managing or leading engineering/SRE/DevOps teams in a complex environment Track record of mentoring, coaching and growing technical teams Excellent stakeholder engagement skills with the ability to influence at all levels Broad technical
Morley, Leeds, West Yorkshire, England, United Kingdom Hybrid / WFH Options
VIQU IT Recruitment
Site Reliability Engineer with a strong focus on leadership and team management. Around 70% of this role is about building, mentoring and directing a high-performing SRE team, setting strategy and driving operational excellence. The remaining 30% will be hands-on involvement in AWS-based platforms, automation and performance tuning. Key Responsibilities Lead and develop a team … of SRE engineers, setting priorities, providing coaching and creating a culture of reliability and continuous improvement Define and own SRE strategy, standards and ways of working across the organisation Collaborate with engineering, operations and product teams to ensure seamless delivery and robust systems Oversee system reliability, availability and performance across large, business-critical platforms Provide technical guidance … GitLab, Concourse) and ensure AWS platforms meet operational best practice Produce regular reporting and communicate clearly with senior stakeholders Key Requirements Strong experience managing or leading engineering/SRE/DevOps teams in a complex environment Track record of mentoring, coaching and growing technical teams Excellent stakeholder engagement skills with the ability to influence at all levels Broad technical
Lead Site Reliability Engineer (Lead SRE) Ready to keep things running smoothly? Join our tombola team! At tombola, we pride ourselves on building our own exceptional games and platforms in-house. That means keeping everything running flawlessly is paramount! We're seeking a Lead Site Reliability Engineer (SRE) to join us and help ensure our critical … systems and services are always reliable, available, and performing at their best. What will you be doing? As an SRE, you'll be instrumental in implementing automation, monitoring, and incident response strategies to minimize downtime and optimize our operations. You'll collaborate closely with our development, infrastructure, and security teams, balancing exciting new feature delivery with rock-solid system … with our broader business objectives. Collaborating with other teams and departments to achieve shared success. Partnering with our People Partner for tech to build robust team management practices. System Reliability and Availability Ensure system uptime: Monitor and maintain the availability and reliability of critical systems and services, meeting all uptime SLAs (Service Level Agreements). Incident management: Quickly
live and transferable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this global defence organisation as a Site Reliability Engineer (SRE) and help shape the future of one of the UK's most vital national security platforms. You'll be joining a growing SRE team at the heart of the customer … s mission, focused on ensuring performance, availability, and scalability, while driving continuous improvement and innovation. About the Role As an SRE, you'll combine your operational expertise with software engineering skills to minimise manual effort and drive automation across complex systems. This role is perfect for someone who thrives on solving hard problems, automating the mundane, and building intelligent … overtime. Proactively enhance system availability, performance, and resilience. Develop tools and solutions to automate repetitive tasks and reduce operational toil. Collaborate with development teams to embed best practices and SRE principles. Deploy and manage monitoring systems to provide intelligent observability. Engage with the wider DevOps/SRE community within the organisation. Ideal Skills & Experience We're more interested in your
Founded in 2001, Resident Advisor (RA) is one of the world's longest-running music media brands and a cornerstone of the dance, electronic and DJ ecosystem. The site's audience of over 6 million monthly users is drawn in by a combination of news, editorial, club listings and ticketing, RA-branded events at venues and festivals worldwide, original … films and a weekly mix series that has run for 18 years. We're looking for a Senior Site Reliability Engineer passionate about electronic music to join our Core Platform team. This role is office based (minimum 3 days/week in-office), and offers flexibility to work hybridly. You'll help scale our high-traffic infrastructure that … MSSQL databases, ElasticSearch, Redis, and Kafka running on AWS EKS (Kubernetes), managed via Terraform with CI/CD pipelines and Datadog monitoring. Your responsibilities include improving infrastructure performance and reliability, driving modernization and cost optimization, developing shared components (e.g. auth systems, GraphQL gateways), enhancing developer experience, maintaining E2E testing systems, and creating internal tooling. This is an opportunity to
A Developer possesses a unique skill set that synergises well with Site Reliability Engineering (SRE). With a strong foundation in Golang development, valuable expertise is brought to the table, enabling contributions to innovative solutions for complex monitoring, automation, and capacity management challenges. As a Site Reliability Engineer, you can shape the way this company … Development and Platform teams to optimise system performance for this industry leader! In the development of reliable and scalable systems, you are responsible for creating software by applying sound engineering principles, best practices, and leveraging technologies including your expertise in contemporary monitoring tools and programming. Experience in modern monitoring tools such as Splunk, Nagios, or … Grafana is a significant advantage! However, proficiency in programming languages such as Golang, Python, or JavaScript is essential! If you are a Golang Engineer looking to transition into the SRE world, or vice versa, this is an opportunity you won't want to miss
Wokingham, Berkshire, England, United Kingdom Hybrid / WFH Options
eTeam Inc
We are a Global Recruitment specialist that provides support to clients across EMEA, APAC, US and Canada. We have an excellent job opportunity for you. Role Title: Site Reliability Engineering - Need Active SC Clearance Location: Wokingham (Reading) | Hybrid, 60% remote and 40% onsite Duration: 27/02/2026 Rate: 402 GBP/Day (Inside IR35) Role … Implement CI/CD pipelines for seamless deployment and release management. Ensure compliance with security standards, governance policies, and regulatory requirements. Required Skills & Experience Expertise in software development and engineering for large-scale distributed systems. Strong proficiency in programming languages such as Golang, Java, or Python. Extensive experience with cloud infrastructure providers (AWS, Azure, or GCP). Deep knowledge
has helped build some of the world's largest companies. Our team in London is growing and we're looking for talented people to join us on our journey Engineering at Duffel We're building tools to simplify travel distribution, search and booking. What does this actually mean? It's one common and seamless API. This brings huge technical … experience to go with it. The tools used on the team include Elixir, Phoenix, Kubernetes and Google Cloud Platform. Site Reliability Engineering at Duffel As an SRE at Duffel, you'll be part of a small team within engineering that is responsible for the reliability, performance, and resilience of our infrastructure and applications. You will … be working closely with engineering teams to understand their needs and help meet the demands of our product as we scale globally. What we're looking for - An infrastructure and systems engineering generalist who is comfortable diving deep into the weeds on different issues. Some recent examples include: - A configuration issue between Google's Load Balancer and the
Site Reliability Engineer - Microsoft Admin (Windows Server, IIS, MS SQL Server) Team Summary The Reliability Engineer (SRE) is a member of a cross-functional Operations & Infrastructure team responsible for running our Visa Spend Clarity for Enterprises production infrastructure and ensuring the highest levels of availability, performance, and operational excellence. What a Site Reliability Engineer does … at Visa: The SRE is responsible for finding the right way to run robust applications in our environments. In this role, you will balance engineering improvements, systems operations, and contributions to strategic initiatives. You will work closely with all members of the Technology Group to improve the reliability, availability, performance, monitoring, and operations of Visa Spend Clarity for
Site Reliability Engineer Remote - APAC / Engineering The Tyk API Management platform is helping to drive the connected world and power new products and services. We're changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across the retail, finance, telecoms, healthcare, or … radical responsibility If this sounds like an environment that you believe could work for you then read on to find out more. The role: We're looking for a Site Reliability Engineer to manage, maintain, improve and provide support on our platform. You will be curious by nature, always looking for ways to improve, as we will look … we expect this role to be an advocate of continuous improvement Reliability of our new global Tyk Cloud platform Automation of operations and support Writing and maintaining documentation on SRE processes and policies Recommending and implementing ways of driving operational efficiency and driving down our cost to run, without impacting service Assisting in penetration testing for Cloud through liaising with
collaborative innovation. Our group drives competitive advantage by enhancing our consumer experiences, enabling business growth, and advancing operational excellence. The Database Reliability Engineering (DBRE) team helps elevate SRE practices as they apply to Database Management technology and services at TWDC, promoting and onboarding new technologies, solving complex problems and integrating with next generation digital platforms. Database Reliability Engineers (DBRE) use a software engineering approach to architect, design, automate, monitor, and build applications at scale. This includes operating and engineering software with close business segment alignment to deliver platforms through efficient, effective and resilient architectures. DBREs are talented engineers that are focused on improving quality through a data driven approach: instrumentation, automation, and functional …/unit testing. The Database Reliability Engineering (DBRE) team is a group of highly trained professional database engineers who build, deploy and operate database platforms in an SRE/DevOps manner. This team is responsible for operating the following platforms: MySQL, PostgreSQL, Oracle, NoSQL (MongoDB, Cassandra) and Snowflake for TWDC. These workloads are running in all major CSPs
as well as accurate health monitoring through all our clients, both new and old. The person in this role will join the Site Reliability Engineering (SRE) team. The main role of the SRE team is to facilitate the scalability of Dayshape and allow us to meet the demands of an increasing client base. What you'll … do Lead initiatives to enhance Dayshape's ability to scale our cloud platform Maintain and improve our cloud estate in Azure Improve SRE and other teams' working lives through automation of manual tasks Lead in making the deployment of Dayshape more scalable Increase our knowledge sharing of SRE across the organisation Improve the observability of Dayshape through reporting and tool … Infrastructure as Code". We are looking for someone who has a genuine enthusiasm for IaC and experience of delivering IaC frameworks that can be contributed to by other engineering teams. An understanding of the patterns and practices required to deliver reliable, high-quality results, such as unit and integration testing, CI/CD etc. Writing and debugging Python