Engineer - Site Reliability Engineering (Location: USA-St. Louis-795 Office Pkwy; Time type: Full time; Posted 11 days ago; Job requisition ID: R) Our Team: We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer … in Site Reliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at LSEG. Main responsibilities: We are looking for people with a passion to learn, and who bring a continuous improvement mentality to our team! SREs maintain Service … core of our team's purpose. Write automation to scale systems sustainably, prevent service issues, or when they occur, quickly recover service. Partner with development teams to improve system reliability, observability, and release velocity. Participate in on-call rotations, incident response, postmortems, and root cause analysis and resolution. Be a vocal advocate of strong/sound engineering practices …
… they are already renowned as having game-changing technology within their industry, with exciting scope for expansion into further industries. This role is for a Graduate or experienced SRE professional to work within the SRE team responsible for incident response and issue resolution. Location: Cambridge. Salary: £32,000 - £70,000 per annum + excellent benefits including private healthcare (could … be more available for an experienced SRE). Requirements for Site Reliability Engineer - Graduate Considered: Excellent academics including 2.1 or 1st class honours degree from a leading international university in a STEM subject. A minimum of AAB at A-Level or international equivalent if applying at Graduate level. Any experience working in an incident response or technical support environment would … the knowledge this role will not lead to a role in the R&D/Software teams. Responsibilities for Site Reliability Engineer - Graduate Considered: Working within the SRE team you will be responsible for the architecture of a mission-critical cloud platform for an industry-leading software company. You will diagnose issues within complex systems, identify root causes …
Senior Site Reliability Engineer (Remote type: Remote; Location: Remote - United Kingdom; Time type: Full time; Posted yesterday; Job requisition ID: JR-) Job Description: The … rapid adoption of advanced software in vehicles marks a new era for automakers and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated team focused on enhancing the reliability, efficiency, and scalability of our distributed systems. We leverage engineering principles to manage operations … and systems engineering skills to keep our services resilient, robust, and scalable. This is a hands-on Individual Contributor (IC) position. As an SRE IC, you will focus on enhancing the reliability, efficiency, and performance of our services. You'll work closely with other engineers to develop automated solutions, respond to incidents, and …
Site Reliability Engineering Manager (Remote type: Remote; Location: Remote - United Kingdom; Time type: Full time; Posted yesterday; Job requisition ID: JR-) Job Description: As an SRE Engineering Manager, you will be expected to not only lead your team in setting priorities and ensuring alignment with organizational goals but also to be deeply technical. We expect … details, solve problems hands-on, and support your team's technical decisions is crucial. You'll be a mentor, guide, and a partner, helping engineers grow, and ensuring the reliability and efficiency of the systems they are working on. We believe in setting a high bar for engineering managers who can lead by example in both technical expertise …
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Capital One (Europe) plc
Nottingham, Nottinghamshire. Senior Software Development Engineer - Site Reliability. About the Role: We're looking for a Senior Engineer to join our Site Reliability Engineering (SRE) team. This role is ideal for a skilled Java engineer with a passion for understanding how complex systems work, analysing performance, and applying engineering solutions to make them more … efficient, stable, and scalable. You'll lead on planning and implementing key SRE initiatives, optimise and automate how our systems operate, and improve observability through better monitoring and logging. You'll also work closely with your peers to drive consistency and high standards across SRE and the wider engineering community, so a real enthusiasm for influencing others and leading … to reduce operational overheads through observability and service automation. Drive engineering best practice (e.g., Operational Excellence, Security, Quality, Resilience etc.) and set standards across the team and wider SRE community. Innovate within your team and contribute within your technical domain. Deliver key pieces of intent from inception through to design and hands-on delivery, in collaboration with your SREM. …
Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
Site Reliability Engineer (SRE) - Market leading company - Milton Keynes (Tech stack: .NET, C#, ASP.NET Core, SQL Server, PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes) Help shape the tech future of a UK market leader! Backed by a major financial institution with soaring profits, my client is modernising platforms, embracing AI, and driving automation at scale. … We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and resilience. Tech stack you'll work with: .NET, C#, ASP.NET Core, SQL Server … PowerShell, Azure CLI, Bash, Azure DevOps, Jenkins, GitHub Actions, Docker, Kubernetes. We want to hear from you if: As a Site Reliability Engineer (SRE) you've delivered scalable systems using .NET, C#, and ASP.NET Core, with real-world experience managing production workloads. You've automated operations using PowerShell, Azure CLI, and Bash to reduce toil and boost efficiency …
Prestigious opportunity with a Global Investment Giant for a Site Reliability Engineering (SRE) Manager to be based in our Manchester HQ, leading a talented team of engineers dedicated to maintaining and enhancing the reliability of our systems. Working closely with cross-functional teams across the globe, including business stakeholders, product managers, and software engineers, you will ensure … role has an opportunity to provide strategic guidance on improvements. At the forefront of providing production support services including incident logging, incident resolution, problem management, change management practices, and SRE support, we are inviting you to join our success story. As our Site Reliability Engineering Manager you will: Lead, coach, and develop a high-performing SRE team. Foster … for incident response, root cause analysis, and post-mortem reviews to prevent future incidents. Work closely with business and technology teams to understand their needs and ensure alignment with reliability and uptime goals. Facilitate communication and collaboration across global teams. Drive the development and adoption of automation tools to improve efficiency and reduce manual intervention. Establish and maintain comprehensive …
A leading global financial institution is seeking a Principal Site Reliability Engineer to provide essential support for their Foreign Exchange (FX) desk, focusing on trading and risk applications, including an advanced algorithmic ultra-low latency stack. … This is a unique opportunity to play a pivotal role in ensuring the reliability, performance, and scalability of a real-time trading environment by applying best-in-class SRE principles. You will work directly with senior traders and developers on the trading floor, optimising workflows, troubleshooting complex issues, and driving ongoing improvements across both processes and technology. What you …
Site Reliability Engineer - Data Infrastructure, AD/ADAS. London/Product & Technology - AD/ADAS/Employee/hybrid. Woven by Toyota is enabling Toyota's once-in-a-century transformation into a mobility company. Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current state of mobility through human … automotive software development. The right candidate will have excellent communication skills, solid coding skills, expertise in building scalable, reliable, highly available and fault-tolerant systems, broad knowledge of software engineering and site reliability engineering in areas such as Large-Scale Data and Compute Infrastructure, Stream Processing, Kubernetes, High-Performance Networking, Observability and Infrastructure Automation. RESPONSIBILITIES: Set … maintain, optimize and support large scale, multi-region, multi-cloud compute and storage infrastructure powering our data platform and mission critical services. Work with fellow Data Infrastructure engineers and Site Reliability engineers to ensure our systems are scalable, reliable, fault-tolerant, highly available, highly performant, and observable. Manage incidents, triage product or system issues and debug/track …
… are seeking a foundational member for the Cloud Infrastructure team at Writer. This role involves contributing to the development and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities … ensure cost efficiency. Ensure the security and compliance of our systems, adhering to industry standards and regulations. Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement. Stay current with emerging technologies and industry trends to improve our site reliability practices. Is this you? Proven expertise in Site Reliability Engineering with at least 7 years of hands-on experience. Deep understanding of system architecture and infrastructure design for high availability and performance. Bachelor's degree in Computer Science, Engineering, or a related field. Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring. Experience with cloud platforms like AWS, Azure, or …
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
VIQU IT
Lead Site Reliability Engineer. Hybrid/Remote – once-a-month requirement in Leeds. Up to £80,000 per annum plus car allowance plus bonus. VIQU have partnered with a leading company within the supply chain industry who are seeking a Lead Site Reliability Engineer (AWS) to join and mentor their growing team. This position will lead … the organisation's cloud infrastructure. This role is mostly remote, with monthly travel required to Leeds. Responsibilities of the Lead Site Reliability Engineer: Lead a team of four SREs, helping to maintain the stability of cloud platforms. Take on hands-on technical responsibilities within AWS, utilising a range of cloud technologies (CI/CD, Container Orchestration, IaaS, Scripting … the Lead Site Reliability Engineer: Must have at least a year's experience in managing technical teams, and over five years of experience in a hands-on, technical SRE/DevOps Engineer role. Experience with CI/CD tools (Jenkins and Concourse CI ideally). Must hold experience within AWS and hold relevant AWS certifications (SA1, DOP-C02 …
Senior Site Reliability Engineer (Remote type: Remote Job: Remote; Location: GBR-London-5 Canada Square; Time type: Full time; Posted yesterday; Job requisition ID: JREQ190781) Senior Site Reliability Engineer - Reuters. The Reuters Professional DevOps team is a global squad with members from over five … Professional DevOps Team is looking for an experienced engineer who's passionate about automation and scalability to work from our London Office. About the Role: As a Senior Site Reliability Engineer at Reuters, you will: Work with a global team, responsible for the infrastructure powering and other products. Architect, diagram, document and implement highly scalable solutions for … forward until the adoption of chosen solutions reaches 100%. Communicate clearly, frequently, and take pleasure in simplifying technical concepts for non-technical audiences. About You: As our Senior Site Reliability Engineer, you are likely to have: Essential Skills & Experience: Comfortable with various flavors of (U|L)inux and ready to discuss implementations of reg(ex|ular expressions …
Site Reliability Engineer (SRE) Manager - Apple Services Engineering. London, England, United Kingdom. Software and Services. Description: Apple Service Engineering (ASE)'s Compute team is seeking a highly motivated individual with strong technical and communication skills to join us on our quest to build and enhance massive clusters hosting Virtual Machines, Containers and associated infrastructure that can … engage with the upstream community to drive Apple's requirements. Ultimately, you will help build the platform that delivers our applications at scale to our end users. As a Compute Site Reliability Engineering manager, you will be leading a team responsible for providing the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow … for new applications and services to flourish. Minimum Qualifications: Extensive Leadership in Cloud Computing: In-depth experience building and leading high-performing engineering teams, with a deep focus on cloud computing and hands-on experience across public and/or private cloud environments. Large-Scale Infrastructure Management: Proven ability to manage enterprise services in large-scale *nix environments and …
… to £95,000 + Bonus + Shares. Watford (Hybrid). Method Resourcing are proud to be partnering with a fast-growing, international technology business delivering critical services across multiple high-reliability sectors. They're seeking a Head of Delivery Enablement who can … ensure cohesive, end-to-end delivery across architecture, DevOps, quality assurance, and project delivery. Role Overview: Acting as the Technical Product Owner for Site Reliability Engineering (SRE), you'll manage the technical backlog to balance future strategic initiatives with feedback from engineering teams. You will guide DevOps engineers through the full delivery lifecycle, lead the development … strategic work, align on tooling, and drive improvements in observability, automation, and testing. Ideal Experience & Skills: Demonstrated technical leadership across diverse skillsets, including Site Reliability Engineering (SRE), DevOps, and Quality Assurance (QA). Proven track record of aligning and integrating cross-functional technical teams and complex systems. Strong stakeholder management skills with the ability to influence decisions and …
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
… we're looking for a Site Reliability & Platform Engineer to help lead the way. You'll sit at the heart of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure … enablement, to help development teams ship faster, safer, and more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms; applying SRE principles like SLOs, observability, and incident management to drive service reliability; building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows; enabling teams through platform tools, reusable Terraform modules … This is a great opportunity for someone passionate about building robust infrastructure and enabling others to move faster and more securely. You might come from a cloud engineering, SRE, or DevOps background - what matters most is your curiosity, systems thinking, and drive to improve operational efficiency. At Sorted, we are committed to fostering an inclusive environment where people from …
Has anyone actually ever given you a good description of what SRE is? Recently I've met dozens of companies implementing an SRE function. Half are just rebranding an ops team (because Ops ain't cool), some don't want to call the additional silo they have created 'DevOps' (because apparently that's the wrong thing to do) so they … re calling it SRE and the rest actually don't really know how to describe what they're doing. And if you can't describe it simply, you don't know what it is, chief (because "Google do it" isn't the right answer). That was until today, when I met a company who actually whiteboarded their vision … process rather than the build. We discussed Kubernetes, Prometheus and API Gateways. Most importantly, they spoke like they knew what the hell they were on about. Not just about SRE, but about the whole Engineering process. This is a company at the top of their game, who are about to introduce a brand new monetisation model to a …
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Senior Site Reliability Engineer. London - Hybrid. £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension. Excellent opportunity for a Site Reliability Engineer to join a forward-thinking and high-growth technology company offering a hybrid work environment, great benefits, and opportunities for further progression! This company operates … performance. With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal candidate … and conduct chaos engineering experiments. Monitor and maintain Kafka clusters for performance and reliability. Respond to and resolve application-level production incidents. The Person: 5+ years in SRE, DevOps, or infrastructure engineering. Strong experience with AWS, EKS/Kubernetes, and Terraform. Familiar with Kafka and observability tools like Datadog or Grafana. Able to troubleshoot issues across infrastructure …
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
Senior Site Reliability Engineer. London - Hybrid. £80,000 - £90,000 + 38 Days Holiday + Private Healthcare + Life Assurance + Flexible Working + Pension. Excellent opportunity for a Site Reliability Engineer to join a forward-thinking and high-growth technology company offering a hybrid work environment, great benefits, and opportunities for further progression! This company … With a strong culture rooted in integrity, creativity, and technical excellence, they've become a trusted partner across global industries. In this role you'll take ownership of platform reliability, resilience engineering, and incident management across cutting-edge cloud infrastructure. You'll play a key role in ensuring uptime, performance, and continuous improvement of core systems. The ideal … and conduct chaos engineering experiments. Monitor and maintain Kafka clusters for performance and reliability. Respond to and resolve application-level production incidents. The Person: 5+ years in SRE, DevOps, or infrastructure engineering. Strong experience with AWS, EKS/Kubernetes, and Terraform. Familiar with Kafka and observability tools like Datadog or Grafana. Able to troubleshoot issues across infrastructure …
Employment Type: Permanent
Salary: £80,000 - £90,000/annum, 38 Days Holiday, Healthcare, Pension
Vacancy for Snr Site Reliability Engineer (SRE) at Preservica. Abingdon/Remote, UK. About You: You have a proven track record in DevOps and software development, with a passion for creating reliable solutions to deploy software at scale and speed. You are eager to challenge the status quo, learn, and adopt new technologies. Excellent communication skills across all … Our team is small but growing, so self-motivation, organization, and the ability to multitask and prioritize are crucial. The Role: Serve as a primary visionary for DevOps/Site Reliability Engineering across the entire technology organization. Eliminate process bottlenecks to enable frictionless, reliable, and high-velocity feature development through automation of Build, Test, Deploy, and Operate …
We are seeking an exceptional technology leader to oversee our global site reliability engineering (SRE), DevOps, and Platform Engineering teams. This hands-on engineering leadership role requires someone who can both provide technical vision and build strong stakeholder relationships across the organization. The ideal candidate will bring a combination of deep technical expertise, strategic thinking … Leadership: Serve as a hands-on technical leader who can architect, design, and guide the implementation of highly resilient systems. Build a compelling vision and strategic roadmap for our SRE, DevOps, and Platform Engineering functions. Establish and evangelize engineering best practices across teams and the wider organization. Drive technical innovation while ensuring operational excellence. Provide architectural guidance to … capabilities, and constraints. Required Skills & Experience: Extensive experience in engineering leadership roles. Strong hands-on technical background in cloud platforms, containerization, and modern DevOps practices. Demonstrated experience leading SRE, DevOps, or Platform Engineering teams. Deep understanding of system architecture, resilience patterns, and high-availability design. Experience developing strategic roadmaps and executing technical vision. Proven ability to build and …
Luupli has been in internal testing since June 2024 and is getting ready for commercial beta testing from December 2024, with the hope of launching fully in summer 2025. Job Title: Site Reliability Platform Engineer. About Luupli: Luupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be a … made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description: We are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure and services, primarily hosted … Terraform, and proficiency in scripting with Python or Bash, we invite you to apply for this exciting opportunity. Role and Responsibilities: 1. Infrastructure Design and Automation: - Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform. - Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and …
… schools more joyful places to work, as well as learn. About the role: We are looking for an enthusiastic and proactive Site Reliability Engineer to join our SRE team and help us ensure we provide world-class resilience and performance across the platform. The remit and focus of the role is to advise on all aspects of site … and backups. Conduct assessments of capacity and plan for scaling to meet current and future business needs. Work closely with the Head of Platform Engineering and Head of SRE to strategize and implement scalable solutions. Work closely with the Platform team, feature teams, 2nd line support and other stakeholders to ensure a good level of service is provided … for our customers and embed SRE practices. Be a key player in the response to and troubleshooting of incidents, ensuring rapid resolution and minimising downtime. Participate in blameless postmortems to identify root causes and corrective actions. Develop and maintain playbooks and documentation. About you: Experience in performance monitoring and analysis. Capacity planning experience. Scripting and automation skills, with experience in relevant technologies. Experience …
Fancy being our next SRE Superstar? Site Reliability Engineer (SRE). Sunderland (Hybrid). Full-time. Alright, listen up! Here at Tombola, we're not just about bingo - we're about brilliant tech, seamless experiences, and keeping millions of players happy. And to do that, we need a Site Reliability Engineer who's as excited about rock-solid … working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with unbeatable stability. What you'll be getting up to: System Reliability & Availability Hero: You'll be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs. You'll also be … tech and better ways of doing things, constantly pushing us to improve system reliability, performance, and efficiency. Sound like a bit of you? If you're an experienced SRE with a passion for building reliable, scalable, and efficient systems, and you love working in a fun, collaborative environment, then we want to hear from you! Ready to join the …