Site Reliability Engineer Job Vacancies

76 to 100 of 144 Site Reliability Engineer Jobs

Operations Site Reliability Engineer

London, United Kingdom
eBay Inc
team of passionate thinkers, innovators, and dreamers - and help us connect people and build communities to create economic opportunity for all. About the team and the role: As a Site Reliability Engineer at eBay, you'll play a key role in managing major incidents and the overall health of our services, making sure they are both resilient … and high-performing. You'll create strategies for availability and reliability, enhance domain ecosystem observability, and support a shift toward a more engineering-focused culture. Your contributions will ensure that eBay's technology remains cutting-edge and reliable for our global community. What you will accomplish: Proactive Monitoring : Continuously monitor the health of eBay's critical services to identify … and address potential issues before they escalate. Solution Development : Collaborate with Architecture, Engineering, and Operations teams to develop solutions that ensure high site availability, reliability and performance. Collaborative Problem Solving : Work closely with partner teams to resolve recurring technical issues, onboard new alerts, and develop high-quality Standard Operating Procedures (SOPs). Automation and Process Enhancement : Identify and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineering Manager II, Site Reliability Engineering

Dublin, Ireland
Google Inc
distributed systems. Ability to debug, optimize code, and to automate routine tasks. Systematic problem-solving approach, coupled with effective communication skills. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services-both our internally critical and our externally-visible … systems-have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex … challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take More ❯
Employment Type: Permanent
Salary: EUR Annual
Posted:

Senior DevOps Engineer/SRE - Systems Integrator

London, United Kingdom
Hybrid / WFH Options
Hamilton Barnes Associates Limited
Are you an experienced Senior DevOps/Site Reliability Engineer looking for your next contract role? Join one of the world's leading IT services, consulting, and business solutions organization. Founded in 1968, the company consistently ranks among the top global IT service providers. With a presence in over 50 countries, the company has built a reputation … across industries including banking, healthcare, telecommunications, and retail. The leading consultancy firm has partnered with a global technology leader and they are currently seeking an experienced Senior DevOps/Site Reliability Engineer to join the team. Additionally, this role provides a hybrid working arrangement based in London. Ready to make a move? Get in touch and apply More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer, Compute Germany, Netherlands, United Kingdom

London, United Kingdom
Hybrid / WFH Options
vercel.com
looking for experienced SREs help grow our small team into a global footprint that can provide expert engagement across our core serving systems. As an early member of the SRE team you will report directly to the Director of Managed Infrastructure and play a foundational role in expanding our SRE practice, integrating reliability principles more deeply into Vercel's … Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management. About You: At least 3 years experience in an SRE role, or at least 5 years experience in an adjacent role (e.g. platform engineering), operating in a scaled environment. Firm grasp of the SRE philosophy and mindset, with practical experience … working on or directly with SRE teams that have proactively engaged in system design and improvement. Strong sense of accountability and commitment to problem solving, backed by a curiosity to dig deep and identify root causes. Willingness to proactively engage with development teams to influence the course of software design and operational practices. Capability to manage risk, make decisions, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer (Python)

Watford, Hertfordshire, United Kingdom
La Fosse Associates
Site Reliability Engineer £70,000 pa Hertfordshire My client, a leading entertainment group, is looking for a mid-level SRE to join their platform team in their Hertfordshire office. In this role, you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/Grafana More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer (Python)

Watford, Hertfordshire, South East, United Kingdom
La Fosse
Site Reliability Engineer (Python) £70,000 pa Hertfordshire My client, a leading entertainment group, are looking for a mid level SRE to join their platform team in their Hertfordshire office. In the role you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/ More ❯
Employment Type: Permanent
Salary: £60,000
Posted:

Site Reliability Engineer with Security Clearance

Honolulu, Hawaii, United States
Hybrid / WFH Options
OMW Consulting
Role - Site Reliability Engineer Location - Honolulu - Hybrid - 1-2 days a week on site Security … clearance - Minimum Secret - need this ahead of applying Salary - $150k-$200k + Equity I am partnered with a leading defense tech scale up who are looking to add an SRE to their team based in Hawaii. This role is hybrid with an expectation of 1-2 days on site in Honolulu, however there is some weeks where you will … not need to go on site at all. Due to the nature of the client you must hold an active secret clearance as a minimum ahead of applying for this position. To be considered for this position you must have experience with the following: Experience with Security Clearance and DoD IT Environment: You hold an active security clearance, are More ❯
Employment Type: Permanent
Salary: USD 200,000 Annual
Posted:

DevOps / SRE Engineer

England, United Kingdom
Devopshunt
level. Being a part of this team will accelerate your career. Take a closer look at the role: Job Description: We have an opportunity for a talented DevOps/SRE Engineer to join the TWG Cadillac Formula 1 Team as part of the Event IT Team. In your role as a DevOps/SRE Engineer, you will be … at the forefront of developing our technological advantage by maintaining the reliability, scalability, and performance of our cloud and on-premises infrastructure. You will collaborate with software engineers, data scientists, and race strategists to streamline application deployments, monitor system performance, and troubleshoot advanced operational issues. Your work will directly impact the team's race performance by ensuring smooth data … Pipelines: Build and maintain CI/CD pipelines for rapid deployment and software updates. Monitoring & Alerting: Utilize advanced monitoring tools for proactive system health checks and automated incident alerts. Site Reliability: Improve system reliability through incident management, root cause analysis, and capacity planning. Security & Compliance: Follow security best practices, including access control, vulnerability management, and adherence to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
ZILO
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … If you're ready to shape the future, let's talk. About the Role We're looking for a Senior Site Reliability Engineer to join our SRE team. This is a hybrid role that blends deep platform engineering with application-level troubleshooting . You'll be responsible for the stability, performance, and resilience of our cloud-native … service code Resolve incidents and support root causes (Java and GoLang services) Contribute to postmortems and reliability engineering initiatives Who You Are Essential Experience 5+ years in an SRE, DevOps, or infrastructure role Deep hands-on experience with AWS , EKS/Kubernetes , and Terraform Working knowledge of Kafka tuning, monitoring, and operational troubleshooting Strong familiarity to be able to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Global Processing Services
Engineer to act as a North Star for this evolving discipline. As our first engineer in this role, you'll have the unique opportunity to shape our SRE strategy, establish best practices, and set the standard for service reliability and performance. What You'll Do Define strategies for Application Performance Monitoring, Unit Cost, and Chaos Engineering. Continuously … so product teams can innovate effectively. Playing a key role in shaping the core technology layers that drive our platform's success. What You Need Proven experience implementing SRE principles at scale, including deep knowledge of SLI/SLO/SLA differences. A product engineering background with strong coding skills in Python, C#, or similar. Experience with incident management … PCI compliance). Background in capacity planning, performance, and load testing. Sysadmin skills for troubleshooting disk, network, and infrastructure issues. Why Join Thredd? The chance to define and lead SRE best practices from the ground up. A high-impact role in a rapidly growing company. A collaborative, innovation-driven culture where your expertise will shape our platform's future. If More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Orgvue Limited
and future states of the organisation and make faster, more informed decisions. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney. Role: Principal Site Reliability Engineer You will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure. You will collaborate across product, platform, and … expertise, excellent communication skills, and a collaborative spirit. Responsibilities: Define and enforce SLOs, SLIs, and error budgets across critical services Develop and implement cloud infrastructure and tooling strategies Enhance SRE practices across the organization Implement robust observability metrics, logs, and traces using our observability tools Guide the team in building automated, self-healing systems Own and evolve incident response processes … security, DevOps, and software teams to ensure compliance and operational excellence Evaluate and adopt tools and practices to improve platform performance and reliability Desired Skills & Experience: Experience leading SRE transformations Hands-on expertise with Kubernetes (EKS preferred) in production Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.) Proficiency in Infrastructure as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer with Security Clearance

San Diego, California, United States
Elite Government Strategy
Site Reliability Engineer Key Responsibilities: This position will primarily focus on providing design and implementation expertise on infrastructure provisioning, management and lifecycle implementation of cloud components and services, containers and other critical concepts of DevSecOps principles. Increase platform reliability through automation, health checks, and resilient rollout patterns. Build and deploy health checks, auto-scaling, and self … healing components. Implement advanced deployment strategies (blue/green, canary). Automate rollbacks and recovery paths in CI/CD pipelines. Integrate reliability testing into dev workflows. Required Skills: Kubernetes (probes, readiness strategies). CI/CD pipelines (GitHub Actions, ArgoCD). Automation via Helm, Terraform. Experience with rollout strategies and traffic shifting. Clearance: Secret More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineer with Security Clearance

Tampa, Florida, United States
OMW Consulting
Site Reliability Engineer Salary $140k-$200k + Equity Secret Clearance or higher is required My client, a VC-backed organization in the defense tech space, is looking to hire multiple SREs as they build out their DevOps team across the USA. My client has created a modern product which is streamlining processes and saving time in critical … rest of the skills and experience needed for this position are listed below: Secret Clearance or higher Experience working within the DOD cloud environment 4 Years+ Experience as a SRE Experience in creating CI/CD Pipelines Strong knowledge of Kubernetes Experience with either Ironbank, Cloud One, Platform one Risk management Framework security experience Experience working with AWS If you More ❯
Employment Type: Permanent
Salary: USD 200,000 Annual
Posted:

Site Reliability Engineer with Security Clearance

Saint Louis, Missouri, United States
OMW Consulting
Site Reliability Engineer Salary $140k-$200k + Equity Secret Clearance or higher is required My client, a VC-backed organization in the defense tech space, is looking to hire multiple SREs as they build out their DevOps team across the USA. My client has created a modern product which is streamlining processes and saving time in critical … rest of the skills and experience needed for this position are listed below: Secret Clearance or higher Experience working within the DOD cloud environment 4 Years+ Experience as a SRE Experience in creating CI/CD Pipelines Strong knowledge of Kubernetes Experience with either Ironbank, Cloud One, Platform one Risk management Framework security experience Experience working with AWS If you More ❯
Employment Type: Permanent
Salary: USD 200,000 Annual
Posted:

Site Reliability Engineer with Security Clearance

San Diego, California, United States
OMW Consulting
Site Reliability Engineer Salary $140k-$200k + Equity Secret Clearance or higher is required My client, a VC-backed organization in the defense tech space, is looking to hire multiple SREs as they build out their DevOps team across the USA. My client has created a modern product which is streamlining processes and saving time in critical … rest of the skills and experience needed for this position are listed below: Secret Clearance or higher Experience working within the DOD cloud environment 4 Years+ Experience as a SRE Experience in creating CI/CD Pipelines Strong knowledge of Kubernetes Experience with either Ironbank, Cloud One, Platform one Risk management Framework security experience Experience working with AWS If you More ❯
Employment Type: Permanent
Salary: USD 200,000 Annual
Posted:

Site Reliability Engineer

London, United Kingdom
LinuxRecruit
Has anyone actually ever given you a good description of what SRE is? Recently I've met dozens of companies implementing an SRE function. Half are just rebranding an ops team (because Ops ain't cool), some don't want to call the additional silo they have created 'DevOps' (because apparently that's the wrong thing to do) so they … re calling it SRE and the rest actually don't really know how to describe what they're doing. And if you can't describe it simply, you don't know what it is, chief (because Google do it, isn't the right answer). That was until today, when I met a company who actually white boarded their vision … process rather than the build. We discussed Kubernetes, Prometheus and API Gateways. Most importantly, they spoke like they knew what the hell they were on about. Not just about SRE, but on the whole Engineering process. This is a company with at the top of their game, who are about to introduce a brand new monitisation model to a web More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Zefr
globe. What you'll do: As a Site Reliability Engineer at Zefr, you'll apply your expertise in cloud infrastructure, CI/CD, Observability, and core SRE concepts, to deliver high-quality, reliable, and scalable solutions. A significant aspect of this role involves working closely with Zefr's Engineering and Data Science teams ensuring the infrastructure required More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Nordcloud group
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Azure Site Reliability Engineer

Wokingham, Berkshire, United Kingdom
Hybrid / WFH Options
Nordcloud
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies … solving We encourage you to apply , even if you don't meet all of the requirements. We value your growth potential and enthusiasm! This role is required to on site in Wokingham twice a week, please do not apply if this is not possible for you. What we offer: Individual training budget and exam fees for certifications Flexible working More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - London

London, United Kingdom
Hybrid / WFH Options
Valarian Technologies Limited
software, platforms, and infrastructure. The Role Join us as a Site Reliability Engineer and help us build the future of data sovereignty! We're seeking an SRE passionate about creating high-performance, scalable, and reliable services for our production infrastructure. You'll have a direct impact, improving existing systems and developing innovative solutions to complex challenges. Our … implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the ACRA platform for high availability and fault tolerance. This includes ensuring resilience against Cloud Availability Zone outages and the ability to gracefully handle node failures. Guarantee 99.9% uptime … capacity planning, and optimization of resource utilization. Collaborate closely with the product engineering team to influence the design and implementation of new products and features, ensuring they meet our reliability and scalability standards from the outset. Preferred Qualifications Bachelor's degree (or equivalent) in Computer Science or a related field; relevant practical experience will also be considered Proficiency with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - Automotive

England, United Kingdom
Hamilton Barnes Associates Limited
Hamilton Barnes is currently representing a major vehicle manufacturer that is actively seeking a Site Reliability Engineer for an initial 6-month contract with the possibility of extension. This position has on site commitments 2/3 Days Per Week in Gaydon. If you are interested in learning more we encourage you to apply today! Responsibilities More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineering Manager

United Kingdom
General Motors
Site Reliability Engineering Manager page is loaded Site Reliability Engineering Manager Apply remote type Remote locations Remote - United Kingdom time type Full time posted on Posted Yesterday job requisition id JR- Job Description As an SRE Engineering Manager, you will be expected to not only lead your team in setting priorities and ensuring alignment with organizational goals but also to be deeply technical. We expect our … details, solve problems hands-on, and support your team's technical decisions is crucial. You'll be a mentor, guide, and a partner, helping engineers grow, and ensuring the reliability and efficiency of the systems they are working on. We believe in setting a high bar for engineering managers who can lead by example in both technical expertise and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Disney Cruise Line - The Walt Disney Company
TechOps, Quality & Systems Engineering (TQSE) team within Technology & Digital for Disney Experiences, working closely with World Wide business, Global Information Security (GIS), and application teams across the company. The Site Reliability Engineer will report to the Manager, Technology (TQSE). About the Role & Team: At Disney, storytelling is at the heart of everything we do-and in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
TransFICC
flexible remoteworking locations within UK/Europe) Employment type: Permanent Working Hours: Full time (9-6 UK) Salary: Up to £110K + Shares + Benefits TransFICC is hiring a Site Reliability Engineer to provide high-performance services to our customers. We develop an integration service … product that enables our clients to have a flexible, hosted service without requiring their internal resources to respond to connectivity challenges across trading venues. You will be joining our SRE team and contributing to TransFICC's automation culture. We are a multi-disciplinary team covering everything from desktop and laptop support to data centre provisioning of servers and vendor network … automated, so having experience with a software automation tool like Ansible and coding ability is a must. We are looking for someone experienced as a sys admin or network engineer; however, you must have a reasonable understanding of both. Constructive, open-minded and self-motivated. A belief in life learning, and an awareness of how much there still is More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - Ai Platform

Berlin, Germany
N26 GmbH
About the opportunity We are seeking a Site Reliability Engineer to join the Platform Engineering domain in the AI Platform team. The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build 'the bank the world loves to use.' The AI Platform team contributes to this mission by creating More ❯
Employment Type: Permanent
Salary: EUR Annual
Posted:
Site Reliability Engineer
10th Percentile
£52,500
25th Percentile
£63,630
Median
£70,000
75th Percentile
£85,000
90th Percentile
£99,500