Remote Site Reliability Engineer Job Vacancies

26 to 50 of 54 Remote Site Reliability Engineer Jobs

Lead Site Reliability Engineer

Columbia, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Florissant, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD 90 Hourly
Posted:

Lead Site Reliability Engineer

Jefferson City, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Saint Louis, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

St. Louis, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD 90 Hourly
Posted:

Lead Site Reliability Engineer

Kansas City, Missouri, United States
Hybrid / WFH Options
Centene
organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitoring and observability dashboards within Splunk, Dynatrace, and other monitoring and … alerting platforms. This role requires advanced proficiency in PowerShell scripting and Graph APIs, as well as intermediate proficiency in Power Apps/Automate. This role will ensure the reliability, performance, and scalability of our Microsoft 365 environment. Leads team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents … visibility. Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Leads post incident reviews and documents findings for future informed decision making Drives implementation of approved proposals to optimize Software More ❯
Employment Type: Permanent
Salary: USD 90 Hourly
Posted:

Senior DevOps Engineer/SRE - Systems Integrator

London, United Kingdom
Hybrid / WFH Options
Hamilton Barnes Associates Limited
Are you an experienced Senior DevOps/Site Reliability Engineer looking for your next contract role? Join one of the world's leading IT services, consulting, and business solutions organization. Founded in 1968, the company consistently ranks among the top global IT service providers. With a presence in over 50 countries, the company has built a reputation … across industries including banking, healthcare, telecommunications, and retail. The leading consultancy firm has partnered with a global technology leader and they are currently seeking an experienced Senior DevOps/Site Reliability Engineer to join the team. Additionally, this role provides a hybrid working arrangement based in London. Ready to make a move? Get in touch and apply More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer, Compute Germany, Netherlands, United Kingdom

London, United Kingdom
Hybrid / WFH Options
vercel.com
looking for experienced SREs help grow our small team into a global footprint that can provide expert engagement across our core serving systems. As an early member of the SRE team you will report directly to the Director of Managed Infrastructure and play a foundational role in expanding our SRE practice, integrating reliability principles more deeply into Vercel's … Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management. About You: At least 3 years experience in an SRE role, or at least 5 years experience in an adjacent role (e.g. platform engineering), operating in a scaled environment. Firm grasp of the SRE philosophy and mindset, with practical experience … working on or directly with SRE teams that have proactively engaged in system design and improvement. Strong sense of accountability and commitment to problem solving, backed by a curiosity to dig deep and identify root causes. Willingness to proactively engage with development teams to influence the course of software design and operational practices. Capability to manage risk, make decisions, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer with Security Clearance

Honolulu, Hawaii, United States
Hybrid / WFH Options
OMW Consulting
Role - Site Reliability Engineer Location - Honolulu - Hybrid - 1-2 days a week on site Security … clearance - Minimum Secret - need this ahead of applying Salary - $150k-$200k + Equity I am partnered with a leading defense tech scale up who are looking to add an SRE to their team based in Hawaii. This role is hybrid with an expectation of 1-2 days on site in Honolulu, however there is some weeks where you will … not need to go on site at all. Due to the nature of the client you must hold an active secret clearance as a minimum ahead of applying for this position. To be considered for this position you must have experience with the following: Experience with Security Clearance and DoD IT Environment: You hold an active security clearance, are More ❯
Employment Type: Permanent
Salary: USD 200,000 Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
ZILO
sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious … If you're ready to shape the future, let's talk. About the Role We're looking for a Senior Site Reliability Engineer to join our SRE team. This is a hybrid role that blends deep platform engineering with application-level troubleshooting . You'll be responsible for the stability, performance, and resilience of our cloud-native … service code Resolve incidents and support root causes (Java and GoLang services) Contribute to postmortems and reliability engineering initiatives Who You Are Essential Experience 5+ years in an SRE, DevOps, or infrastructure role Deep hands-on experience with AWS , EKS/Kubernetes , and Terraform Working knowledge of Kafka tuning, monitoring, and operational troubleshooting Strong familiarity to be able to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Orgvue Limited
and future states of the organisation and make faster, more informed decisions. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney. Role: Principal Site Reliability Engineer You will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure. You will collaborate across product, platform, and … expertise, excellent communication skills, and a collaborative spirit. Responsibilities: Define and enforce SLOs, SLIs, and error budgets across critical services Develop and implement cloud infrastructure and tooling strategies Enhance SRE practices across the organization Implement robust observability metrics, logs, and traces using our observability tools Guide the team in building automated, self-healing systems Own and evolve incident response processes … security, DevOps, and software teams to ensure compliance and operational excellence Evaluate and adopt tools and practices to improve platform performance and reliability Desired Skills & Experience: Experience leading SRE transformations Hands-on expertise with Kubernetes (EKS preferred) in production Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.) Proficiency in Infrastructure as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Zefr
globe. What you'll do: As a Site Reliability Engineer at Zefr, you'll apply your expertise in cloud infrastructure, CI/CD, Observability, and core SRE concepts, to deliver high-quality, reliable, and scalable solutions. A significant aspect of this role involves working closely with Zefr's Engineering and Data Science teams ensuring the infrastructure required More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Nordcloud group
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Azure Site Reliability Engineer

Wokingham, Berkshire, United Kingdom
Hybrid / WFH Options
Nordcloud
European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success. Currently, we are looking for a Senior Azure Site Reliability Engineer to join our team in the UK. Your daily responsibilities: Architect, implement, and improve existing monitoring and alerting systems Proactively investigate and identify performance anomalies … solving We encourage you to apply , even if you don't meet all of the requirements. We value your growth potential and enthusiasm! This role is required to on site in Wokingham twice a week, please do not apply if this is not possible for you. What we offer: Individual training budget and exam fees for certifications Flexible working More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer - London

London, United Kingdom
Hybrid / WFH Options
Valarian Technologies Limited
software, platforms, and infrastructure. The Role Join us as a Site Reliability Engineer and help us build the future of data sovereignty! We're seeking an SRE passionate about creating high-performance, scalable, and reliable services for our production infrastructure. You'll have a direct impact, improving existing systems and developing innovative solutions to complex challenges. Our … implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the ACRA platform for high availability and fault tolerance. This includes ensuring resilience against Cloud Availability Zone outages and the ability to gracefully handle node failures. Guarantee 99.9% uptime … capacity planning, and optimization of resource utilization. Collaborate closely with the product engineering team to influence the design and implementation of new products and features, ensuring they meet our reliability and scalability standards from the outset. Preferred Qualifications Bachelor's degree (or equivalent) in Computer Science or a related field; relevant practical experience will also be considered Proficiency with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
TransFICC
flexible remoteworking locations within UK/Europe) Employment type: Permanent Working Hours: Full time (9-6 UK) Salary: Up to £110K + Shares + Benefits TransFICC is hiring a Site Reliability Engineer to provide high-performance services to our customers. We develop an integration service … product that enables our clients to have a flexible, hosted service without requiring their internal resources to respond to connectivity challenges across trading venues. You will be joining our SRE team and contributing to TransFICC's automation culture. We are a multi-disciplinary team covering everything from desktop and laptop support to data centre provisioning of servers and vendor network … automated, so having experience with a software automation tool like Ansible and coding ability is a must. We are looking for someone experienced as a sys admin or network engineer; however, you must have a reasonable understanding of both. Constructive, open-minded and self-motivated. A belief in life learning, and an awareness of how much there still is More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer (SRE) - C13 - London

London, United Kingdom
Hybrid / WFH Options
citi.com
We are seeking an exceptional technology leader to oversee our global s ite reliability engineering ( SRE), DevOps, and Platform Engineering teams. This hands-on engineering leadership role requires someone who can both provide technical vision and build strong stakeholder relationships across the organization. The ideal candidate will bring a combination of deep technical expertise, strategic thinking, and people leadership … Leadership: Serve as a hands-on technical leader who can architect, design, and guide the implementation of highly resilient systems Build a compelling vision and strategic roadmap for our SRE, DevOps, and Platform Engineering functions Establish and evangelize engineering best practices across teams and the wider organization Drive technical innovation while ensuring operational excellence Provide architectural guidance to ensure systems … initiatives, capabilities, and constraints Required Skills & Experience: Extensive experience in engineering leadership roles Strong hands-on technical background in cloud platforms, containerization, and modern DevOps practices Demonstrated experience leading SRE, DevOps, or Platform Engineering teams Deep understanding of system architecture, resilience patterns, and high-availability design Experience developing strategic roadmaps and executing technical vision Proven ability to build and maintain More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Remote Senior Site Reliability Engineer Manager (Remote)

Cambourne, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Remotestar
to gemstone supplies They have a presence in London, Hong Kong, Amsterdam, and as well in Mumbai and now in New York in 2001. About the role : As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution along with team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer (SRE) - Front-end/React Specialist

London, United Kingdom
Hybrid / WFH Options
ZILO
impact. We value continuous learning, personal growth, and providing our team with resources to succeed. Ready to shape the future? Let's talk. We're looking for a seasoned SRE with a front-end focus, expert in React applications, to join our SRE team. In this role, you'll ensure the reliability, performance, and operability of our React-based … invalidation, HTTP caching headers) to reduce latency and origin load. Collaborate with UX teams to balance feature richness with performance targets. Collaboration & Knowledge Sharing Serve as the React/SRE subject-matter expert: mentor engineers on best practices for building resilient front-ends. Produce and maintain runbooks, debugging guides, and incident-playbooks specific to client-side failures. Partner closely with … wider backend SRE, DevOps, and product teams to ensure end-to-end reliability. Enhanced leave - 38 days inclusive of 8 UK Public Holidays. Private Health Care including family cover. Life Assurance - 5x salary. Flexible working - work from home and/or in our London Office. Employee Assistance Program. Company Pension (Salary Sacrifice options available). Access to training and development. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer - Networking

United Kingdom
Hybrid / WFH Options
Lambda Inc
top AI computing platform. We equip engineers with the tools to deploy AI that is fast, secure, affordable, and built to scale. Whether they need powerhouse GPU hardware on-site or the flexibility of cloud-based solutions, we've got the horsepower to make it happen. Lambda's AI Cloud has been adopted by the world's leading companies … performance through the use of network engineering and other applicable technologies Help with deploying and maintaining network monitoring and management tools You Have 5+ years of experience being SWE, SRE or Network Reliability Engineering Been part of the implementation of production-scale networking projects Experience being on-call and incident response management Have experience building and maintaining Software Defined More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Azure Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Experian Group
We are seeking a skilled Azure Cloud DevOps Engineer to join our team. The ideal candidate will have a strong background in DevOps practices, cloud solutions, and network engineering in Microsoft Azure. This role involves maintaining and developing a cloud environment that hosts mission critical financial services applications used across Australia and New Zealand. This role is pivotal for … in Computer Science, Information Technology, or a related field. At least one of the below certifications: Microsoft Certified: Azure Administrator Associate Microsoft Certified: Azure Developer Associate Microsoft Certified: DevOps Engineer Expert Microsoft Certified: Azure Network Engineer Associate Cisco Certified Network Associate (CCNA) Additional Information What We Offer Hybrid work model 20 days of annual leave Comprehensive medical and … countries, FORTUNE Best Companies to work and Glassdoor Best Places to Work (globally 4.4 Stars) to name a few. Check out Experian Life on social or our Careers Site to understand why. Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is a critical part of Experian's DNA and practices, and our diverse workforce More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Wokingham, England, United Kingdom
Hybrid / WFH Options
eTeam
We are a Global Recruitment specialist that provides support to the clients across EMEA, APAC, US and Canada. We have an excellent job opportunity for you. Role Title: Principal SRE Location: Wokingham (Reading). Hybrid, 60% remote and 40% onsite Duration: Until 30/01/2026 Rate: £580 per day Inside IR35 through an Umbrella Company C ontractor Must … Hold Active SC Clearance Role Description: Key Responsibilities: Lead and drive platform-first initiatives to improve scalability, reliability, and performance. Design, build, and maintain resilient infrastructure supporting distributed systems. Implement monitoring and alerting systems to ensure high availability and performance. Collaborate with engineering teams to enhance system reliability and mitigate risks. Develop and maintain CI/CD pipelines More ❯
Posted:

Site Reliability Engineer

Leeds, Yorkshire, United Kingdom
Hybrid / WFH Options
William Hill PLC
bets per second, accommodate 20 million users, and process 160 terabytes a day. You can be sure there are many challenges waiting for you. The Leeds-based, highly skilled SRE team are primarily managing the Kubernetes clusters within the organisation for multiple departments, and through a DevOps culture enabling those departments with observability and pipelines for their business applications. Their … job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end-users early, often, and fast. We are also open to candidates that come from a Software Engineering background - As long as you show the willingness to learn, we are more than happy to invest the time … Storage Platforms, developing any necessary integration Supporting Incidents - Assist Incident Management in Production all the way through impact assessment, service restoration and post-mortems, including being part of the SRE on call rotation Sharing Knowledge - Enabling development teams within the DevOps Culture, promoting best practice, documenting runbooks, presenting talks, working with production engineering teams Who we are looking for: We More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineering- Need Active SC Clearance

Wokingham, Berkshire, England, United Kingdom
Hybrid / WFH Options
eTeam Inc
We are a Global Recruitment specialist that provides support to the clients across EMEA, APAC, US and Canada. We have an excellent job opportunity for you. Role Title: Site Reliability Engineering- Need Active SC Clearance Location: Wokingham (Reading) | Hybrid, 60% remote and 40% onsite Duration: 27/02/2026 Rate:402GBP/Day(Inside IR35) Role Description More ❯
Employment Type: Contractor
Rate: £400 - £402 per day
Posted:

Senior DevOps and SRE Engineer

Cambridge, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Arm Limited
enable innovation across the business. To support that mission, we're growing our Data Engineering Platform team and investing deeply in modern, reliable infrastructure. We're seeking a DevOps engineer with hands-on expertise in containerisation, orchestration, cloud platforms, continuous-delivery pipelines, and cloud at scale. In this role, you'll partner with the team to develop new functionality … cloud deployments (AWS-first) using Terraform and platform tooling Improve security posture across IAM, secrets, and networking Help the team ship faster and safer by mentoring on DevOps and SRE practices We're solving for reliability, compliance, performance, and speed - at once. You'll be key to making it work. Required Skills: Knowledge of one or more programming languages … highly leveraged platform, enabling hundreds of engineers to use critical data systems with confidence. You'll have ownership, impact, and a seat at the table as we define how SRE and platform thinking shape our next-generation data infrastructure. If you're looking to scale not just systems but the capabilities of the engineers around you, this is your team. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Site Reliability Engineer
10th Percentile
£52,500
25th Percentile
£63,630
Median
£70,000
75th Percentile
£85,000
90th Percentile
£99,500