Site Reliability Engineer Jobs

1 to 25 of 142 Site Reliability Engineer Jobs

Site Reliability Engineer - SRE

Hampshire, South East, United Kingdom
Proactive Appointments Limited
Site Reliability Engineer - SRE One of our biggest customers based in the Financial Services sector is looking for an experienced Site Reliability Engineer - SRE to join them as they look to create a newly appointed team. Site Reliability Engineer: We have … an exciting brand-new opportunity to join a dynamic IT Team as a Site Reliability Engineer. We are looking for an expert in this field who has extensive experience and knowledge in managing APM tools such … as Dynatrace and has demonstrable experience (at least 3 years) as a Site Reliability Engineer. The Site Reliability Engineer (SRE) will take ownership of the observability suite, leveraging deep DevOps skills and experience to proactively enhance the performance and stability of APIs and applications. This more »
Employment Type: Permanent
Salary: £65,000
Posted:

Site Reliability Engineer SRE

Reigate, Surrey, South East
Hybrid / WFH Options
Client Server
Site Reliability Engineer/SRE Reigate/WFH to £85k Global FinTech is seeking a skilled Site Reliability Engineer/SRE to collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure preferred You have experience with … happy to collaborate with senior stakeholders and mentor others What's in it for you: As a Site Reliability Engineer/SRE you will receive a competitive salary plus a range of perks and benefits: Up to £85k salary plus bonus Hybrid working (3 days a week more »
Employment Type: Permanent
Salary: £75,000 - £85,000
Posted:

SRE / Site Reliability Engineer Azure - FinTech

Reigate, Surrey, South East
Hybrid / WFH Options
Client Server
SRE/Site Reliability Engineer (Azure PagerDuty DataDog) Reigate/WFH to £85k Do you have expertise with SRE within an Azure cloud environment? You could be progressing your career in an impactful role at a global FinTech. As an SRE/Site Reliability Engineer … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure You have experience with observability … keen to take ownership of projects and happy to collaborate with senior stakeholders and mentor others What's in it for you: As a SRE/Site Reliability Engineer you will receive a competitive salary plus a range of perks and benefits: Up to £95k salary plus more »
Employment Type: Permanent
Salary: £75,000 - £85,000
Posted:

Site Reliability Engineer SRE - Dev Background

City of London, London, United Kingdom
Hybrid / WFH Options
Tec Partners
Job Title: Site Reliability Engineer (Software Dev Background) Type: Permanent Location: Fully remote Salary: £55-65K Our client are growing their team and are looking for a Site Reliability Engineer - (ideally from a software development/software engineering background) to contribute to the … our cloud infrastructure, help with the definition of best practices for infrastructure management and to support development processes with CI/CD. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and security of their systems. Responsibilities: Design, build … functional teams to identify opportunities for automation, streamlining workflows, and improving efficiency. Stay up-to-date with the latest trends and technologies in the SRE/DevOps space, making recommendations for adoption based on industry best practices. Requirements: Proven experience as a Site Reliability Engineer SRE or more »
Employment Type: Permanent
Salary: £55000 - £65000/annum
Posted:

Principal Site Reliability Engineer (SRE)

London (city), London, England
Hybrid / WFH Options
T Rowe Price
invite you to explore the opportunity to join us and grow your career with us. Job Title: Principal Site Reliability Engineer (SRE) Department: CDO Technology Group Summary: We are seeking a highly motivated and experienced Principal Site Reliability Engineer (SRE) to join the CDO … Technology leadership team to stand up and lead the SRE function within CDO Technology. In this role, you will be responsible for ensuring the availability, latency, performance, efficiency, and stability of our critical infrastructure, which supports a range of data platforms, applications, and services. You will collaborate closely with development … infrastructure, and anticipate significant risks. Work with development teams to review architecture design to ensure high availability and proper disaster recovery strategy Collaborate with reliability and infrastructure engineering team in T Rowe Price to build synergy in tooling for the implementation of observability, tracing, and alerting Qualifications: Bachelor's more »
Employment Type: Permanent
Salary: Competitive
Posted:

Site Reliability Engineer

United Kingdom
Hybrid / WFH Options
developrec
Cloud Infrastructure Site Reliability Engineer (SRE) £55,000 - £65,000 Fully remote Due to the nature of the position candidates must be eligible and willing to undergo Security Clearance My client are a household name and global organisation who deliver innovative, digitally enabled solutions to transform, simplify … and support their customers. They are recruiting for a Sitereliability engineer to support their customers using their public cloud infrastructure. Job Description: The Cloud Infrastructure Site Reliability Engineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers. We will have a high customer focus being actively involved more »
Posted:

Cloud Operations Site Reliability Engineer

England, United Kingdom
Loftware
About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud Operations Site Reliability Engineer will be … troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations Site Reliability Engineer is someone that is a team player with the desire and passion for modern technology and keen to take on … large-scale responsibility for the cloud environment. The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to more »
Posted:

Site Reliability Engineer

Greater London, England, United Kingdom
Humankind Global Recruitment
Site Reliability Engineer London (Hybrid 2 days a week on site) Permanent £75,000 - £85,000 p/a The Background We are partnered with an innovative IT consultancy based in London but with a global presence who are leading advisors in their industry by creating … lasting value for their clients. Due to growth within the business they are looking for a highly skilled Systems Engineer to join their Corporate IT Team and focus on the Applications side of their IT offering. This is an exciting opportunity for someone with a passion for technology to … flexible benefits fund. You… In order to be a successful Site Reliability Engineer you will have… Previous experience working as an SRE/at system administrator level In-depth knowledge of Windows Operating Systems and VMware with a good understanding of Linux Operating Systems In depth knowledge more »
Posted:

Site Reliability Engineer

Edinburgh, Scotland, United Kingdom
McFall Recruitment Limited
Site Reliability Engineer (SRE) Are you ready to shape the future of a cutting-edge platform? We're looking for a Site Reliability Engineer (SRE) to join our squad dedicated to ensuring our foundation is scalable and robust. Role can be based in either … scale, optimize performance, and ensure efficient maintenance. SLO/SLA Concepts: Implement and manage Service Level Objectives and Agreements to guarantee our platform's reliability and performance. Infrastructure Management: Use Terraform to manage infrastructure and deployments, ensuring everything runs smoothly and efficiently. CI/CD Proficiency: Work with a more »
Posted:

Lead Site Reliability Engineer

Leeds, England, United Kingdom
Fruition IT
Lead Site Reliability Engineer Leeds - once a month in the office on average £80,000-£90,000 + benefits A leading global organisation are seeking a Lead Site Reliability Engineer to play a pivotal role in the development, implementation … and ongoing maintenance of its core Infrastructure and Cloud-based platforms. This role encompasses diverse responsibilities, including leading and managing a small DevOps/SRE team. The Lead Site Reliability Engineer will lead the charge in selecting, configuring, and supporting Cloud Platform components and tooling. Proficiency in more »
Posted:

DevOps Engineer/SRE

London Area, United Kingdom
Alexander Ash Consulting
Site Reliability Engineer … Global Quantitative Investment Management Permanent/Contract - London, UK - Competitive We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join a leading quantitative research and technology firm specializing in leveraging innovative data science and cutting-edge technology to deliver unparalleled insights and solutions. … You will be working at the intersection of technology and finance ensuring the reliability, availability, performance, and cost-efficiency of their critical systems and infrastructure. You will work closely with development, operations, and research teams to build and maintain robust, scalable systems using AWS, Terraform, Ansible, and Kubernetes. Key more »
Posted:

Site Reliability Engineer

Edinburgh
Lloyds Banking Group
in digitising our Individual Annuities customer journey onto a Cloud based platform. We are seeking to recruit a Site Reliability Engineer (SRE) within the Retirement platform where your main responsibilities will be to work with our existing SRE team to ensure strong observability across our services utilizing … tools such as Dynatrace and Splunk. You will work closely with the wider team to embed SRE principles of delivering secure, robust, and reliable infrastructure and features to our customers. Helping our service teams to understand root causes of incidents. Striving to remove manual tasks (toil) through automation and the … s where you'll make a difference: Influencing across all disciplines within both the business and engineering side of the business in terms of SRE principles especially in relation to increasing reliability. Whilst skills, knowledge and prior experience are meaningful to us we want people who are highly motivated, and more »
Employment Type: Permanent
Salary: £45,954 - £51,060
Posted:

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Bayside Solutions
Site Reliability Engineer Contract Salary Range: £91,400 - £108,000 per year Location: London, England - Hybrid Role Job Summary: We seek a Site Reliability Engineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and … Willingness to adapt and learn new tools and technologies as needed Availability to participate in on-call rotations as required Desired Skills and Experience Site Reliability, Java, AWS, Azure, Kubernetes, GIT, CD Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside more »
Posted:

Lead Site Reliability Engineer

Boylston, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hopkinton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Westborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boxborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Maynard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Sudbury, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hudson, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Wayland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Marlborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Southborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Stow, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Harvard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:
Site Reliability Engineer
10th Percentile
£56,250
25th Percentile
£68,750
Median
£97,500
75th Percentile
£118,750
90th Percentile
£143,750