Site Reliability Engineer Jobs

51 to 75 of 142 Site Reliability Engineer Jobs

Lead Site Reliability Engineer

Upton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Ashland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Maynard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Woodville, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Westborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boxborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boylston, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Berlin, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Harvard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Marlborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hudson, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Wayland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

West Boylston, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

South Lancaster, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Still River, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Worcester, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineer (SRE)

Belfast, Northern Ireland, United Kingdom
Hybrid / WFH Options
Ocho
Ocho are delighted to be working exclusively on a Site Reliability Engineer (SRE) role, with one of our key European clients who specialise in cyber security. We have a proven track record placing exceptional candidates into this company, and now they trust us on the below… We … re seeking an experienced SRE to shape the Site Reliability Group and fortify the global network. Key responsibilities include: Implementing SRE principles and establishing data-driven metrics Conducting preplanning assessments and collaborating to resolve issues Bringing at least 3 years of experience in cloud/web/CDN more »
Posted:

Site Reliability Engineer

London Area, United Kingdom
Hybrid / WFH Options
Salt
Site Reliability Engineer/SRE/London/Hybrid Remote My client do amazing things with data. The consider themselves as experts in all things consumer and location, bringing together cutting-edge analytical techniques, creative thinking and diverse perspectives to drive growth for their client base. They … highly regarded, innovative datasets in the market and their people are the best at manipulating that data to provide insight. Working as the DevOps engineer, you will play a critical role in the development, deployment, and management of software infrastructure. You will collaborate closely with cross-functional teams to … will have: Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience). 4+ years of experience in a DevOps, SRE or similar role. Proficiency in scripting languages such as Python, Bash, or PowerShell. Hands-on experience with CI/CD tools such as GitLab CI. more »
Posted:

Site Reliability Engineer

Manchester Area, United Kingdom
Fairmont Recruitment
Company | Health and Fitness 📏 Size | 400 🧢 Role | Senior Site Reliability Engineer 🪜 Level | Senior ✨Skills | K8's, Terraform, Honeycomb, AWS 📍 Based | Manchester City Centre 💻 Hybrid | 2 days a week in-office 💰 Offer | up to £70k + Benefits A Scale-up Tech for good business based in Manchester City … Centre is looking an experienced Site Reliabiliy Engineer to assist with the growing demand for their services. If you're an advocate for monitoring and observability practices who enjoys working closely with product teams to ensure systems are secure, scalable and reliable then this could be the perfect more »
Posted:

Senior Site Reliability Engineer

United Kingdom
Hybrid / WFH Options
THINKalpha
Location: 100% Remote. The working timezone is EU/GMT. ThinkAlpha is looking for a Senior Site Reliability Engineer to work in the core infrastructure team supporting our data analytics platform and transactional trading engine. Our team provides solutions for real-time analytics, financial search, data integration … real-time analytics, ETL processes, backtesting trading strategies, live trading, natural language processing, and our platform/user interface. In your role as an SRE you will focus on scalability and reliability from the ground up. You will help build and shape how everything runs at THINKalpha and be … our IaC codebase by creating and maintaining Terraform and Ansible modules, and participate in the review process for the IaC developed by the other SRE engineers. Help developers with their needs when it comes to infrastructure updates and accounts management Support our CICD infrastructure and be familiar enough with the more »
Posted:

Site Reliability Engineer - Windows

London Area, United Kingdom
Mondrian Alpha
infrastructure site reliability engineer who primarily has experience in windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the funds systems meaning you'll have direct impact on the success of the company. You can also expect … a broad range of exposure and responsibilities from scaling large volumes of research related data to improving the reliability and speed of the application estate. Primarily we're looking for strong experience in windows infrastructure engineering, storage, kubernetes and python/powershell automation. Any additional experience around Prometheus/ more »
Posted:

Site Reliability Engineer

Greater London, England, United Kingdom
L&G Recruitment
SRE Engineer should have knowledge of alerting and monitoring tools The tools can be Splunk, Log DNA, Grafana, AWS Cloud Watch Should have knowledge of CI/CD tools. The tools can be Team City, Jenkins, IBM Tool Chain etc Should have knowledge of APM and observability tools. The more »
Posted:

Site Reliability Engineering Team Leader

Stoke-On-Trent, England, United Kingdom
Hybrid / WFH Options
bet365
Who we are looking for A Site Reliability Engineering Team Leader, who will help facilitate and drive activity and efforts of the team to deliver effective technical solutions to operational problems. The Site Reliability team works with several sections across the business, ensuring that our critical more »
Posted:

Senior Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
DRAGOONIS TECHNOLOGIES LIMITED
Reference : BH-298c Job Role: Senior Site Reliability Engineer Job Type: Contract IR35 : Inside IR35 Day Rate: £600/Day Contract Duration: 6 months Working Hours: 5 days per week Remote Working : 4 days remote working. 1 day on-site in London Location: Hybrid Remote/… London (UK only) Role Overview: Were looking for a Senior Site Reliability Engineer with deep Google Cloud (GCP) experience, to join our customers organisation. Responsibilities Influencing Service Level Objectives, Non-Functional Requirements, and infrastructure requirements Ensuring that the Service Level Objectives in the dev teams are met … Root Cause Analysis) Maintain existing compliance and governance standards established in the business Key Experience: Deep understanding of Google Cloud (GCP) Deep understanding of SRE ethos and principles Vast amounts of Terraform experience Solid experience with Python Solid experience of Observability tooling. Good experience in dashboard creation/data visualisation more »
Employment Type: Contract, Work From Home
Rate: £600 per day
Posted:

Site Reliability Engineer

Edinburgh
Lloyds Banking Group
in digitising our Individual Annuities customer journey onto a Cloud based platform. We are seeking to recruit a Site Reliability Engineer (SRE) within the Retirement platform where your main responsibilities will be to work with our existing SRE team to ensure strong observability across our services utilizing … tools such as Dynatrace and Splunk. You will work closely with the wider team to embed SRE principles of delivering secure, robust, and reliable infrastructure and features to our customers. Helping our service teams to understand root causes of incidents. Striving to remove manual tasks (toil) through automation and the … s where you'll make a difference: Influencing across all disciplines within both the business and engineering side of the business in terms of SRE principles especially in relation to increasing reliability. Whilst skills, knowledge and prior experience are meaningful to us we want people who are highly motivated, and more »
Employment Type: Permanent
Salary: £45,954 - £51,060
Posted:
Site Reliability Engineer
10th Percentile
£56,250
25th Percentile
£68,750
Median
£97,500
75th Percentile
£118,750
90th Percentile
£143,750