Permanent Site Reliability Engineer Jobs

1 to 25 of 113 Permanent Site Reliability Engineer Jobs

Senior Site Reliability Engineer

Halifax, West Yorkshire, Yorkshire and the Humber
Lloyds Banking Group
JOB TITLE: Senior Site Reliability Engineer (SRE) LOCATION: Halifax, Leeds or Manchester HOURS: Full-time WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites. Who are Lloyds Banking Group … there are some specific skills that we'd need to see: · Experience of CI/CD across various tooling and methodologies. · Experience as a SRE, Service, DevOps Engineer or in a similar software or cloud role with a focus on service management. · Strong critical thinking skills, leading live support more »
Employment Type: Permanent
Salary: £68,202 - £75,780
Posted:

Principal Application Engineer (SRE)

Illinois, United States
Discover Financial Services
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal Site Reliability Engineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Principal Application Engineer (SRE)

Chicago, Illinois, United States
Discover Financial Services
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal Site Reliability Engineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Senior Manager Cloud Engineering (SRE / Cloud Data)

Illinois, United States
Discover Financial Services
skilling, tooling and allocating Chapter members to best support the needs of Product and Value Stream Engineering teams using Site Reliability Engineering (SRE) practices, along with a disciplined approach to professional development of Chapter engineers. This role coaches the team to continuously improve their process and observability practices … technical implementation within own technical domain Ensure consistency of technical execution and knowledge across Products, sharing common practices and challenges within the engineering domain Engineer solutions for special projects as needed Internal Organization and engineering Leadership Develop own engineering Chapter into a highly technically competent, consistent, thoughtful and customer more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineering Manager

Nottingham, Nottinghamshire, East Midlands, United Kingdom
Experian Ltd
age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity. Job Description As a Site Reliability Engineering Manager, you will lead a global team of talented SREs in the development, deployment, and continuous improvement of our Cyber Threat … data storage and compute budget, ensuring effective allocation of resources through management of the data lifecycle. Qualifications This role requires a great deal of SRE technical and managerial skills in a large enterprise environment, such as: A great background in theSRE field supporting a Cyber Threat Detection function, with demonstrable more »
Employment Type: Permanent
Posted:

Senior Site Reliability Engineer

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Evri
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Senior Site Reliability Engineer to be responsible for providing the tooling, processes and support that their team requires to Reliably deploy applications to production … standards to the TDA (Technical Design Authority) Ensuring that the Service Level Objectives in your area are met Helping to develop and promote the SRE service catalogue Ensuring the best security practices are followed Supporting and developing junior members of the team Capturing the SLIs and mapping them to the more »
Employment Type: Permanent, Part Time, Work From Home
Salary: £60,000
Posted:

Lead Site Reliability Engineer

Upton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Milford, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Fayville, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Berlin, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Westborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boylston, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hopkinton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boxborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Ashland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Stow, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Harvard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Wayland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Southborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Woodville, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Sudbury, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Marlborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Maynard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hudson, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Bolton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:
Site Reliability Engineer
10th Percentile
£56,250
25th Percentile
£68,750
Median
£97,500
75th Percentile
£118,750
90th Percentile
£143,750