Site Reliability Engineer Jobs

26 to 50 of 142 Site Reliability Engineer Jobs

Senior Site Reliability Engineer

Arlington, Texas, United States
Epsilon
enhances and strengthens internal tooling while evangelizing new use cases among existing internal customers and stakeholders. Fulfil the responsibilities of a DevOps and automation engineer working on cloud-native technologies. Research the collection, parsing, and analysis of infrastructure data from various devices or services while developing/enhancing tool more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Senior Site Reliability Engineer

Fort Worth, Texas, United States
Epsilon
enhances and strengthens internal tooling while evangelizing new use cases among existing internal customers and stakeholders. Fulfil the responsibilities of a DevOps and automation engineer working on cloud-native technologies. Research the collection, parsing, and analysis of infrastructure data from various devices or services while developing/enhancing tool more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineer SRE

Reigate, Surrey, South East
Hybrid / WFH Options
Client Server
Site Reliability Engineer/SRE Reigate/WFH to £85k Global FinTech is seeking a skilled Site Reliability Engineer/SRE to collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure preferred You have experience with … happy to collaborate with senior stakeholders and mentor others What's in it for you: As a Site Reliability Engineer/SRE you will receive a competitive salary plus a range of perks and benefits: Up to £85k salary plus bonus Hybrid working (3 days a week more »
Employment Type: Permanent
Salary: £75,000 - £85,000
Posted:

Principal Application Engineer (SRE)

Houston, Texas, United States
Discover Financial Services
help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career. Site Reliability Engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to IT infrastructure and operations. The main objectives are … availability, latency, performance, efficiency, change management, monitoring, emergency response and capacity planning of their services. As an Application Site Reliability Engineer (SRE) you will be part of team of people who are responsible for the availability of several of Discover's most critical applications: our PULSE network … operational goals (MTTR reduction, incident reduction, platform availability, SLO\SLA targets) Ensure the proper level of documentation exists, is maintained, and reviewed regularly Drive SRE community discussions Participate in an on call rotation Minimum Qualifications At a minimum, here's what we need from you: Bachelors - Computer Science or related more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineer- Lead

London, United Kingdom
McGregor Boyall Associates Limited
Site Reliability Engineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD A leading provider of financial services is seeking two Site Reliability Engineers- Leads with a solid and proven background in Azure or GCP. This position will also be based onsite in … Will consider candidates from any of the key vendors across the Cloud- Azure, GCP, and AWS. Kubernetes & troubleshooting, managed services like AKS Using your SRE Attitude (understanding SLI, SLO & SLA) Container Image Management & Security like Aquasec Code Quality & repository Management like SonarQube & NexusQ Service Mesh (Istio) traffic shaping, canary, blue … Unit/Integration/Load Testing Azure Application Gateway & API Management Azure IAM - Identity & Access Management Azure Policy Management & Cloud Security Azure Express Route Site Reliability Engineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD McGregor Boyall is an equal opportunity employer and do more »
Employment Type: Permanent
Posted:

Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
La Fosse Associates Ltd
La Fosse are currently partnered with a client who are looking to hire a Site Reliability Engineer into their team, on a contract that will initially run to the end of this year. This role is paying £550 a day, fully remote and inside IR35. Main Responsibilities more »
Employment Type: Contract, Work From Home
Rate: Up to £550 per day
Posted:

Senior Site Reliability Engineer

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Evri
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Senior Site Reliability Engineer to be responsible for providing the tooling, processes and support that their team requires to Reliably deploy applications to production … standards to the TDA (Technical Design Authority) Ensuring that the Service Level Objectives in your area are met Helping to develop and promote the SRE service catalogue Ensuring the best security practices are followed Supporting and developing junior members of the team Capturing the SLIs and mapping them to the more »
Employment Type: Permanent, Part Time, Work From Home
Salary: £60,000
Posted:

Senior Site Reliability Engineer

Saffron Walden, Essex, South East, United Kingdom
Hybrid / WFH Options
EMBL-EBI
The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the Institutes Transfer Services , the application and … to the varied nature of this role, it may suit an individual with experience in a hands-on systems management role, a Senior Infrastructure Engineer, or someone from a site reliability engineering background. The role will initially focus on the email systems - understanding and upgrading the infrastructure … cultural, multi-disciplinary staff, at different levels of their IT career. We are eager to welcome new talent who will join us in ensuring reliability and supporting EMBL-EBI's mission to advance scientific discovery. Your role During the first months, the role will focus on the upgrade of more »
Employment Type: Permanent, Work From Home
Salary: £55,000
Posted:

Lead Engineer SRE

Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Lead Engineer SRE When registering to this job board you will be redirected to the online application form. Please ensure that this is completed in full in order that your application can be reviewed. Our Engineering Team is 200 strong, from Apprentice Engineers through to Enterprise Architects, and were … currently in an exciting period of growth! As our new Lead Engineer, you would be key to maximising this growth through coaching in terms of technical performance, achieving technical evangelism and acting in a leadership role in terms of design review. We provide clear career ladders for each employee … you are looking for a new challenge and have a strong technical background, then we want to hear from you! As our new Lead Site Reliability Engineer , you will be key to maximising the observability of our infrastructure and applications, and to resolving error-prone manual processes more »
Employment Type: Permanent
Salary: £55,000
Posted:

Site Reliability Engineer (SRE)

London, United Kingdom
Fuel Recruitment Limited
evolution of their applications to deliver a modern, first class, cloud based platform to their users. As such we are looking for an experienced SRE to join the team to drive best Agile practices, DevOps and software development ways of working. You must have worked within the FS industry previously … strong CI/CD experience and have the ability to automate to eliminate/reduce toil. This role will require you to be on site 3 days a week and is inside IR35. more »
Employment Type: Contract
Rate: £500 - £700/day
Posted:

Senior Site Reliability Engineer

Halifax, West Yorkshire, Yorkshire and the Humber
Lloyds Banking Group
JOB TITLE: Senior Site Reliability Engineer (SRE) LOCATION: Halifax, Leeds or Manchester HOURS: Full-time WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites. Who are Lloyds Banking Group … there are some specific skills that we'd need to see: · Experience of CI/CD across various tooling and methodologies. · Experience as a SRE, Service, DevOps Engineer or in a similar software or cloud role with a focus on service management. · Strong critical thinking skills, leading live support more »
Employment Type: Permanent
Salary: £68,202 - £75,780
Posted:

Principal Application Engineer (SRE)

Illinois, United States
Discover Financial Services
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal Site Reliability Engineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Principal Application Engineer (SRE)

Chicago, Illinois, United States
Discover Financial Services
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal Site Reliability Engineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Site Reliability Engineering Manager

Nottingham, Nottinghamshire, East Midlands, United Kingdom
Experian Ltd
age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity. Job Description As a Site Reliability Engineering Manager, you will lead a global team of talented SREs in the development, deployment, and continuous improvement of our Cyber Threat … data storage and compute budget, ensuring effective allocation of resources through management of the data lifecycle. Qualifications This role requires a great deal of SRE technical and managerial skills in a large enterprise environment, such as: A great background in theSRE field supporting a Cyber Threat Detection function, with demonstrable more »
Employment Type: Permanent
Posted:

Lead Site Reliability Engineer

Hopkinton, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Berlin, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Southborough, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Harvard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Maynard, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Sudbury, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Hudson, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Boylston, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Woodville, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Ashland, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Site Reliability Engineer

Stow, Massachusetts, United States
BJ's Wholesale Club
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
Employment Type: Permanent
Salary: USD Annual
Posted:
Site Reliability Engineer
10th Percentile
£56,250
25th Percentile
£68,750
Median
£97,500
75th Percentile
£118,750
90th Percentile
£143,750