Permanent Site Reliability Engineering Jobs

26 to 50 of 223 Permanent Site Reliability Engineering Jobs

Lead Site Reliability Engineer

Shrewsbury, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Ashland, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Berlin, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Still River, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

South Lancaster, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

West Boylston, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Worcester, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities: Design and manage Java-based microservices, Bash scripts, Redis, and high-availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Site Reliability Engineer- Lead

London, United Kingdom
McGregor Boyall Associates Limited
Site Reliability Engineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD A leading provider of financial services is seeking two Site Reliability Engineers- Leads with a solid and proven background in Azure or GCP. This position will also be based onsite in London … Will consider candidates from any of the key vendors across the Cloud- Azure, GCP, and AWS. Kubernetes & troubleshooting, managed services like AKS Using your SRE Attitude (understanding SLI, SLO & SLA) Container Image Management & Security like Aquasec Code Quality & repository Management like SonarQube & NexusQ Service Mesh (Istio) traffic shaping, canary, blue … Unit/Integration/Load Testing Azure Application Gateway & API Management Azure IAM - Identity & Access Management Azure Policy Management & Cloud Security Azure Express Route Site Reliability Engineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD McGregor Boyall is an equal opportunity employer and do not more »
Employment Type: Permanent

Lead Engineer SRE

Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Lead Engineer SRE When registering to this job board you will be redirected to the online application form. Please ensure that this is completed in full in order that your application can be reviewed. Our Engineering Team is 200 strong, from Apprentice Engineers through to Enterprise Architects, and were … you are looking for a new challenge and have a strong technical background, then we want to hear from you! As our new Lead Site Reliability Engineer, you will be key to maximising the observability of our infrastructure and applications, and to resolving error-prone manual processes through more »
Employment Type: Permanent
Salary: £55,000

Sr. Staff DevOps Engineer

Westminster, Colorado, United States
Hybrid / WFH Options
Maxar Technologies
a high-quality analytics environment and platform to enable successful data intelligence at Maxar. This position is hybrid with several days a week on-site with your colleagues in Westminster, CO. Life with Us Your Project: The Data Intelligence team owns and maintains a variety of infrastructure and services … for coding standards and software architecture. Responsible for the 'ilities': availability, scalability, maintainability, reliability, and securability of our tools and environments. We embrace SRE principles. Actively identify opportunities for improvement in our infrastructure and propose solutions to realize them. Collaborate with a team of skilled DevOps Engineers, Data Engineers … and Business Intelligence Developers. Minimum requirements for this position: Active U.S. Government security clearance Bachelor's Degree in Software Engineering, Computer Science, or related engineering field. 4 additional years of experience may be substituted for a degree. Minimum of 5 years related work experience for a Senior Software more »
Employment Type: Permanent
Salary: USD Annual

Site Reliability Engineer

South West London, London, United Kingdom
Experian Ltd
Company Description Internal Grade E/EB9 Job Description Work that matters: what you'll be doing. We're looking for a Site Reliability Engineer to join our Experian Data Quality team where you will be working … on cutting edge products within our Aperture suite (Data Studio and Data Governance). This role has aspects of both reliability engineering (SRE) and test engineering (SDET). It is ideally suited to someone looking to take on some aspects of a technical leadership role for the … test frameworks), working in collaboration with our Architects, Development teams, and DevOps specialists to use results of these tests to help prioritize and implement reliability improvements for our customers. You will work closely with our Director of Engineering, Test Automation Lead, and wider QA team to shape our more »
Employment Type: Permanent

Senior DevOps Consultant

London, United Kingdom
Stealth It Recruitment Ltd
and Retail sectors. Primary responsibilities and experience required: Solid understanding of hybrid and multi-cloud environments (AWS, GCP, Azure), DevOps, CI/CD and SRE Implementation of DevSecOps models along with necessary tooling, business change and processes. Implementing product centric operating model- Focussing on building the right product, cultivating right … waste from the system. Consulting/Coaching experience in implementing new ways of working and enabling agile delivery transformation. Enabling continuous delivery while ensuring reliability, quality, observability, and performance Understanding of build and deployment pipelines, test driven development, automated testing, Test data management, automated Environment provisioning, Version control, Monitoring … up process/systems to measure, track and take corrective actions to drive speed, productivity, and quality improvements. Security and governance models for cloud engineering and operations -Desired Driving IT and business automation across product management, engineering, and operations -Desired ITIL and service management Being comfortable in a more »
Employment Type: Permanent

Site Reliability Engineer

Edinburgh
Lloyds Banking Group
investment in digitising our Individual Annuities customer journey onto a Cloud based platform. We are seeking to recruit a Site Reliability Engineer (SRE) within the Retirement platform where your main responsibilities will be to work with our existing SRE team to ensure strong observability across our services utilizing … tools such as Dynatrace and Splunk. You will work closely with the wider team to embed SRE principles of delivering secure, robust, and reliable infrastructure and features to our customers. Helping our service teams to understand root causes of incidents. Striving to remove manual tasks (toil) through automation and the … where you'll make a difference: Influencing across all disciplines within both the business and engineering side of the business in terms of SRE principles especially in relation to increasing reliability. Whilst skills, knowledge and prior experience are meaningful to us we want people who are highly motivated, and more »
Employment Type: Permanent
Salary: £45,954 - £51,060

Site Reliability Engineer

Halifax, West Yorkshire, Yorkshire and the Humber
Lloyds Banking Group
JOB TITLE: Site Reliability Engineer - Homes Platform SALARY: £62,874 - £69,860 LOCATION(S): Halifax or Leeds HOURS: [Full-time] WORKING PATTERN: Our work style is hybrid, which involves … spending at least two days per week currently, or 40% of our time, at our Halifax or Leeds Office About this opportunity Our Cloud SRE (Site Reliability Engineering) team is looking for an experienced and passionate Engineer with strong hands-on development experience. As a Cloud SRE … Mortgages at the heart of our strategy to become the best bank for customers. The role will have accountabilities including: Delivering against Azure and SRE Public Cloud technology roadmaps Collaboratively working with other engineering teams to build, release and evolve enterprise-class solutions, that are reliable and evergreen as more »
Employment Type: Permanent
Salary: £62,874 - £69,860

Software Engineer Lead-Java/AWS

Plano, Texas, United States
USAA
advancing professional development through active participation in industry organizations, writing programming publications, pursuing educational opportunities, establishing personal networks, and participating in professional societies. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated with business activities are effectively identified, measured, monitored, and controlled in accordance with … or initiatives. 6 years of experience delivering technology solutions in all phases of the software systems and application development lifecycle. Highly proficient in software engineering languages and tools; ability to develop on multiple platforms. Knowledge and advanced experience of leading code/design reviews. Demonstrated ability to address complex … Services, REACT and Database technologies. 2+ years hands-on experience with Openshift on-prem deployments and EKS deployments in AWS into production, using strong SRE principles Up to 4 years of hands-on experience in Terraform development to facilitate auto provisioning infrastructure-as-code and using Artifactory repositories and Cloud more »
Employment Type: Permanent
Salary: USD Annual

Software Engineer Lead-Java/AWS

San Antonio, Texas, United States
USAA
advancing professional development through active participation in industry organizations, writing programming publications, pursuing educational opportunities, establishing personal networks, and participating in professional societies. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated with business activities are effectively identified, measured, monitored, and controlled in accordance with … or initiatives. 6 years of experience delivering technology solutions in all phases of the software systems and application development lifecycle. Highly proficient in software engineering languages and tools; ability to develop on multiple platforms. Knowledge and advanced experience of leading code/design reviews. Demonstrated ability to address complex … Services, REACT and Database technologies. 2+ years hands-on experience with Openshift on-prem deployments and EKS deployments in AWS into production, using strong SRE principles Up to 4 years of hands-on experience in Terraform development to facilitate auto provisioning infrastructure-as-code and using Artifactory repositories and Cloud more »
Employment Type: Permanent
Salary: USD Annual

Executive, Application Development

San Antonio, Texas, United States
USAA
degree in information technology or related field Experience in managing teams of 500+ people and annual financial budget of over $100 million Manages the Site Reliability Engineering and Production Availability practice for all the Bank Products Builds Automation and AI center of excellence catering to multiple areas … and Regulatory needs Responsible for managing architectural solutions that reduces the total cost of ownership of Technology Deep understanding of the principles of software engineering, systems engineering and automation to include experience leading these practices Experience with FinTech Domain and products Experience in maintaining the reliability of more »
Employment Type: Permanent
Salary: USD Annual

GCP Cloud Security Engineer, Senior

Phoenix, Arizona, United States
USAA
controls. This position/role is directly aligned to USAA's technology modernization and future proofing strategies across the enterprise. Conducts software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of complex production issues and troubleshooting of end-to-end solutions that span … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

IAM Separation of Duties Program - Process & Control Owner (Mid-Level)

San Antonio, Texas, United States
USAA
Come be a part of what makes us so special! The Opportunity As a dedicated Info Security Engineer I, you Conduct software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … technical solutions with a focus on security, often collaborating with Engineers or Architects within the team/department. Supports code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of production issues and troubleshooting of end-to-end solutions that span multiple … for cross functional or highly complex key technologies within a specific security domain. Drives community impact through active participation in internal training outlets. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

IAM Separation of Duties Program - Process & Control Owner (Mid-Level)

Plano, Texas, United States
USAA
Come be a part of what makes us so special! The Opportunity As a dedicated Info Security Engineer I, you Conduct software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … technical solutions with a focus on security, often collaborating with Engineers or Architects within the team/department. Supports code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of production issues and troubleshooting of end-to-end solutions that span multiple … for cross functional or highly complex key technologies within a specific security domain. Drives community impact through active participation in internal training outlets. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

IAM Separation of Duties Program, Team Lead

San Antonio, Texas, United States
USAA
be a part of what makes us so special! The Opportunity As a dedicated Info Security Engineer Lead, you will conduct software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Independently resolves complex production issues and leads troubleshooting of end-to-end solutions that span multiple … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

GCP Cloud Security Engineer, Senior

Colorado Springs, Colorado, United States
USAA
controls. This position/role is directly aligned to USAA's technology modernization and future proofing strategies across the enterprise. Conducts software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of complex production issues and troubleshooting of end-to-end solutions that span … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

GCP Cloud Security Engineer, Senior

San Antonio, Texas, United States
USAA
controls. This position/role is directly aligned to USAA's technology modernization and future proofing strategies across the enterprise. Conducts software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of complex production issues and troubleshooting of end-to-end solutions that span … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

GCP Cloud Security Engineer, Senior

Tampa, Florida, United States
USAA
controls. This position/role is directly aligned to USAA's technology modernization and future proofing strategies across the enterprise. Conducts software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of complex production issues and troubleshooting of end-to-end solutions that span … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

GCP Cloud Security Engineer, Senior

Plano, Texas, United States
USAA
controls. This position/role is directly aligned to USAA's technology modernization and future proofing strategies across the enterprise. Conducts software and systems engineering to develop new capabilities, ensuring Information Security is integrated across the enterprise. Conducts comprehensive technology research to evaluate potential vulnerabilities in Enterprise systems. Identifies … a focus on security, often collaborating with Engineers or Architects outside of team/department. Leads the team in code/design reviews and engineering efficiencies to ensure effective operations and accurate planning. Supports the resolution of complex production issues and troubleshooting of end-to-end solutions that span … community impact through active participation in internal and external training outlets, conferences, blog post, and participating in professional societies, advisory boards, and consortiums. Leverages Site Reliability Engineering practices in their domain. Ensures risks associated within their domain activities are effectively identified, measured, monitored, and controlled in accordance more »
Employment Type: Permanent
Salary: USD Annual

Site Reliability Engineering
10th Percentile: £57,000
25th Percentile: £62,338
Median: £80,000
75th Percentile: £112,500
90th Percentile: £127,000