Permanent Site Reliability Engineering Jobs

1 to 25 of 183 Permanent Site Reliability Engineering Jobs

SRE Lead

London, United Kingdom
LinuxRecruit
itself in many different ways, especially in the world of technology. In this instance, I want to discuss Technical Leadership; in this role as SRE Lead, your focus will be less about people management and more about architectural input. Working closely with the Head of Platform, you will help define … the technical roadmap, as well as outlining the overall strategy for the organisation's SRE function. You will also remain very much hands-on. In terms of cloud-related infrastructure, your focus will be on GCP where you will also leverage a containerised tech environment running Kubernetes (GKE). If … environments then this could be the one for you. If you're interested then use the button below to apply (no CV required). Specialism: SRE. Job type: Permanent. Location: London, UK Remote. Salary: 80,000 - 95,000 per annum. Nick Swann, Senior Consultant more »
Salary: £ 80 K

Technical Product Manager - Security

London, United Kingdom
Wise
compromising reliability and security. Our internal platform covers the whole software development life cycle from cloud infrastructure to local development, CI/CD, SRE, Security or Machine Learning. The Security Squad is part of the Platform Tribe and offers a comprehensive suite of products that make sure we manage our … for a Technical Product Manager to drive the evolution of our Security Squad. You will lead the overall product vision and strategy, overseeing multiple engineering teams and working closely with your product lead and the Chief Information Security Officer. In this role you will be embedded into a wider … company-wide principles, and then our teams set their own guidelines. What will you be working on? Your mission is to partner with the Security Squad engineering teams to build world-class products for our internal users. Identify and unlock opportunities across the security domain while aligning your plans with the overall more »
Salary: £ 80 K

Digital Service Manager

Guildford, Surrey, United Kingdom
AXA Group
serving digital products to customers and internal users. Demonstrated application of best practices to assess service performance through frameworks such as ITIL, DevOps, SAFe and Site Reliability Engineering. Experience implementing reporting and dashboarding capabilities using PowerBI, ServiceNow, Azure DevOps and Confluence. Understanding of agile/digital software development more »
Salary: £ 70 K

Cloud Operations Engineer, FedRamp

Lincoln, Lincolnshire, United Kingdom
Hybrid / WFH Options
MongoDB
innovation and creativity. MongoDB Atlas is the premier multi-cloud database-as-a-service built and operated by the makers of MongoDB. The Cloud Operations Engineering team at MongoDB is a worldwide team responsible for the consistent operational success of every MongoDB Atlas customer. As a Cloud Operations Engineer, you … global team of Cloud Operations Engineers who are tasked with ensuring our uptime guarantees to our Atlas customer base. Help scale the worldwide Cloud Operations Engineering team with the strategic implementation and refinement of new processes and tools. Assist in scoping, designing and deploying systems that reduce Mean Time to Resolve … monitoring or through reactive alerts via our Technical Services team). Work First Shift: 7am - 4pm EST. Requirements: 2+ years experience with being an on call DevOps, SRE, or Cloud Operations engineer. Expertise with Linux system administration, configuration, troubleshooting. Experience in monitoring, system performance data collection and analysis, and reporting. Knowledge of database operations and more »
Salary: £ 80 K

SRE Team Lead

London, United Kingdom
LinuxRecruit
to microservices, scalability, SLIs and SLOs. The development team is in the process of breaking down an old monolithic application into microservices. Implementing proper SRE principles and practices became essential to ensure that things did not get broken in the transformation. Now, although established, it is time to take that … Azure and Gloo mesh. Other tools in play include but aren’t limited to ArgoCD, Terraform and API gateway. If you’re passionate about SRE and ensuring that it is properly implemented then this is an opportunity to shape a team and culture. You can adapt the current SLOs and … options there too, maximising your potential with a path to the top. On top of this you will have actual ownership of a growing SRE function in a large scale organisation. No CV needed for an initial chat. Specialism: DevOps, Cloud, SRE. Job type: Permanent. Location: London, UK Remote. Salary: 75 more »
Salary: £ 80 K

DevOps Platform Engineer

City of London, England, United Kingdom
Hybrid / WFH Options
Cyber Security Jobsite
Platform Engineer to join an existing DevOps team working in the Law Enforcement sector based in London. As a key member of a DevOps Engineering team, you'll work as part of an empowered, autonomous team with regular contact with end-users to flexibly and efficiently understand, design, develop, deploy … You will work in a team given as much ownership and responsibility as you have the appetite for but part of a much bigger Engineering community to give you the support you need to grow in your career. We fully embrace DevOps ways of working in our teams, and … days per week alongside the rest of the team. You will have many of the following: Experience working in a similar DevOps/SRE/Infrastructure role. An appreciation of Infrastructure as Code, and CI/CD tooling. Scripting abilities with languages such as Shell, Bash, or Python etc. A more »

Site Reliability Engineer

United Kingdom
Oracle
The job is remote from the UK, currently without visa sponsorship. Job description: Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral … a desire to develop a deep understanding of services and technologies. Career Level - IC4. Qualifications: 5+ years experience in Systems Engineering, DevOps or SRE roles running large scale infrastructure, cloud or web services. Proficiency with the following: Kubernetes, Terraform, Helm, Docker. Proficiency in a language like Go, Python, Bash/ more »

Site Reliability Engineer

London Area, United Kingdom
Durlston Partners
Site Reliability Engineer – High Frequency Trading – London - £120-150k base + bonus + share options It’s a very cool time to join this high frequency trading firm. They are past the shaky uncertainty phase of a start-up. The core blocks have been built. They are more »

Platform Engineer - Azure

Exeter, England, United Kingdom
Hybrid / WFH Options
BJSS
and monitor the cloud ecosystem, enabling others to do whatever it is they need to do. We see it as a mix of DevOps, SRE and technical design, always with a focus on security, reliability and the other well architected Pillars. About the Role How many times are you … communities we have created at our local offices. But we offer plenty of flexibility and you can split your time between the office, client site and WFH Discounts – we have preferred rates from dozens of retail, lifestyle, and utility brands An industry-leading referral scheme with no limits on … the number of referrals Flexible holiday buy/sell option Electric vehicle scheme Training opportunities and incentives – we support professional certifications across engineering and non-engineering roles, including unlimited access to O’Reilly Giving back – the ability to get involved nationally and regionally with partnerships to get people more »

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Bayside Solutions
Site Reliability Engineer Contract Salary Range: £91,400 - £108,000 per year Location: London, England - Hybrid Role Job Summary: We seek a Site Reliability Engineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and performance. This … Willingness to adapt and learn new tools and technologies as needed Availability to participate in on-call rotations as required Desired Skills and Experience Site Reliability, Java, AWS, Azure, Kubernetes, GIT, CD Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside more »

DevOps Engineer

London Area, United Kingdom
HCLTech
HCLTech is a global technology company, home to 219,000+ people across 54 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing … CPG, and Public Services. Consolidated revenues as of $13 billion. Job description: We are looking for a Senior DevOps Engineer. Proven experience leading SRE teams. Proven experience managing complex DevOps projects. Experience with Java platform engineering. A good understanding of the following is beneficial: Kubernetes/Helm, Jenkins/TeamCity more »

Site Reliability Engineer - GCP

London Area, United Kingdom
NP Group
Site Reliability Engineer – Google Cloud London Excellent Salary & Package including Bonus Key Skills – SRE, GCP (Enterprise Deployments), HELM, Python/Golang/Java, IAC/Automation, Blockchain Technologies, Node Infrastructure, Security Hardening Overview An influential member of a team of highly skilled engineers building out cloud native infrastructure … as an enabler for the developers and business. Predominantly supporting Java, Typescript and Python workloads which are built upon open-source software. As an SRE subject matter expert you will: Enable cross functional teams to rapidly code, build and deliver. Own critical parts of the software development life cycle such … accountable for the cloud native deployment environments across dev, staging and production. Expertise Required: At least 5 years professional experience in a DevOps/SRE role Google Cloud Expertise - GCP Enterprise Level Deployments, Helm etc. Experience building tooling, scripts or applications to enhance the developer experience. 2+ years current experience more »

Cloud Database Reliability Engineer NSC

United Kingdom
Oracle
next generation IaaS cloud and the next generation cloud support experience to go with it. We are building a team of energetic, customer-focused site reliability engineers to build a world-first and best in class customer experience blending sys admin, database engineering, and cloud disciplines. You … ll be part of a team that learns deeply how our cloud platform works so you can be the bridge between Engineering and Operations. As part of the broader Engineering organization, you will act as the voice of the customer to influence product features and plans to improve more »

Site Reliability Engineer

London Area, United Kingdom
Hybrid / WFH Options
RedCat Digital
processing data at a scale comparable to Meta and Google! They are on the lookout for multiple count Senior Site Reliability Engineers (SRE) to join one of their incredibly talented teams. As a Site Reliability Engineer (SRE), you will play a crucial role in ensuring the … reliability, scalability, and performance of our systems and infrastructure. You will work closely with cross-functional teams to design, implement, and maintain robust and resilient systems, with a focus on automation, monitoring, and incident response. The role: • Working arrangements: Flexible – can be fully remote (UK residents only – unfortunately, Visa … support our core products and services. Develop and maintain automation tools and scripts for deployment, monitoring, and management of infrastructure components. Collaborate with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind. Implement and maintain monitoring, alerting, and logging systems to more »

Cloud Operations Site Reliability Engineer

England, United Kingdom
Loftware
About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud Operations Site Reliability Engineer will be hands-on … troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations Site Reliability Engineer is someone that is a team player with the desire and passion for modern technology and keen to take on large … scale responsibility for the cloud environment. The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to manage the more »

Site Reliability Engineer - Remote

Glasgow, Lanarkshire, United Kingdom
Hybrid / WFH Options
Sanderson Recruitment Plc
Site Reliability Engineer -Remote/Glasgow -Salary to £75,000 + Bonus -Immediate Start Fantastic new opportunity to the market to join our Glasgow-based tech-for-good client, specialising in digital solutions and who have a huge global reach. Due to increased success in their space and … demand for their services, they are now recruiting for a Site Reliability Engineer to join the team as they embark on a hugely exciting roadmap. The core function of the role will be to maintain a secure and reliable infrastructure, define and manage the infrastructure and support the more »
Employment Type: Permanent
Salary: GBP 75,000 Annual

Azure Cloud Engineer - SRE

City of London, London, United Kingdom
Hybrid / WFH Options
Akkodis
Azure Site Reliability Engineer Akkodis are currently working in partnership with a leading service provider to recruit an experienced Azure Site Reliability Engineer to join a growing team of talented Cloud Engineers providing high level support and project delivery for a large customer base. Please note … fully remote role and you must be eligible to gain security clearance (do not need to hold currently). The Role As an Azure Site Reliability Engineer you will support the cloud infrastructure used to deliver cloud hosted managed services to customers. You will have a high customer … Azure Networking Azure Storage Azure Monitor and Log Analytics Azure Security Center Demonstrable career operational experience from one of the following areas: Server Infrastructure Engineering (Virtualisation/Windows/Linux). Office/Microsoft 365 Administration. Network Engineering. DevOps (CI/CD, pipelines and Infrastructure as Code) In-depth more »
Employment Type: Permanent
Salary: £65000 - £70000/annum

Lead Site Reliability Engineer

Stow, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Harvard, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Boylston, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Hudson, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Wayland, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Fayville, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Bolton, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Lead Site Reliability Engineer

Sudbury, Massachusetts, United States
BJ's Wholesale Club
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to Site Reliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Employment Type: Permanent
Salary: USD Annual

Site Reliability Engineering

10th Percentile: £55,400
25th Percentile: £61,250
Median: £80,000
75th Percentile: £112,500
90th Percentile: £125,000