Permanent Site Reliability Engineering Jobs in the UK

1 to 25 of 1,927 Permanent Site Reliability Engineering Jobs in the UK

Mid-Senior DevOps / Site Reliability Engineer (m/f/*)

London, England, United Kingdom
Hybrid / WFH Options
Quaisr Limited
DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/*). We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are: You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and deploying scalable, reliable systems. You … systems. Quaisr is an early-stage startup backed by Crane Venture Partners (UK) and Acequia Capital (USA), along with international angel investors. We were founded on partnerships with leading engineering and research institutions including Imperial College London and The Alan Turing Institute. Our customers are early-adopter multinational organisations in the pharmaceuticals, fast-moving consumer goods, defence, and energy …

DevOps/Site Reliability Engineer, Junior/Mid/Senior (m/f/*)

United Kingdom
Hybrid / WFH Options
Crane Venture Partners
We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are: You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and deploying scalable, reliable systems. You are passionate about automation, cloud technologies, and continuous improvement. Must have: Proven track record of … systems. Quaisr is an early-stage startup backed by Crane Venture Partners (UK) and Acequia Capital (USA), along with international angel investors. We were founded on partnerships with leading engineering and research institutions including Imperial College London and The Alan Turing Institute. Our customers are early-adopter multinational organisations in the pharmaceuticals, fast-moving consumer goods, defence, and energy …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer

England, United Kingdom
Hybrid / WFH Options
Stratospherec Limited
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in student community management software … ensure high availability and performance. Collaborate with product engineering teams to design/build fit-for-purpose and observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, having previously worked in a Software Engineering role in .NET and C# or Java or a similar OO development language. Proficiency in … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.

Senior Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Stratospherec Limited
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in student community management software … ensure high availability and performance. Collaborate with product engineering teams to design/build fit-for-purpose and observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, having previously worked in a Software Engineering role in .NET and C# or Java or a similar OO development language. Proficiency in … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more.

Senior Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
TieTalent
reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation. The SRE team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in student community management software … ensure high availability and performance. Collaborate with product engineering teams to design/build fit-for-purpose and observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering role, having previously worked in a Software Engineering role in .NET and C# or Java or a similar OO development language. Proficiency in … and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider. Please apply with your CV to find out more. Nice-to-have skills …

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Global Screening Services
observability, and responding to incidents as well as ensuring the platform is performant and reliable. You will be a key member of the team, liaising with product teams, embedding SRE principles and building the observability platform for the next stage of growth at GSS. You will have direct input into the direction of Technical Operations, solving problems, supporting developers and … collaborative, flexible, and innovative work culture where your ideas are valued. What You’ll Do: Key responsibilities in this role will include (but not be limited to): Leveraging core SRE values - measuring (SLI/SLO/SLA), testing, and eliminating toil via automation with appropriate Disaster Recovery planning. Refining KPIs to enable data-driven decision making for availability and reliability … within a production, SaaS company (preferably event-driven). Be a self-starter that relishes responsibility. Take strategic direction and own end-to-end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice. Fluency with common observability tooling like Prometheus, Grafana, OTEL and CloudWatch. Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards.

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer. Southampton HQ. Hybrid Working. Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on … Professional DevOps Engineer, or similar credentials. Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability. Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell. Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints. Practical experience using infrastructure-as-code tools like CloudFormation or Terraform. In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong understanding of …

Senior Data Engineer

London, England, United Kingdom
Hybrid / WFH Options
Leonardo
with data architects and security teams to ensure compliance with security policies and data governance standards. Manage and monitor large-scale data flows in real-time, ensuring system performance, reliability, and data integrity. Develop robust data models to support analytics and reporting within secure environments. Perform troubleshooting, debugging, and performance tuning of data pipelines and the Elastic Stack. Build …

Data Reliability Engineer (Data SRE)

Belfast, Northern Ireland, United Kingdom
CME Group
Data System Reliability Engineer (dSRE). Role Overview: A crucial role in CME's Cloud data transformation, the data SRE will be aligned to data product pods, ensuring our data infrastructure is reliable, scalable, and efficient as the GCP data footprint expands rapidly. Accountabilities: Automate data tasks on GCP. Work with data domain owners, data scientists and other … data engineers, data scientists, and software engineers to understand data requirements, troubleshoot issues, and support data-driven initiatives. Continuously assess and improve data infrastructure and data processes to enhance reliability, efficiency, and performance. Maintain clear and up-to-date documentation related to data systems, configurations, and standard operating procedures. Qualifications: Bachelor's or Master's degree in Computer Science … Software Engineering, Data Science or related field, or equivalent practical experience. Experience as a Data Site Reliability Engineer or a similar role, with a focus on data infrastructure management. Proficiency in data technologies, such as relational databases, data warehousing, big data platforms (e.g., Hadoop, Spark), data streaming (e.g., Kafka), and cloud services (e.g., AWS, GCP, Azure).

Senior Site Reliability Engineer - AWS Kubernetes

London, England, United Kingdom
Source Technology
A truly unique opportunity to help launch a brand new team within a global financial services provider. This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage, Network and … Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. Requirements: Proven experience managing and optimizing a diverse infrastructure stack. Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation). Familiarity with service mesh technologies (Istio, Linkerd). Solid understanding of virtualization (VMware, Hyper-V) and containerization … Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like AWS CloudWatch, Azure Monitor, and Google Cloud …

Site Reliability Engineer

Southampton, Hampshire, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer. Southampton HQ. Hybrid Working. Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials. Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability. Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell. Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints. Practical experience using infrastructure-as-code tools like CloudFormation or Terraform. In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong understanding of …
Employment Type: Permanent

Site Reliability Engineer

Portsmouth, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer. Southampton HQ. Hybrid Working. Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials. Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability. Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell. Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints. Practical experience using infrastructure-as-code tools like CloudFormation or Terraform. In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong understanding of …

Site Reliability Engineer

Hampshire, England, United Kingdom
Hybrid / WFH Options
Spectrum IT Recruitment
Site Reliability Engineer. Southampton HQ. Hybrid Working. Cloud, SaaS, AWS. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on-premises environments, empowering organisations to enhance customer experiences, maintain regulatory compliance, and proactively fight fraud. The … Professional DevOps Engineer, or similar credentials. Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability. Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell. Solid grasp … of cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints. Practical experience using infrastructure-as-code tools like CloudFormation or Terraform. In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong understanding of …

Site Reliability Engineer

London, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
Site Reliability Engineer. Southampton HQ - 2 times a week in office. Cloud, SaaS, AWS. Please be advised Security Clearance is required for this position. We are working alongside one of our longstanding clients in helping them recruit a Site Reliability Engineer. The company deliver cutting-edge enterprise software solutions across both cloud and on … Professional DevOps Engineer, or similar credentials. Do You Have What It Takes? 3-6 years of hands-on experience in a similar role, with a strong emphasis on systems engineering, automation, and service reliability. Proficient in at least one programming language such as Python, Go, Java, or C#, along with scripting skills in Bash or PowerShell. Solid grasp of … cloud platforms like AWS, including an understanding of how core services like EC2, ECS, Lambda, and DynamoDB operate under reliability constraints. Practical experience using infrastructure-as-code tools like CloudFormation or Terraform. In-depth knowledge of CI/CD principles and hands-on experience with tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong understanding of containerisation …

Senior SRE

London, England, United Kingdom
Index Exchange
We shaped the earliest forms of ad tech, and we’re looking for the technical expertise to help shape its future. Our customers have unique problems that can only … and reliability of our worldwide 24x7 on-premises and cloud environments. As the first line of defense, this team has ownership of operations engineering. Collaborating closely with IT, SRE, Network, and Data engineering teams, and key stakeholders across business, product, and software engineering teams, we play a crucial role in maintaining systems health, responding to incidents, and … for automation. Work Experience: 6-8 years of proven experience in previous roles or one of the following roles: DevOps Engineer, Linux System Administrator, Site Reliability Engineer (SRE). Built or maintained a private-cloud infrastructure running CentOS/Rocky Linux on a mix of bare-metal, virtualization, and containerization. Managed public cloud environments such as AWS, GCP, Azure …

Senior Production Operations Engineer

London, England, United Kingdom
Index Exchange
and reliability of our worldwide 24x7 on-premises and cloud environments. As the first line of defense, this team has ownership of operations engineering. Collaborating closely with IT, SRE, Network, and Data engineering teams, and key stakeholders across business, product, and software engineering teams, we play a crucial role in maintaining systems health, responding to incidents, and … of-contact team for operational issues, providing both internal and external teams with technical support and ensuring the issue remains in custody until resolution. Collaborate with product and software engineering teams to relay operational insights and requirements. Automation, Tooling & Research: Continuously identify opportunities for optimization and present findings to technical leads and management. Research and implement improvements enhancing systems … for automation. Work Experience: 6-8 years of proven experience in previous roles or one of the following roles: DevOps Engineer, Linux System Administrator, Site Reliability Engineer (SRE). Built or maintained a private-cloud infrastructure running CentOS/Rocky Linux on a mix of bare-metal, virtualization, and containerization. Managed public cloud environments such as AWS, GCP …

Senior DevOps Engineer SaaS Product (MS Azure)

London, England, United Kingdom
Hybrid / WFH Options
ZigZag Global
hybrid environment (Windows/Linux) using Kubernetes, Docker, Cosmos DB, SQL Server, embodying our startup spirit of innovation and adaptability. Manage Azure resources with a grow-up mindset, prioritizing scalability, reliability, and cost-efficiency. Monitor system performance, ensuring the satisfaction of our users and stakeholders by proactively addressing issues and optimizing performance. Continuous Integration and Deployment: Implement and maintain CI … empowerment through streamlined deployment processes. Configuration Management: Implement configuration management solutions with flexibility and adaptability, using tools like Terragrunt, Terraform, Ansible, ARM. Manage infrastructure as code, promoting consistency and reliability across environments to support our expanding operations. Foster a human-centric approach by prioritizing ease of use and accessibility in configuration management processes, empowering team members to contribute effectively. … inclusivity and empathy in collaboration and knowledge-sharing activities, ensuring every voice is heard and valued. What you'll need to be successful: Bachelor's degree in Computer Science, Engineering, or a related field. Extensive experience in a DevOps or infrastructure engineering role. Strong proficiency in Microsoft Azure cloud services and architecture. Experience with containerization technologies such as …

Delivery Engineer

United Kingdom
Hybrid / WFH Options
Sportserve
tools/systems in a really fast-paced dynamic team and environment. The key values that we strive towards in Technology: Ownership, Continuous Improvement Mindset, Trust and Honesty. Delivery Engineering Team: Delivery Engineering is an internal platform team focused on building and improving systems to accelerate the delivery of software and infrastructure. Our team's first target will … be enabling DevOps practices due to its direct impact on the overall processes and culture. The primary goal of Delivery Engineering is to build the "Golden Path" to production by automating and concentrating improvements around bottlenecks such as integration points and handoffs. We will work closely with Development teams to break down silos, helping to build more resilient pipelines … equipping ourselves with the right tools like operating procedures (SOPs) and architectural diagrams. Improve the team skill set by increasing and sharing knowledge, thus setting the quality and precision engineering benchmark. Understand the importance of and be a strong advocate for non-functionals, e.g. monitoring, alerting, logging. Required Skills and Experience: 3+ years of experience in DevOps-oriented roles …
Employment Type: Permanent
Salary: GBP Annual

Cloud Technical Architect / Data DevOps Engineer

Bristol, United Kingdom
Hewlett Packard Enterprise Development LP
the next level? We have a brand-new opportunity for a bright, driven, customer-focussed professional to join our Hybrid Cloud 'Delivery' team, and work alongside our Enterprise Data Engineering consultants to accelerate and drive data engineering opportunities. The Advisory and Professional Services (A&PS) delivery team within HPE Pointnext Services is responsible for bringing thought-leadership, industry … implementation of scalable clustered Big Data solutions, with a specific focus on automated dynamic scaling, self-healing systems. Participating in the full lifecycle of data solution development, from requirements engineering through to continuous optimisation engineering and all the typical activities in between. Providing technical thought-leadership and advisory on technologies and processes at the core of the data … Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, Filebeat etc. Observability - SRE. Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem. Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages. The following Technical …
Employment Type: Permanent
Salary: GBP Annual

Platform Engineer

London, England, United Kingdom
Capgemini
programmes, using our skills to drive outstanding delivery. Having the right people using the right methodology and following the right process gives every client the right result. The Platform Engineering capability within Digital is a growing team dedicated to ensuring that the technologies selected and the way they are implemented meet the business objectives of our clients. There are … as Terraform, CloudFormation, ARM, or Pulumi. Experience in building secure applications and infrastructure. Strong communication skills, with the ability to convey and understand complex technical concepts clearly and concisely. SRE skills including observability and telemetry monitoring. Familiarity with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerization using Docker, Kubernetes, OpenShift, and Helm. Programming skills in languages … address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast-evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering, and platforms. The Group reported in 2022 global revenues of €22 billion. When you join Capgemini, you don’t just start a new job. You become part of something …

Site Reliability Engineer

London, England, United Kingdom
SS&C Technologies
to small and mid-market firms, rely on SS&C for expertise, scale, and technology. Job Description: We are looking for a highly skilled Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the scalability, reliability, and performance of our systems, infrastructure, and applications. You will collaborate … and incident response systems to ensure high availability and performance. Develop tools and automation to eliminate manual operations and improve system efficiency. Collaborate with development teams to ensure that reliability and performance are considered from the outset. Conduct root cause analysis and postmortems to learn from system failures and prevent recurrence. Participate in on-call rotations and respond to … and problem-solving skills. Strong communication and collaboration abilities. Preferred Qualifications: Experience with large-scale distributed systems. Familiarity with SLAs, SLOs, and SLIs. Previous experience in a DevOps or SRE role in a production environment. We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives …

Senior Site Reliability Engineer – FinTech / Global Payments – London HQ / Remote First

London, England, United Kingdom
Hybrid / WFH Options
Future Talent Group
… new industry standard within their market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability and availability. Monitor systems … capacity planning. Collaborate with dev teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Strong …

Senior Site Reliability Engineer - FinTech / Global Payments - London HQ / Remote First

London, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
Senior Site Reliability Engineer – FinTech/Global Payments – London HQ/Remote First. Salary - £85,000 + Bonus. Location - This UK-based team offers a fully remote working option, with a headquarters in Central London. In this role, you will be joining a leading SaaS FinTech scale-up, setting a … new industry standard within their market. The business aims to scale its platform significantly over the next few years to support a growing international client base. Responsibilities: Champion core SRE practices: define SLIs/SLOs/SLAs, reduce toil through automation, and plan for Disaster Recovery. Refine KPIs to support data-driven decisions around reliability and availability. Monitor systems … capacity planning. Collaborate with dev teams to build resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands …

Senior Site Reliability Engineer - (Networks, AWS & Kubernetes)

London, United Kingdom
Source Technology
Senior Site Reliability Engineer - (Networks, AWS & Kubernetes) (BH-48405-2) Location: London, England Sector: IT Salary: £90,000.00 to £120,000.00 per annum Benefits: + 15% bonus + car allowance A truly unique opportunity to help launch a brand new team within a global financial services provider. This new team of highly skilled Full Stack Infrastructure Engineers will … cover Compute, Storage, Network, and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. Requirements: Proven experience managing and optimizing a diverse infrastructure stack. Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation). Familiarity with service mesh technologies (Istio, Linkerd). Solid understanding of virtualization … Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaos engineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like AWS CloudWatch, Azure Monitor, and Google Cloud …
Employment Type: Permanent
Salary: GBP Annual

Senior Site Reliability Engineer

Manchester, United Kingdom
S&P Global, Inc
to minimize recurrence. What We're Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a Site Reliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred. Experienced in production environments running in AWS. Comfortable with Infrastructure as … GitHub Actions, Azure DevOps. Excellent troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues. Familiarity working in an Agile environment. True understanding of Site Reliability Engineering. Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS. Familiar with programming languages …
Employment Type: Permanent
Salary: GBP Annual
Site Reliability Engineering
10th Percentile: £57,388
25th Percentile: £64,319
Median: £82,500
75th Percentile: £115,000
90th Percentile: £135,000