Site Reliability Engineer Jobs in England

26 to 37 of 37 Site Reliability Engineer Jobs in England

Software Engineering Manager, Site Reliability, Cloud Incident Response

London, United Kingdom
Google Inc
response. Preferred qualifications: Master's degree or PhD in Computer Science, or a related technical field. Experience as a cloud customer. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally … visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage … the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

London, United Kingdom
Duffel
developer experience to go with it. The tools used on the team include Elixir, Phoenix, Kubernetes and Google Cloud Platform. Site Reliability Engineering at Duffel As an SRE at Duffel, you'll be part of a small team within engineering that is responsible for the reliability, performance, and resilience of our infrastructure and applications. You will be … silently drop spans. - An enthusiasm for both software development and systems engineering. - A high bar for code and configuration quality and readability. - A good understanding of current observability and reliability practices. - Experienced and comfortable in running incident response. - Big picture thinking - you can make trade offs on technical work streams against business impact. - Fantastic communication skills. You're able … We manage a data pipeline using Pub/Sub, Airbyte, and dbt. Our Current Focus We're currently driving a big shift in how we think about and monitor reliability across the engineering organisation, with a focus on early detection of customer-impacting issues. We're extending and standardising our use of OpenTelemetry, and introducing Honeycomb as the single More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer III, Site Reliability Engineering, Google Cloud

London, United Kingdom
Google Inc
Ability to debug, optimize code, and to automate routine tasks. Excellent problem-solving approach, with effective verbal and written communication skills. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally … visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage … the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Splunk SRE Engineer with ELK Stack and Kibana-6months-Birmingham

Birmingham, United Kingdom
Kirtana Consulting
Kirtana consulting is looking for Splunk SRE Engineer with ELK Stack and Kibana for 6months rolling contract in Birmingham. Job description: Role Title: Splunk SRE Engineer Responsible for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Software Engineer / SRE

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Fruition Group
Software Engineer/SRE JavaScript/TypeScript, Node.js, AWS, Observability Leeds/Hybrid, c. 2x per week Salary up to £65,000 We're looking for a Software Engineer with strong AWS and Observability experience to join a growing engineering team in Leeds. This is a hybrid role, giving you the flexibility to split your time between home … and a modern city-centre office. You'll work across both engineering and site reliability, helping to build and scale systems that are reliable, secure, and observable. You'll be a key part of improving platform performance and automation, while collaborating with developers, product teams, and operations. What you'll be doing: Building and maintaining scalable cloud infrastructure … in AWS Implementing and improving observability tools (monitoring, logging, tracing) Automating deployments and improving CI/CD pipelines Driving reliability, availability and performance across systems Working with developers and SREs to solve complex problems What we're looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar More ❯
Employment Type: Permanent, Work From Home
Salary: £65,000
Posted:

Site Reliability Engineer

London, United Kingdom
Cisco Systems
on the Splunk TechOps team, empowering our customers to execute our vision making machine data accessible, usable, and valuable to everyone! The Splunk TechOps organization runs Splunk cloud, blending SRE, Systems Engineering and Service Engineering disciplines, across functional global teams. Come join a team that is striving for operational awesomeness and trying to automate the world. We have a large … experience and drive the growth of Splunk Cloud. What we're looking for NOTE: 4 x 10h shifts: Wednesday - Saturday/8am-6pm We are looking for a TechOps SRE to help maintain, contribute to and improve the next generation of our large scale Cloud offering. You will be working with providers and supporting the infrastructure that powers Splunk's … that deal with operating systems (particularly Linux) and networking. You might also have worked with Cloud technologies. Your previous job titles might be something close to systems admin, network engineer or devops engineer. You're passionate about your work. Our customers are passionate about Splunk and we want the same from our engineers. You should enjoy actively being responsible More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead Site Reliability Engineer

City of London, London, United Kingdom
TechNET IT Recruitment Ltd
with a modern, full-stack platform that delivers logs, metrics, traces, and security monitoring — cutting costs by up to 70% while boosting efficiency. They are looking for a Lead SRE to own and elevate our Alerting & Incident Management platform . You’ll be the driving force behind reliability, customer satisfaction, and product excellence — ensuring smooth alert management, fewer engineering … experience by speeding up alert resolution and reducing interruptions for engineers. Build solutions to common pain points, shaping roadmaps, documentation, and technical knowledge. Develop benchmarking tools to improve performance, reliability, and scalability. Stay ahead of incident management trends to drive new workflows and product improvements. Mentor teams and lead with clear, impactful communication. What We’re Looking For 5+ … platform experience (PagerDuty, OpsGenie, etc. a plus). Solid technical foundation with cloud/distributed systems. Excellent communicator, comfortable working across US/IL time zones. Bonus: leadership experience, SRE/DevOps background, knowledge of SLO/SLA practices. More ❯
Posted:

Lead Site Reliability Engineer

London Area, United Kingdom
TechNET IT Recruitment Ltd
with a modern, full-stack platform that delivers logs, metrics, traces, and security monitoring — cutting costs by up to 70% while boosting efficiency. They are looking for a Lead SRE to own and elevate our Alerting & Incident Management platform . You’ll be the driving force behind reliability, customer satisfaction, and product excellence — ensuring smooth alert management, fewer engineering … experience by speeding up alert resolution and reducing interruptions for engineers. Build solutions to common pain points, shaping roadmaps, documentation, and technical knowledge. Develop benchmarking tools to improve performance, reliability, and scalability. Stay ahead of incident management trends to drive new workflows and product improvements. Mentor teams and lead with clear, impactful communication. What We’re Looking For 5+ … platform experience (PagerDuty, OpsGenie, etc. a plus). Solid technical foundation with cloud/distributed systems. Excellent communicator, comfortable working across US/IL time zones. Bonus: leadership experience, SRE/DevOps background, knowledge of SLO/SLA practices. More ❯
Posted:

Splunk SRE Engineer

Birmingham, United Kingdom
eTeam Workforce Limited
Rate range: GBP 360 Work mode: Hybrid, 3 days working from client office Contract duration: Location: Birmingham, UK JOB DETAILS Role Title: Splunk SRE Engineer Responsible for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with More ❯
Employment Type: Contract
Rate: GBP Daily
Posted:

Senior Site Reliability Engineer - MongoDB, Cassandra

Basingstoke, Hampshire, United Kingdom
Visa Inc
requirements, being transactional, analytical, non-relational, or data warehouse. The wider DBA team is the technology owner of multiple RDBMS and NoSQL technologies, is responsible to strategize, advance, and engineer enterprise solution for automated build and patching and efficient administration, that meet security, availability, performance, fast delivery and reporting requirements, and to support projects and products using these technologies. … JOB SCOPE As an engineer in this team, the individual will be involved in the build and run activities related to NoSQL database technology and infrastructure. The role will contribute to solution engineering and support as well as being responsible for delivering database projects, maintaining running systems and performing problem analysis and troubleshooting. The individual should be well versed More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

SRE/Infrastructure Engineer

Basingstoke, Hampshire, United Kingdom
InfoSum
As an SRE/Infrastructure Engineer, is responsible for designing, implementing, and maintaining the cloud infrastructure our platform sits on, as well as the monitoring and deployment services that enable the rest of engineering to develop, deliver and maintain our platform services. You will also be instrumental in both monitoring and incident response, playing a key role in ensuring … maximum reliability and minimal downtime. You will collaborate with teams across the company, including developers, customer support, product owners and sales, to ensure the reliability, scalability, and performance of our platform. Infrastructure Design and Implementation: assist or lead in the design, deployment, and operation of the infrastructure components required to support our applications and services. This includes managed More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer/SRE

Bromley, Greater London, Bromley Town, United Kingdom
Hybrid / WFH Options
Ascendion
Below are the details of the position: Job Title: Platform Engineer/SRE Work Location: Bromley, UK (Hybrid – 3 days a week) Job Description: 15+ years’ experience in delivering large scale applications with focus on performance, scalability, security, and reliability. Experience in a highly Agile continuous integration and continuous deployment environment, preferably within a financial domain. Strong experience in More ❯
Employment Type: Permanent, Contract
Posted:
Site Reliability Engineer
England
10th Percentile
£55,500
25th Percentile
£64,375
Median
£75,000
75th Percentile
£85,000
90th Percentile
£99,500