ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA, SLO, and SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA, SLO, and SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal …
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
planning. Create sustainable systems and services through automation and uplifts. Balance feature development speed and reliability with well-defined service level objectives. Have you got what it takes? 3-6 years of working experience in a similar role, with a focus …
Warwick, Warwickshire, United Kingdom Hybrid / WFH Options
ICEO
Cooperate with engineering and product teams to design and implement highly available and fault-tolerant systems. Participate in improving Service Level Objectives, Service Level Indicators, and error budgets to enhance system reliability. Work …
valued. What You'll Do: Key responsibilities in this role will include (but not be limited to): Leveraging core SRE values - measuring (SLI/SLO/SLA), testing, and eliminating toil via automation with appropriate Disaster Recovery planning. Refining KPIs to enable data-driven decision making for availability and reliability …
deployment) of e.g. ELK, CloudWatch, Fluentd, to enable forensic log analysis and system tuning as well as data-driven performance analysis (i.e. SLI/SLO) and capacity planning. You are a competent Linux & Windows systems administrator (for multiple distributions), including storage management (e.g. LVM, RAID) and security best practices, e.g. …
SR2 | Socially Responsible Recruitment | Certified B Corporation™
and experience in automating/scripting. Understand and write code in multiple languages such as Python, Java, Golang, BASH and PowerShell. Experience in monitoring SLOs, SLIs and SLAs, and logging updates and altering where appropriate. Perks and Benefits: Up to £105k (DoE). 2 days a week onsite …
City Of Bristol, England, United Kingdom Hybrid / WFH Options
Gravitas Recruitment Group (Global) Ltd
production environment and experience in automating/scripting. · Ability to quickly understand, update and write code in languages (ideally Java). · Working experience monitoring SLOs, SLIs and SLAs and logging updates. · Strong DevOps understanding and familiarity, including experience of Infrastructure as Code and CI/CD pipelines, e.g. …
ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failure scenarios, and design systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information …
Expertise in defining and monitoring service quality metrics (such as RED, Golden Signals), establishing microservice Service Level Objectives (SLOs), and managing error budgets. Proficiency in Linux, cloud networking, microservices architecture, and Amazon EKS. Preferred qualifications include: Prior …
and training engineers up to Staff standard. Operational Stability: Demonstrate a production-first attitude, continuously considering observability and maintaining Service Level Objectives, while delivering change at pace. Research & Innovation: Embrace emerging technologies and trends, and share insights with the organisation, while …
City of London, London, United Kingdom Hybrid / WFH Options
Sanderson Recruitment
As our Site Reliability Engineer, you'll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You'll define error budgets that support finding the right balance between risk …
standards to meet performance, reliability, and maintainability of the systems. With a strong production-first mindset, drive observability, maintain Service Level Objectives (SLOs), and ensure efficient incident resolution. Oversee the maintenance of existing systems, ensuring continuous improvements and prompt resolution of …
systems and third-party solutions. Network Health Management: Define and implement prediction pipelines for long-term network health, availability, and service-level objectives. Operations Automation: Lead initiatives to automate and optimize network operations focusing on scalability and reliability. Collaborative Development: Work closely …
to reduce failures and manual tasks, thereby improving overall application performance and availability. As well as responding to stakeholder requests within agreed timescales or SLO, they will also be supporting maintenance activities, critical systems, and the planning of releases related to production applications. This is an opportunity to join an …
Create pro-active monitoring and observability solutions to help us see issues before our customers do. Define and measure Service Level Objectives and Service Level Indicators. Why Lloyds Banking Group: We're on … workplaces, and colleagues to make our Group a great place for everyone. Including you! What you'll need: Strong practitioner in SRE principles (SLI, SLO & SLA) using Observability, Logging, Monitoring & Alerting. Experience of Infrastructure as Code and CI/CD pipelines using tools such as Terraform, Jenkins and Harness. Can …
quality. The Service Delivery Manager will be responsible for ensuring our technical teams meet their service level objectives, driving operational excellence, and maintaining strong relationships with internal and external stakeholders. You will play a vital part in …
and operations. Write maintainable, well-tested, high-quality code and uphold engineering best practices. Focus on observability and maintain Service Level Objectives. Take operational responsibility for the Identity Platform, including joining the on-call rota. Foster a strong engineering culture through …
your ideas to technical and non-technical audiences. Additional Desired Skills: Experience with incident management platforms like PagerDuty, OpsGenie, or similar tools. Understanding of SLO/SLA management and implementations. Knowledge of industry standard incident management frameworks and best practices. Familiarity with automated remediation and runbook automation. Experience with DevOps …
and service automation. Lead the definition and tracking of Service Level Objectives (SLOs) to measure service availability in combination with service, product and engineering communities. Collaborate with product and engineering …
Level Agreements (SLA) through Service Level Objectives (SLO) and Service Level Indicators (SLI). Liaise with client technical and business teams as needed to ensure …
Worcester, Worcestershire, West Midlands, United Kingdom
FBI &TMT
Collaborate with stakeholders, software engineers, QA engineers, and ADMs. Manage backlog ownership and ensure discipline within the team. Define Service Level Objectives (SLOs) for measuring service performance. Job Requirements: Experience in a technical BA role or similar …