Cambridge, Gloucestershire, UK Hybrid / WFH Options
AI Tech Suite
ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA, SLO, and SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
ensuring governance, security, compliance, and control. Experience Requirements: Proven experience in a senior SRE role or similar. Strong knowledge of cloud technologies and SLA, SLO, and SLI management. Experience leading teams and implementing SCRUM processes. Excellent communication and leadership skills. Experience line managing, mentoring, and coaching. Responsibilities: Collaborate with the Principal More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
planning Create sustainable systems and services through automation and uplifts Balance feature development speed and reliability with well-defined service level objectives Have you got what it takes? 3-6 years of working experience in a similar role, with a focus More ❯
Warwick, Warwickshire, United Kingdom Hybrid / WFH Options
ICEO
Cooperate with engineering and product teams to design and implement highly available and fault-tolerant systems. Participate in improving Service Level Objectives, Service Level Indicators, and error budgets to enhance system reliability. Work More ❯
/CD pipelines, Infrastructure as Code, and automation frameworks tailored to our systems Drive disaster recovery planning, high availability architecture, and 24/7 SLO adherence for critical ad-serving solutions Build and maintain custom, complex deployment pipelines using Jenkins and other modern tools Improve system reliability and developer productivity More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
and experience in automating/scripting. Understand and write code in multiple languages such as Python, Java, Golang, BASH and PowerShell. Experience in monitoring SLOs, SLIs and SLAs and logging updates and altering where appropriate. Perks and Benefits Up to £105k (DoE) 2 days a week onsite More ❯
City Of Bristol, England, United Kingdom Hybrid / WFH Options
Gravitas Recruitment Group (Global) Ltd
production environment and experience in automating/scripting. · Ability to quickly understand, update and write code in languages (ideally Java). · Working experience monitoring SLOs, SLIs and SLAs and logging updates. · Strong DevOps understanding and familiarity, including experience of Infrastructure as Code and CI/CD pipelines, e.g. More ❯
ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failure scenarios, and design systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information More ❯
Expertise in defining and monitoring service quality metrics (such as RED, Golden Signals), establishing microservice Service Level Objectives (SLOs), and managing error budgets. Proficiency in Linux, cloud networking, microservices architecture, and Amazon EKS. Preferred qualifications include: Prior More ❯
to reduce failures, manual tasks and therefore improving overall application performance and availability. As well as responding to stakeholder requests within agreed timescales or SLO, they will also be supporting maintenance activities, critical systems, and the planning of releases related to production applications. This is an opportunity to join an More ❯
Create pro-active monitoring and observability solutions to help us see issues before our customers do Define and measure Service Level Objectives and Service Level Indicators Why Lloyds Banking Group We're on … workplaces, and colleagues to make our Group a great place for everyone. Including you! What you'll need Strong practitioner in SRE principles (SLI, SLO & SLA) using Observability, Logging, Monitoring & Alerting Experience of Infrastructure as Code and CI/CD pipelines using tools such as Terraform, Jenkins and Harness Can More ❯
quality. The Service Delivery Manager will be responsible for ensuring our technical teams meet their service level objectives, driving operational excellence, and maintaining strong relationships with internal and external stakeholders. You will play a vital part in More ❯
and operations. Write maintainable, well-tested, high-quality code and uphold engineering best practices. Focus on observability and maintain Service Level Objectives. Take operational responsibility for the Identity Platform, including joining the on-call rota. Foster a strong engineering culture through More ❯
and service automation. Lead the definition and track Service Level Objectives (SLO) to measure service availability in combination with service, product and engineering communities. Collaborate with product and engineering More ❯
Worcester, Worcestershire, West Midlands, United Kingdom
FBI &TMT
Collaborate with stakeholders, software engineers, QA engineers, and ADMs Manage backlog ownership and ensure discipline within the team Define Service Level Objectives (SLOs) for measuring service performance Job Requirements: Experience in a technical BA role or similar More ❯
improve ways of working and contribute to team, department and divisional continuous improvement projects aimed to drive operational efficiency, deliver on KPIs, SLAs, SLOs & SLIs, financial targets and great member experience and outcome. Identify emerging trends and define strategies for adopting new technologies within the engineering domain. More ❯
key member of the SRE leadership team. Lead the definition and track Service Level Objectives (SLO) to measure service availability in combination with service, product and engineering communities. Collaborate with product and engineering More ❯
customers in their security infrastructure design and planning. You will be regularly completing deployment projects on or before expected Service Level Objectives (SLOs) and integrating new systems into existing network architecture. On the support side, you'll efficiently manage customer support More ❯
The Underwriting Support Supervisor will oversee the day-to-day operations of an Associate Underwriter Team and will manage service level objectives to ensure proper support is being provided to the Underwriting Team. Ideally located in our West Chester, OH offices More ❯
the security operations team. They will also be proficient in using multiple ticketing systems to manage incidents effectively, ensuring service level objectives are adhered to. Experience utilising Kusto Query Language (KQL) for log analysis will also be beneficial. This is a More ❯
tight cooperation inside and outside of your immediate team. Build trust and reliability in your products, review performance against service level objectives, address incidents and prioritize improvements. Qualifications: Not all applications will have skills that match a job description exactly. Ciptex More ❯
levels. Collaborate with team members to identify service level indicators, establish service level objectives, and error budgets with stakeholders. Maintain high technical expertise in one or more domains, proactively resolving technology bottlenecks. Serve … of software applications and technical processes, with emerging expertise in specific disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Knowledge of CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containers and orchestration tools like Docker, Kubernetes, ECS. More ❯
experience with Reliability concepts to ensure high performance and high service availability, able to define, implement and improve business performance SLOs. 2+ years of experience with Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and … of applications and services running on IaaS and PaaS in Microsoft Azure. AWS and GCP are nice to have. Service Level Objectives and indicators focused on improving business workflow performance and availability. Technical and business dashboards, metrics, and actionable alerting. Processes More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and PagerDuty. Excellent knowledge of More ❯