mentioned), PagerDuty/OpsGenie or similar, and Jenkins. NON-TECHNICAL REQUIREMENTS: Awareness of Site Reliability Engineering (SRE) principles, including ServiceLevelObjectives (SLOs), ServiceLevel Indicators (SLIs), and error budgets. Understanding of development more »
Yeovil, England, United Kingdom Hybrid / WFH Options
Education Horizons
and process control & monitoring to improve quality and efficiency. Ensuring that the SRE team is meeting the required SLOs (ServiceLevelObjectives) & SLAs (ServiceLevel Agreements) for their products & services. Ensuring maintenance is more »
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
working from home policy. Preferred Skills, Qualifications and Experience Excellent knowledge of SRE principles, including the creation and management of effective SLI’s and SLO’s for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and Pager Duty. Excellent knowledge more »
Farnborough, Hampshire, South East, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
a must. Excellent understanding of networking principles (IP addressing, virtual networks, network security and networking models). Understanding of observability and site-reliability principles (SLO's, SLI's) and working with engineering teams to improve the applications and platform. Good understanding of SQL and working with relational databases. more »
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Evri
security, and cost optimisation Supporting your team in dealing with operational issues such as availability, performance, and scalability Influencing ServiceLevelObjectives, Non-Functional Requirements, and infrastructure requirements Highlighting deviations from technology standards to the TDA (Technical Design Authority) Ensuring that … the ServiceLevelObjectives in your area are met Helping to develop and promote the SRE service catalogue Ensuring the best security practices are followed Supporting and developing junior members of the team Capturing the SLIs and more »
Employment Type: Permanent, Part Time, Work From Home
a Senior Site Reliability Engineer with deep Google Cloud (GCP) experience, to join our customer’s organisation. Responsibilities Influencing ServiceLevelObjectives, Non-Functional Requirements, and infrastructure requirements Ensuring that the ServiceLevel … the SLOs Reliability concepts to ensure high performance and high service availability, able to define, implement and improve business performance SLO’s. Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) Maintain existing compliance and governance standards more »
Support Analyst (UK) About Adaptiva Adaptiva, the Autonomous Endpoint Management company, delivers the fastest way to patch and manage endpoints at scale. The company offers OneSite, the first fully adaptive autonomous endpoint management (AEM) platform. At Adaptiva, we pride ourselves more »