Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
RVU Co UK
platform and reliability engineering approaches (SRE), including observability, performance optimisation, capturing analytics and security best practices. Experience implementing Service Level Objectives and using them to drive error budgets, risk management and alerting. Knowledge and experience with operating containers at scale within the Kubernetes ecosystem. Experience in building robust, fault-tolerant systems and identifying areas for improvement all in the … support of maintaining reliability standards. Knowledge of incident and problem management practices. Proficiency in the software development lifecycle, including testing, version control, branching strategies and deployment approaches using modern software technologies. Ability to debug production code to identify root causes of failure that lead to improvement. Experience building and designing APIs or CLIs that solve internal problems or
Cardiff, South Glamorgan, Wales, United Kingdom Hybrid / WFH Options
Confused.com
platform and reliability engineering approaches (SRE), including observability, performance optimisation, capturing analytics and security best practices. Experience implementing Service Level Objectives and using them to drive error budgets, risk management and alerting. Knowledge and experience with operating containers at scale within the Kubernetes ecosystem. Experience in building robust, fault-tolerant systems and identifying areas for improvement all in the … support of maintaining reliability standards. Knowledge of incident and problem management practices. Proficiency in the software development lifecycle, including testing, version control, branching strategies and deployment approaches using modern software technologies. Ability to debug production code to identify root causes of failure that lead to improvement. Experience building and designing APIs or CLIs that solve internal problems or