DevOps Site Reliability Engineering (SRE) - HO - SEO

The Reliability Enablement team helps Data Services & Analytics (DSA) teams improve their product and service reliability by providing observability and embedding Site Reliability Engineering (SRE) principles. You will be a key part of the team, working on engagements with product teams and helping grow SRE culture within the organisation.
Job description

The DevOps (SRE) is responsible for improving the reliability of our platforms and services. Your role is proactive, ensuring relevant metrics are being measured and reliability improvements are identified and implemented when necessary. This will ensure the reliability and availability of services for users.

You will also advise developers on how to use platforms and tools effectively, reviewing and advising on their use of CI/ CD pipelines and observability tooling. You may also work to deliver new platform tooling.

Tools and Technologies we use: 

We are keen for Engineers to continue learning new technologies, we have a large range in the Home Office including:

  • Backend: Java, Node.js, C#, Python, PHP, Scala, Power Platform 
  • Frontend: React, JavaScript, Typescript, Angular? 
  • Data: PostgreSQL, Microsoft SQL Server, MongoDB, Apache Kafka, Neo4J, Amazon Athena
  • DevOps: AWS, Kubernetes, Azure, Jenkins, Docker, Ansible, Terraform, Dynatrace   

    What you will do 

    Your main day to day responsibilities will be: 

    • Supporting teams to effectively build, improve and deploy reliable and secure services
    • Building new or improved shared tooling to help teams automate and maximise reliability
    • Spotting instances where teams are not using best practice and advising on how to improve
    • Supporting engineers to design new services; helping to define test and deployment pipelines
    • Helping teams improve their integration approaches; increasing reliability and the value delivered to users 

      Like many organisations we need to maintain our services 24/ 7, therefore, on occasions there may be a requirement to work out of hours, for which you will be paid an additional allowance.  

      Person specification

      UK residency and security requirements -  You need to have lived in the UK for the past 5 years .  

      Essential Criteria 

      As a DevOps (SRE), you will have experience of: 

      • Designing and implementing reliable cloud solutions using AWS or Azure according to best practices. (Software design - SWDN) 
        • Implementing automated testing, scanning and code analysis tooling, according to best practices. (Testing - TEST) 
          • Implementing and using application monitoring tooling to identify and respond to problems early. (Application support - ASUP) 
            • Designing, coding, testing, maintaining and documenting scripts and infrastructure-as-code definitions to automate build and deployment activities. (Programming/ software development - PROG) 
              • Implementing and promoting use of CI/ CD pipelines according to best practices. (Systems integration and build - SINT) 
                • Implementing data management best practices for cloud resources, such as naming, tagging, metadata, backups, and documentation. (Data management?- DATM) 

                  SFIA capability framework 

                  Skills for the Information Age (SFIA) is the technical framework that sets the standard capability and development of all engineering levels in the Home Office. This is a link to the capability framework:  All skills A-Z — English (sfia-online.org) 

                  We use set SFIA technical skills to form our interview questions and we will assess you against these technical skills during the selection process.  

                  SFIA levels of responsibility - Use the SFIA  Levels of responsibility to understand what would be expected for each Technical Skill listed below.  

                  SFIA Technical Skills  

                  The essential technical skills listed above are reflective of the Home Office Government Digital and Data Profession Career Framework. Please see below for the relevant skills required for your role.

                  Behaviours

                  We'll assess you against these behaviours during the selection process:

                  • Changing and Improving
                    Technical skills

                    We'll assess you against these technical skills during the selection process:

                    • Software design (SWDN) - Level 3
                    • Programming/ Software development (PROG) - Level 3
                    • Testing (TEST) - Level 3
                    • Systems integration and build (SINT) - Level 3
                    • Data management?(DATM) - Level?3
                    • Application support (ASUP) - Level 3
Company
Government Digital & Data
Location
Croydon and Sheffield, Croydon, London
Employment Type
Permanent
Salary
£44,720 - £52,130
Posted
Company
Government Digital & Data
Location
Croydon and Sheffield, Croydon, London
Employment Type
Permanent
Salary
£44,720 - £52,130
Posted