DevOps Engineer/Site Reliability Engineer (SRE)
Role Overview
We are seeking an experienced Cloud Engineer/SRE to play a key role in delivering and evolving cloud-based services. Reporting to the Cloud Platform Lead, you will be responsible for leading the technical delivery of cloud solutions across a range of initiatives.
This role combines hands-on engineering, technical leadership, and cloud platform ownership. You may work centrally on shared cloud platforms or embed within product teams depending on project needs. You will also support existing cloud environments and previously delivered adoption work.
As a senior member of the team, you will take ownership of complex projects, provide guidance to other engineers, and promote best practices in cloud adoption, automation, and reliability engineering.
Key Responsibilities
Cloud Engineering & Delivery
- Lead the design and implementation of cloud solutions using both traditional infrastructure and cloud-native technologies.
- Deliver and support workloads running on Virtual Machines and Kubernetes platforms.
- Ensure solutions are secure, scalable, resilient, and aligned with cloud best practices.
- Support cloud migration initiatives using recognised industry frameworks.
DevOps & Automation
- Design, implement, and maintain CI/CD pipelines using GitHub Actions or equivalent tooling.
- Develop and manage Infrastructure as Code (IaC) using tools such as Terraform (preferred), Ansible, or similar.
- Drive automation-first approaches across build, deployment, and operational processes.
Site Reliability & Operations
- Act as the SRE lead on major initiatives, ensuring reliability, availability, and performance of services.
- Troubleshoot and resolve complex cloud infrastructure and application issues.
- Participate in a 24/7 on-call support rota, supporting live services and core cloud platforms.
- Promote and enforce robust change control and operational processes.
Technical Leadership & Collaboration
- Provide technical leadership and mentoring to junior and mid-level cloud engineers.
- Review designs, project plans, and delivery approaches, constructively challenging where required.
- Work closely with Project Managers, Cyber Security, Testing, Networking, and Application teams.
- Represent the cloud engineering team during project delivery and stakeholder discussions.
Documentation & Governance
- Produce high-quality technical design documents, operational handover documentation, and runbooks.
- Ensure adherence to organisational standards, security policies, and governance frameworks.
- Advocate for best practices in cloud architecture, DevOps, and SRE disciplines.
Essential Skills & Experience
- Proven experience leading cloud or SRE initiatives in complex environments.
- Strong DevOps background with hands-on cloud delivery experience.
- Solid expertise in cloud platforms (Azure preferred), Kubernetes, Terraform (IaC), and GitHub/GitHub Actions.
- Experience delivering solutions on both VM-based and containerised platforms.
- Strong understanding of networking concepts, including load balancing, firewalls, routing, and network security.
- Experience supporting and troubleshooting production cloud environments.
- Comfortable working in Agile or project-based delivery environments.
- Willing and able to participate in a 24/7 on-call support rota.
Desirable Skills & Experience
- Cloud certifications (training can be provided if required).
- Experience designing or supporting multi-tenant cloud solutions.
- Exposure to cloud migration frameworks (e.g. Azure CAF or equivalent).
- Strong interest in continuous learning and staying current with cloud and DevOps technologies.
- Experience driving continuous improvement across platforms and processes.