Lead Cloud Site Reliability Engineer (SRE)

Job Description –

We’re looking for a Lead Cloud Site Reliability Engineer (SRE) with strong expertise in Azure, Kubernetes, Terraform, and GitHub to lead large-scale projects and mentor a growing team.

Key Responsibilities

  • Lead SRE activities for large-scale cloud projects, providing technical guidance to engineers.
  • Deliver solutions across VMs and Kubernetes , ensuring efficient deployment, scaling, and management.
  • Implement CI/CD pipelines using GitHub Actions or similar tools.
  • Design and manage Infrastructure as Code (IaC) using Terraform (preferred), Ansible, Jenkins, etc.
  • Assess networking requirements and design secure solutions (load balancing, firewalls, routing).
  • Troubleshoot and resolve complex cloud infrastructure and application issues.
  • Mentor junior engineers and promote knowledge sharing within the team.
  • Collaborate with stakeholders, vendors, and cross-functional teams (Cyber Security, Testing, Application).
  • Support cloud migration initiatives using frameworks like CAF, AzureRM, Google Cloud .
  • Represent the team during project delivery and ensure adherence to change control processes.
  • Participate in 24/7 on-call support rota and occasional support for previous adoption work.

What We’re Looking For

  • Strong DevOps background with automation-first mindset
  • Expertise in Azure, Kubernetes, Terraform, GitHub
  • Experience in cloud migration and networking solutions
  • Ability to lead projects and communicate effectively
  • Familiarity with change control processes

Nice to Have

  • Cloud certifications (Azure, GCP, etc.)
  • Experience with multi-Tenant solutions
  • Passion for continuous learning and innovation

Job Details

Company
Response Informatics
Location
London, UK
Posted