Site Reliability Engineer (SRE)

Location: Gloucester (Hybrid, 3 days onsite)

Salary: Up to £65,000 + £7,000 bonus

Security Clearance: Must be eligible for UK Developed Vetting (DV)

We’re hiring a Site Reliability Engineer to join a high-performing engineering environment delivering critical, complex systems. This role sits at the intersection of software engineering and operations, with a strong focus on automation, scalability, and system resilience.

This is an excellent opportunity for someone with a software engineering background who is looking to move into a more systems-focused, reliability-driven career path without losing their hands-on technical edge.

As an SRE, you’ll be responsible for ensuring the reliability, availability, and performance of mission-critical systems. You’ll apply software engineering principles to infrastructure and operations challenges, reducing manual effort through automation and improving system design.

Key Responsibilities Include:

  • Supporting and maintaining live services, ensuring high availability and performance
  • Automating operational processes to reduce manual intervention
  • Improving monitoring, alerting, and observability across systems
  • Diagnosing and resolving incidents across the full technology stack
  • Working closely with engineering teams to influence system design and reliability
  • Participating in an on-call rota (project-dependent)
  • Contributing to continuous improvement of DevOps and SRE practices

What We’re Looking For

We’re interested in candidates who bring a strong engineering mindset and enjoy solving complex systems problems.

Core Experience:

  • 2+ years' commercial experience in this area
  • Background in software engineering (e.g. Java, JavaScript, or similar)
  • Experience working with cloud platforms (AWS, Azure, or similar)
  • Strong Linux/Windows command-line skills (Bash, PowerShell)
  • Understanding of distributed systems, scalability, and resilience
  • Experience with monitoring/observability tools (e.g. ELK stack or similar)
  • Familiarity with containers and microservices (e.g. Docker)
  • Experience troubleshooting across infrastructure and application layers

Desirable:

  • Exposure to 2nd or 3rd line support environments
  • Knowledge of CI/CD and deployment tooling
  • Experience with infrastructure as code or configuration management tools
  • Understanding of ITIL or service management practices

Additional Requirements

  • Willingness to participate in on-call support (depending on project)

If you’re a software engineer looking to broaden your impact into reliability, systems, and large-scale infrastructure, this role offers a strong platform to do exactly that.

Job Details

Company: Anson McCade

Location: Gloucester, England, United Kingdom (Hybrid / Remote Options)