Site Reliability Engineer

Site Reliability Engineer (SRE)

Location: Gloucester (Hybrid, 3 days onsite)

Salary: Up to £65,000 + £7,000 bonus

Security Clearance: Must be eligible for UK Developed Vetting (DV)

We’re hiring a Site Reliability Engineer to join a high-performing engineering environment delivering critical, complex systems. This role sits at the intersection of software engineering and operations, with a strong focus on automation, scalability, and system resilience.

This is an excellent opportunity for someone with a software engineering background who is looking to move into a more systems-focused, reliability-driven career path without losing their hands-on technical edge.

As an SRE, you’ll be responsible for ensuring the reliability, availability, and performance of mission-critical systems. You’ll apply software engineering principles to infrastructure and operations challenges, reducing manual effort through automation and improving system design.

Key Responsibilities Include:

Supporting and maintaining live services, ensuring high availability and performance
Automating operational processes to reduce manual intervention
Monitoring, alerting, and observability improvements across systems
Diagnosing and resolving incidents across the full technology stack
Working closely with engineering teams to influence system design and reliability
Participating in an on-call rota (project-dependent)
Contributing to continuous improvement of DevOps and SRE practices

What We’re Looking For

We’re interested in candidates who bring a strong engineering mindset and enjoy solving complex systems problems.

Core Experience:

2+ years commercial experience in this area
Background in software engineering (e.g. Java, JavaScript, or similar)
Experience working with cloud platforms (AWS, Azure, or similar)
Strong Linux/Windows command line skills (Bash, PowerShell)
Understanding of distributed systems, scalability, and resilience
Experience with monitoring/observability tools (e.g. ELK stack or similar)
Familiarity with containers and microservices (e.g. Docker)
Experience troubleshooting across infrastructure and application layers

Desirable:

Exposure to 2nd or 3rd line support environments
Knowledge of CI/CD and deployment tooling
Experience with infrastructure as code or configuration management tools
Understanding of ITIL or service management practices

Additional Requirements

Willingness to participate in on-call support (depending on project)

If you’re a software engineer looking to broaden your impact into reliability, systems, and large-scale infrastructure, this role offers a strong platform to do exactly that.

Apply Now

Site Reliability Engineer

Job Details