Site Reliability Engineer
eDV Site Reliability EngineerCheltenham (Hybrid 3–4 days onsite)Contract (Outside IR35) or Permanent£500–£650 per day (contract) | Competitive perm salaryActive DV clearance required - Intelligence
OverviewWe’re looking for an experienced Site Reliability Engineer to support and scale mission-critical platforms used across high-profile government environments. Key Responsibilities• Improve reliability and performance across multiple services• Automate operational tasks and reduce alert noise• Enhance monitoring and observability to prevent incidents• Support development and test environments• Contribute to infrastructure, CI/CD, and platform improvements• Participate in an on-call rota when required
Essential Skills• Terraform and infrastructure automation (Ansible/Chef or similar)• Containers & orchestration (Docker, Kubernetes, OpenShift, etc.)• CI/CD tooling (Jenkins or similar)• Monitoring & observability (Prometheus, Grafana, InfluxDB)• Messaging systems (RabbitMQ / AMQP)• Linux administration & scripting• AWS experience (EC2, RDS, S3, Lambda)• Strong understanding of networking and security fundamentals
Nice to Have• Coding experience (Java, Go, Python)• Secure or cross-domain system experience• Service management or operations background• Azure exposure