Hereford, Herefordshire, West Midlands, United Kingdom Hybrid/Remote Options
Twinstream Limited
ensuring the availability, performance, and resilience of our secure, high-impact services. You'll work with development and support teams to evolve infrastructure, streamline delivery pipelines, and strengthen system observability, ensuring performance bottlenecks and reliability risks are resolved before they ever reach production. Expect a technically rich environment, diverse challenges, and the opportunity to make a measurable difference. Key Responsibilities … Reliability Engineer: Partner with Software Engineers to enhance reliability and performance across complex systems. Collaborate with SysAdmins to automate toil and eliminate manual intervention. Build smarter monitoring, logging, and observability pipelines to detect and resolve issues early. Support and improve development environments to hit delivery and quality goals. Research new tools, services, and architectures to drive scalability and resilience. Expand … Ansible, Chef, etc.). Skilled with Docker and Kubernetes/OpenShift/Docker Swarm. Hands-on experience building and maintaining CI/CD pipelines (e.g. Jenkins). Deep understanding of monitoring & observability tools (Grafana, Prometheus, InfluxDB). Solid grounding in Linux, network security, SQL, and AWS (EC2, S3, RDS, Lambda). Comfortable with MQ messaging (RabbitMQ or similar). Bonus points for: Experience with Azure …
Hereford, Herefordshire, West Midlands, United Kingdom Hybrid/Remote Options
Hays
focused on ensuring service availability, performance, and cost-efficiency across both cloud and on-prem infrastructure. You'll work closely with development and support teams to evolve infrastructure, enhance observability, and proactively mitigate reliability risks. Key Responsibilities: Collaborate with software engineers to improve reliability and performance. Automate operational tasks and reduce alert fatigue. Enhance monitoring and observability to pre-empt … platforms, ideally AWS (EC2, RDS, S3, Lambda). Desirable: Coding experience in Java, Go, Python or similar. Knowledge of cross-domain technologies. Experience in service management environments. Practical application of observability patterns. Experience with Azure. Additional Information: Due to the nature of the work, successful candidates will be required to undergo security vetting. We welcome applications from all backgrounds and are …
Hereford, Herefordshire, England, United Kingdom Hybrid/Remote Options
Hays Specialist Recruitment Limited
role focused on ensuring service availability, performance, and cost-efficiency across both cloud and on-prem infrastructure. You'll work closely with development and support teams to evolve infrastructure, enhance observability, and proactively mitigate reliability risks. Key Responsibilities: Collaborate with software engineers to improve reliability and performance. Automate operational tasks and reduce alert fatigue. Enhance monitoring and observability to pre-empt issues. Support development environments … protocols. Experience with cloud platforms, ideally AWS (EC2, RDS, S3, Lambda). Desirable: Coding experience in Java, Go, Python or similar. Knowledge of cross-domain technologies. Experience in service management environments. Practical application of observability patterns. Experience with Azure. Additional Information: Due to the nature of the work, successful candidates will be required to undergo security vetting. We welcome applications from all backgrounds and are committed to creating …