Platform Engineer
- Hiring Organisation: CATCHES
- Location: Worcester, Worcestershire, UK
- Employment Type: Full-time
- ... lifecycle of high-performance GPU clusters using Terraform and Ansible.
- Maintain the stability and performance of large-scale Linux environments supporting AI/ML training workloads.
- Collaborate with vendors and internal teams to troubleshoot hardware and networking bottlenecks (latency, throughput).
- Implement monitoring solutions (Prometheus/Grafana) to visualise ...