Site Reliability Engineer - Data Centers
- Hiring Organisation
- TGS International Group
- Location
- Portsmouth, England, United Kingdom
release Perform functional and reliability testing of GPUs, servers, and associated components Verify network connectivity and performance, including high-speed interconnects where applicable Orchestration & Benchmarking Provision and configure GPU clusters using automated workflows Execute and analyse performance and stability benchmarks orchestrated via a workload scheduler Validate results against expected performance ...