1 of 1 Proactive Monitoring Jobs in Bedford

Senior HPC Cluster Engineer

Hiring Organisation
Nebius
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
hardware into the existing infrastructure, including support for new GPU hardware through software stacks like Kubernetes, QEMU, and KVM. Enhancing automation systems for proactive monitoring, detecting, and resolving issues in GPU and InfiniBand environments. Configuring and managing GPU devices and InfiniBand fabrics, ensuring efficient and reliable operation. ...