Senior Infrastructure Support Engineer
- Hiring Organisation: Nscale
- Location: Peterborough, Cambridgeshire, UK
- Employment Type: Full-time
network layers in production.
- Kubernetes. Operate and troubleshoot K8s clusters, and understand how physical resources are abstracted up the stack to K8s.
- GPU platforms (NVIDIA and AMD). Practical experience with GPU drivers and GPU log investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large-scale clusters.
- Observability