2 of 2 Parallel Programming Jobs

Senior System Software Engineer, NCCL - Partner Enablement

Hiring Organisation
NVIDIA
Location
United Kingdom
What We Need To See: B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM). Excellent C/C++ programming skills, including debugging, profiling, code optimization … infrastructure on HPC clusters. Prior system administration experience, especially for large clusters. Experience debugging network configuration issues in large-scale deployments. Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning frameworks such as PyTorch, TensorFlow. Deep understanding of technology ...

Distinguished Software Architect - Deep Learning and HPC Communications

Hiring Organisation
NVIDIA
Location
United Kingdom
Computer Engineering or related field or strong equivalent experience; 15+ years of relevant experience in academia or the industry. Expert in the following areas: HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA … network debug and performance analysis. Strong in at least a few of these areas: ML/DL fundamentals and how they tie to communications, parallel algorithms, fault tolerance and resiliency, competitive assessments, performance analysis and optimizations for parallel applications on large clusters, developing applications using DL Frameworks (PyTorch ...