software on GitHub, PyPI, Anaconda Cloud, and Docker Hub, as well as use of Pytorch lightning, Git, test-driven design. Knowledge of parallel computing technologies, such as NVIDIA's CUDA platform, OpenCL, and OpenMPI. The salary range for Cambridge, UK: - Senior Scientist I, Computational Biology: £75,000 - £117,500 Senior Scientist II, Computational Biology: £94,000 - £152,500 Exact More ❯
sound engineering principles to ensure robust, maintainable solutions. PREFERRED EXPERIENCE: GPU Kernel Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming to maximize performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and More ❯
sound engineering principles to ensure robust, maintainable solutions. PREFERRED EXPERIENCE: GPU Kernel Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming to maximize performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Good oral and written English skills "Nice To Have" Skills and Experience : Experience with ML software frameworks (e.g. PyTorch) Familiarity with ML hardware accelerators (e.g. NPUs, TPUs, GPUs with CUDA support) Knowledge of optimising and profiling software Experience with assembly programming Software development and integration on Linux, Android, or similar systems Knowledge of scripting languages, including Python In Return More ❯
Familiarity with a broad range of ML techniques (computer vision, generative models, audio processing, etc.), or the ability to adapt quickly to new areas. Exposure to GPU programming (e.g., CUDA) is a plus. Ability to communicate complex technical ideas clearly to colleagues and stakeholders. Client-facing experience or a willingness to engage with stakeholders and project teams is highly More ❯
specialise in specific areas of the system based on their skills and areas of interest. The main application software is written in standard and multi-threaded C++ with some CUDA for processing. The QT framework is used for the GUI. The software is performance critical as it is a real-time processing system that must respond to user input More ❯
deploying machine learning onto a range of hardware from resource constrained embedded systems through to edge computing is desirable. As is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work More ❯
deploying machine learning onto a range of hardware from resource constrained embedded systems through to edge computing is desirable. As is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work More ❯