Oxford, England, United Kingdom Hybrid / WFH Options
ic resources
and optimisation to maximise performance on specialised hardware. Strong understanding of hardware-software co-design principles, including memory hierarchy optimisation, cache management, and hardware/software partitioning. Knowledge of parallel computing principles and experience with parallelprogramming models (MPI, OpenMP) for distributed AI processing. Deep understanding of AI model architectures, algorithms, and frameworks (TensorFlow, PyTorch) and their More ❯
GCC) and graph traversals required for writing code for such a compiler. Solid software engineering skills-practicing software engineer, having significantly contributed to/shipped production code. Understanding of parallelprogramming-specifically as it pertains to GPUs. Strong willingness to learn Rust-as a Rust by default company, we require everyone to learn Rust so that they can … autonomy. Nice to haves: Architecture understanding-full understanding of a computer architecture specialized for training NN graphs (Intel Xeon CPU, GPUs, TPUs, custom accelerators). Rust experience-systems level programming experience in Rust. Open-source contributions to Compiler Stacks. Compilation understanding-strong understanding of compilation in regards to one or more High-Performance Computer architectures (CPU, GPU, custom accelerator More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
IC Resources
integrate kernels into the NPU framework. What's required for a successful CUDA Kernel Developer? Extensive experience in kernel development projects for GPUs Involvement in OpenCL, CUDA or similar parallelprogramming languages Understanding of ML frameworks - TensorFlow, PyTorch etc Strong C++ development skills Ability to work independently in a fast-paced environment If you are a CUDA Kernel More ❯