high-performance CUDA kernels for matrix operations and numerical solvers Profiling and optimizing GPU execution using NVIDIA tooling (e.g., qdss , Nsight Systems/Compute) Working with large-scale matrix algebra , linear equation solving, iterative solvers, and sparse/dense matrix handling Adapting existing CPU-based simulation code to GPU environments Ensuring numerical stability and precision in GPU-accelerated … handover of GPU-optimized modules Optional: contribution to Jetson-based environments if needed Required Skills Strong experience in CUDA development (custom kernels, memory management, warp optimization) Background in numerical linearalgebra , matrix operations, and solving systems of equations Experience with GPU-accelerated libraries such as: cuBLAS, cuSOLVER, cuSPARSE, Thrust , or similar Knowledge of NVIDIA debugging/profiling tools More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Forward Role
to-day, you'll: ? Deliver engaging teaching sessions covering the full ML lifecycle — data prep, model training, evaluation, deployment and monitoring at scale. ? Explain the maths behind ML models (linearalgebra, calculus, probability, stats) in an accessible, engaging way. ? Support learners throughout their apprenticeship journey alongside Learner Success Coaches. ? Contribute to content and product development — creating learning materials More ❯
small-scale system and process improvements to enhance functionality and efficiency. Qualifications Preferred: Applied Machine Learning experience (regression, classification, supervised, and unsupervised learning ) with a strong mathematical foundation in linearalgebra , calculus , probability, and statistics. Experience in time-series data analysis , including cleansing and normalization , and experience with scalable Machine Learning ( MapReduce , streaming). Software development expertise in More ❯
physical design of existing systems for optimizing performance. - Other duties may also be assigned DESIRED QUALIFICATIONS: Experience working with large scale open-source codebases Experience developing and working with linearalgebra packages (LAPACK, BLAS, ATLAS, Eigen) Experience working in high-performance computing Experience, or at least surface level knowledge/interest of single cell biology and analysis. Previous More ❯