Triton, etc.). Expertise in tailoring algorithms and ML models to exploit GPU strengths and minimize weaknesses. Knowledge of low-level GPU programming (CUDA, OpenCL, etc.) and performance tuning techniques. Understanding of modern GPU architectures, memory hierarchies, and performance bottlenecks. Ability to develop and utilize sophisticated performance models and More ❯
high-performance and energy-efficient compute platforms for modern AI workloads. You'll be working on a flagship GPU and AI platform supporting PyTorch, OpenCL, and Vulkan, designed to bring scalable, efficient AI capabilities to developers and researchers across the industry . Role Overview: As a Software Engineer - AI More ❯
strong focus on memory management, multi-threading, and low-level performance optimizations. Experience with GPU architectures (e.g., NVIDIA, AMD) and programming frameworks like CUDA, OpenCL, and TensorFlow. Understanding of machine learning algorithms, including model training and inference, and how to optimize these for GPU-based computation. Strong knowledge of More ❯
strong focus on memory management, multi-threading, and low-level performance optimizations. Experience with GPU architectures (e.g., NVIDIA, AMD) and programming frameworks like CUDA, OpenCL, and TensorFlow. Understanding of machine learning algorithms, including model training and inference, and how to optimize these for GPU-based computation. Strong knowledge of More ❯
and compute workloads on cutting-edge GPU architectures. About you: 6 years+ as a driver engineer Strong C++ or C programming skills Experience with OpenCL or Vulkan (other graphics APIs are also fine) Strong knowledge of GPU development Full details are available. Please don't hesitate to get in More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Platform Recruitment
and compute workloads on cutting-edge GPU architectures. About you: 6 years+ as a driver engineer Strong C++ or C programming skills Experience with OpenCL or Vulkan (other graphics APIs are also fine) Strong knowledge of GPU development Full details are available. Please don't hesitate to get in More ❯
programming skills Familiar with one of more GPU architecture Understanding of compute library frameworks Knowledge of at least one GPU programming model (E.g., CUDA, OpenCL, HIP, SYCL) Full details are available. Please don't hesitate to get in touch with max@platform-recruitment.com to learn more. More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Platform Recruitment
programming skills Familiar with one of more GPU architecture Understanding of compute library frameworks Knowledge of at least one GPU programming model (E.g., CUDA, OpenCL, HIP, SYCL) Full details are available. Please don't hesitate to get in touch with max@platform-recruitment.com to learn more. More ❯
South West London, London, United Kingdom Hybrid / WFH Options
La Fosse
tech experience: Background applying AI/ML in the sports domain for data generation or insights. Systems optimisation: Knowledge of GPU kernel development (CUDA, OpenCL, etc.), real-time system optimisation (e.g., Nvidia NSight), or experience working with embedded SoCs (Nvidia, Qualcomm, etc.). If you're interested in this More ❯
/ML in the sports domain to create insights or data. Advanced systems knowledge, such as: Developing GPU kernels or ML compilers (e.g., CUDA, OpenCL, TensorRT Plugins, MLIR, TVM ). System optimization for latency and utilization , using tools like Nvidia NSight . Working with embedded SoCs (e.g., Nvidia, Qualcomm More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Enigma
/ML in the sports domain to create insights or data. Advanced systems knowledge, such as: Developing GPU kernels or ML compilers (e.g., CUDA, OpenCL, TensorRT Plugins, MLIR, TVM ). System optimization for latency and utilization , using tools like Nvidia NSight . Working with embedded SoCs (e.g., Nvidia, Qualcomm More ❯
Xilinx Vitis to develop host application and customize firmware Employ RTL kernel flow for hardware integration Develop efficient interface/communication with kernel using OpenCL and/or Xilinx XRT API Algorithm understanding in C/C++, Python or Rust and ability to translate them to efficient RTL code More ❯
Remote, Marylebone High Street, Greater London, United Kingdom Hybrid / WFH Options
Andrecruit Group Ltd
Xilinx Vitis to develop host application and customize firmware Employ RTL kernel flow for hardware integration Develop efficient interface/communication with kernel using OpenCL and/or Xilinx XRT API Algorithm understanding in C/C++, Python or Rust and ability to translate them to efficient RTL code More ❯
and compute workloads on cutting-edge GPU architectures. About you: 6 years+ as a driver engineer Strong C++ or C programming skills Experience with OpenCL or Vulkan (other graphics APIs are also fine) Strong knowledge of GPU development Full details are available. Please don't hesitate to get in More ❯
If you are passionate, curious, and ready to make an impact, we are looking for you. JP Morgan spends more than $9 billion a year to be at the forefront of technological innovation. Leveraging petascale compute clusters, Quantitative Researchers develop More ❯