large language models, efficient computing based on low-precision arithmetic, deep learning models including large generative models for language, vision and other modalities . Experience writing C Triton/CUDA kernels for performance optimisation of ML models. Have contributed to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish More ❯
on ML infrastructure - 8+ years of current programming experience building ML infrastructure using languages such as Python, C++ or Rust - Hands-on experience with parallel computing platforms such as CUDA, OpenMP, etc - Deep understanding of AI frameworks such as PyTorch, TensorFlow, and JAX, and their demands on underlying compute infrastructure, memory bandwidth, network interconnect, and storage as scale goes More ❯
analysis to prototype quickly Desirable Experience Experience with TensorRT , Nvidia Deepstream , or other deployment frameworks Background in neural network design or edge inference Programming in C/C++ and CUDA Realtime or embedded vision applications Why Join AssetCool? Tackle some of the toughest challenges in robotics, vision, and infrastructure tech Join a growing team with global ambitions and a More ❯
high-impact initiatives and push the boundaries of model performance. You'll also work on re-implementing models in an efficient manner by using PyTorch and underlying technologies like Cuda Kernels, Torch compilation techniques. This would include: Evaluating and optimising compute resource usage (e.g., Hopper GPUs) for cost and time efficiency at training and inference times. Driving the adoption More ❯
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
City of London, London, United Kingdom Hybrid / WFH Options
European Tech Recruit
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
South East London, England, United Kingdom Hybrid / WFH Options
European Tech Recruit
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
in Computer Science, Engineering, Machine Learning, Artificial Intelligence, or a related field. Strong professional experience coding in modern C++ (advanced level). Practical expertise with GPU programming, specifically using CUDA or OpenCL. Solid background in debugging, optimization, and performance tuning. Clear communicator with a collaborative and proactive working style. Hands-on experience in image-based 3D reconstruction techniques such More ❯
Nottingham, Nottinghamshire, East Midlands, United Kingdom
ETS Technical Selection
at a low level. Proven ability to create parallelizable algorithm implementations for real-time video processing. Strong coding skills in C/C++. Desirable Skills and Abilities: Experience writing CUDA kernel code. Proficiency in optimizing algorithms for speed during both design and implementation stages. Familiarity with camera calibration and 3D reconstruction techniques. Strong presentation and communication skills, especially when More ❯
Basingstoke, Hampshire, United Kingdom Hybrid / WFH Options
Hawk-Eye Innovations Ltd
Haves: Python (strong programming fundamentals) Modern C++ (C+/20) TensorRT for model optimisation PyTorch, PyTorch-Ignite Linux & Windows 10 experience GIT and collaborative software development Nice-to-Haves: CUDA OpenCV CMake & Visual Studio Typescript & Semantic UI React SSH and secure deployment workflows Bonus Skills: QT, JIRA, Confluence ClearML TeamCity for CI/CD What We Value: At Hawk More ❯
not required. Below is a detailed breakdown of all the technologies we use: Backend: Python Frontend: Typescript and React Kubernetes for deployment GCP for underlying infrastructure Machine Learning: PyTorch, CUDA, Ray We encourage people from all backgrounds, cultures, and skill levels to apply. It is okay to not meet all requirements listed as we are looking for individuals who More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
IC Resources
CUDA Kernel Developer £80,000 - £90,000 + bonus & hybrid working! I'm currently working with a Cambridge-based, multinational Semiconductor scale-up who are focused on developing AI accelerators. You will have the opportunity to work in a rapidly changing environment where your new ideas will become innovative products, services, and customer experiences. They are a successful, growing … business, offering the chance for an engineer to progress their career and achieve future aspirations. They provide a stable and supportive environment. They are looking for a CUDA Kernel Developer to develop and optimise high-performance kernels for ML operators on NPU architectures. They are looking for an exceptional engineer to join a talented team of 5 engineers at … and accelerators specialised for Ai applications. You will also collaborate with the hardware and software teams to integrate kernels into the NPU framework. What's required for a successful CUDA Kernel Developer? Extensive experience in kernel development projects for GPUs Involvement in OpenCL, CUDA or similar parallel programming languages Understanding of ML frameworks - TensorFlow, PyTorch etc Strong C++ More ❯
CUDA Kernel Developer £80,000 - £90,000 + bonus & hybrid working! I'm currently working with a Cambridge-based, multinational Semiconductor scale-up who are focused on developing AI accelerators. You will have the opportunity to work in a rapidly changing environment where your new ideas will become innovative products, services, and customer experiences. They are a successful, growing … business, offering the chance for an engineer to progress their career and achieve future aspirations. They provide a stable and supportive environment. They are looking for a CUDA Kernel Developer to develop and optimise high-performance kernels for ML operators on NPU architectures. They are looking for an exceptional engineer to join a talented team of 5 engineers at … and accelerators specialised for Ai applications. You will also collaborate with the hardware and software teams to integrate kernels into the NPU framework. What's required for a successful CUDA Kernel Developer? Extensive experience in kernel development projects for GPUs Involvement in OpenCL, CUDA or similar parallel programming languages Understanding of ML frameworks - TensorFlow, PyTorch etc Strong C++ More ❯
to-end AI solutions at scale. Essential Requirements: Proven experience as an Engineering Manager delivering complex engineering projects. Expertise in developing GPU kernels and/or ML compilers (e.g., CUDA, OpenCL, TensorRT, MLIR, TVM). Experience optimizing systems to meet strict utilization and latency requirements. Excellent interpersonal and communication skills. Desirable: Experience with C++ and ML frameworks such as More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
to-end AI solutions at scale. Essential Requirements: Proven experience as an Engineering Manager delivering complex engineering projects. Expertise in developing GPU kernels and/or ML compilers (e.g., CUDA, OpenCL, TensorRT, MLIR, TVM). Experience optimizing systems to meet strict utilization and latency requirements. Excellent interpersonal and communication skills. Desirable: Experience with C++ and ML frameworks such as More ❯
design patterns and Agile software development practices Experience with Linux programming , scripting, and system configuration Strong background in unit testing , system testing, and version control (e.g. Git) Desirable Experience CUDA GPU programming and familiarity with Jetson devices Nvidia SDKs such as VPI, Deepstream or Jetson SDK Real-time image/data processing pipelines Parallel programming, optimisation, and algorithmic design More ❯
and platform teams Troubleshoot and optimise GPU usage across a variety of hardware and OS environments What they’re looking for Solid experience with low-level GPU programming using CUDA, Vulkan, OpenCL, Metal or similar Strong C or C++ skills and a background in systems or performance engineering Deep understanding of how modern GPUs work, including memory and compute More ❯
and platform teams Troubleshoot and optimise GPU usage across a variety of hardware and OS environments What they’re looking for Solid experience with low-level GPU programming using CUDA, Vulkan, OpenCL, Metal or similar Strong C or C++ skills and a background in systems or performance engineering Deep understanding of how modern GPUs work, including memory and compute More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯