Nottingham, Nottinghamshire, East Midlands, United Kingdom
ETS Technical Selection
at a low level. Proven ability to create parallelizable algorithm implementations for real-time video processing. Strong coding skills in C/C++. Desirable Skills and Abilities: Experience writing CUDA kernel code. Proficiency in optimizing algorithms for speed during both design and implementation stages. Familiarity with camera calibration and 3D reconstruction techniques. Strong presentation and communication skills, especially when More ❯
vision Familiarity with deep learning frameworks (PyTorch, TensorFlow, scikit-learn) Knowledge of CPU/GPU architecture and optimisation Experience with tools such as Visual Studio, gcc, CMake, Git, OpenCV, CUDA, Halcon, etc. Python or C# knowledge is considered a plus Familiarity with quality assurance practices including automated testing Professional proficiency in both French and English is required What's More ❯
and platform teams Troubleshoot and optimise GPU usage across a variety of hardware and OS environments What they’re looking for Solid experience with low-level GPU programming using CUDA, Vulkan, OpenCL, Metal or similar Strong C or C++ skills and a background in systems or performance engineering Deep understanding of how modern GPUs work, including memory and compute More ❯
experience developing and deploying autonomous vehicle software on commercial automobiles, and/or knowledge of ASPICE, DriveOS, or AutoSAR. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Experience with QNX or similar real-time operating systems. A Master’s degree or greater in Computer Science, Electrical Engineering, or a related More ❯
to drive business growth. What You Will Do Enhance our CPU, GPU, HPC, and cloud infrastructure Implement upgrades, patching, and system enhancements Provide expertise with technologies such as Linux, CUDA, SLURM, Python etc. Innovate to maintain the highest standards for our technology stack Drive IT solutions that align with our business objectives Research and evaluate new technology solutions Collaborate More ❯
Deep understanding of CFD principles and numerical methods Background in meshing, multi-physics or flow networks Experience with parallel computing (e.g. OpenMP, MPI) Experience with GPU APIs such as CUDA Experience in an Agile development team Experience with highly technical/scientific software, scientific visualization and/or CAD Ability to communicate with stakeholders at all levels Methodical and More ❯
London, England, United Kingdom Hybrid / WFH Options
Mistral AI
ML codebases Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed/FSDP/SLURM/K8s) Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops Strong software-design instincts: testing, code review, CI/CD Self-starter, low-ego, collaborative Benefits France Competitive cash salary and equity Food: Daily lunch More ❯
culture. When we feel supported in the workplace and at home, there's nothing we can't achieve. - Bachelor's degree in technical discipline with experience in accelerated compute, CUDA, and distributed training. - Experience in business development, product management, management consulting, and managing strategic partnerships. - Experience developing and executing on GTM strategies that are large in scope. - Working knowledge More ❯
success. You excel in ambiguous, fast-paced environments, adept at navigating and thriving amidst change. You get excited about optimizing pre-training runs, for example, including data pre-processing, CUDA optimization, model quantization and optimization, increasing throughput of training jobs (e.g., FP-8). (A plus) You have experience with MLOps or ML Infrastructure, reflecting your ability to streamline More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
IC Resources
CUDA Kernel Developer £80,000 - £90,000 + bonus & hybrid working! I'm currently working with a Cambridge-based, multinational semiconductor scale-up who are focused on developing AI accelerators. You will have the opportunity to work in a rapidly changing environment where your new ideas will become innovative products, services, and customer experiences. They are a successful, growing … business, offering the chance for an engineer to progress their career and achieve future aspirations. They provide a stable and supportive environment. They are looking for a CUDA Kernel Developer to develop and optimise high-performance kernels for ML operators on NPU architectures. They are looking for an exceptional engineer to join a talented team of 5 engineers at … and accelerators specialised for AI applications. You will also collaborate with the hardware and software teams to integrate kernels into the NPU framework. What's required for a successful CUDA Kernel Developer? Extensive experience in kernel development projects for GPUs Involvement in OpenCL, CUDA or similar parallel programming languages Understanding of ML frameworks - TensorFlow, PyTorch etc Strong C++ More ❯
London, England, United Kingdom Hybrid / WFH Options
Wayve
Wayve, we’re looking for the following skills and experience. Essential Proven experience as a technical lead or senior engineer on complex engineering projects Experience developing GPU kernels (e.g. CUDA, OpenCL, etc) Proficiency in C++ and ML frameworks such as PyTorch Excellent interpersonal and communication skills Ability to mentor and guide a team of engineers Desirable: Experience with ML More ❯
and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective root cause analysis and debugging techniques. • GPU Configuration and Support: Configure and optimize GPU resources using CUDA or other technologies for compute-intensive workloads. • Automated Testing and Deployment: Implement test and deployment automation using Jenkins, GitLab CI/CD, or similar tools. • Collaboration: Work closely with … for software designs, development processes, and interfaces to support long-term project continuity What You Bring: • Active TS/SCI clearance with Full Scope Polygraph • Experience with GPU/CUDA development for high-performance applications • Familiarity with message queue implementations and communication protocols • Proficiency with Linux system programming and development environments • Strong analytical and problem-solving mindset • Excellent verbal More ❯
with research methodology. Desired: Familiarity with current LLM architectures (e.g. Llama 3, DeepSeek V3) Familiarity with production LLM serving systems and inference optimizations (e.g. vLLM) Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL More ❯
of satellite communications system design and theory with implementation of concepts into software • Time and/or Frequency geolocation technologies and implementation • Experience with MATLAB • Experience with GPU/CUDA development More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture Confident knowledge of the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
as ResNet or U-Net for object detection or segmentation tasks using satellite imagery. • Demonstrated professional or academic experience with version control systems such as GitLab. • Demonstrated experience leveraging CUDA for GPU accelerated computing. Skills and abilities desired: • Demonstrated professional or academic experience with the HuggingFace Transformers library and hub. • Demonstrated experience with OpenShift and container orchestration within Kubernetes More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture Confident knowledge of the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
Hounslow, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12, OpenCL, CUDA, etc. Definition of high-level GPU architecture/micro-architecture Confident knowledge of the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
focus on training or inference systems Hands-on experience with real-time, low-latency ML pipelines in high-performance environments is a strong plus Strong engineering skills, including Python, CUDA, or C++ Knowledge of machine learning frameworks such as PyTorch, TensorFlow, or JAX Proficiency in GPU programming for training and inference acceleration (e.g., cuDNN, TensorRT) Experience with distributed training More ❯
Haves: Python (strong programming fundamentals) Modern C++ (C+/20) TensorRT for model optimisation PyTorch, PyTorch-Ignite Linux & Windows 10 experience Git and collaborative software development Nice-to-Haves: CUDA OpenCV CMake & Visual Studio TypeScript & Semantic UI React SSH and secure deployment workflows Bonus Skills: Qt, JIRA, Confluence ClearML TeamCity for CI/CD What We Value: At Hawk More ❯