highly regulated industries, preferably in medical device development Technical Expertise: Experience with multi-tasking systems (real-time preferable) and familiarity with signal processing or AI/ML applications using CUDA on GPUs (preferred), medical device communications protocols (HL7, FHIR) Development Approach: Knowledge of agile methodologies and best practices in software development Tools & Practices: Proficiency with version control systems (e.g. More ❯
industry experience. Expertise in translating complex machine learning algorithms into scalable, production-quality code, with proficiency in Python and a strong understanding of optimization techniques (experience with Cython and CUDA is a plus). Experience in developing Large Language Models (LLMs) is advantageous. In-depth understanding of computer architecture and its implications on AI/ML performance. Comprehensive knowledge More ❯
PyTorch internals and other major ML frameworks. Experience optimizing deep learning performance on accelerator hardware. Solid knowledge of deep learning algorithms and compute patterns. Strong programming skills in C++, CUDA, or OpenCL. Background in performance profiling and optimization. BS/MS in Computer Science, Electrical Engineering, or a related field. Interested? Send your CV to to apply. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
City of London, London, United Kingdom Hybrid / WFH Options
European Tech Recruit
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
South East London, England, United Kingdom Hybrid / WFH Options
European Tech Recruit
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
products with the latest machine learning advancements. Requirements include strong programming skills in Python, C, C++, experience with deployment platforms, and familiarity with NLP, computer vision, TensorFlow, PyTorch, JAX, CUDA, LLMs, and related technologies. A degree in a relevant field and a solid AI R&D track record are essential. More ❯
collaboratively, thrive in ambiguity, and take full ownership of what you build. Key technical skills Strong back-end development experience (Python, Node.js ) Working knowledge of C++ and GPU computing (CUDA, OpenCL) Proven ability to design, build, and maintain robust APIs Proficiency with cloud platforms (e.g. AWS, GCP, or Azure), containerisation, and CI/CD pipelines Familiarity with scalable data More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Pinepeak
collaboratively, thrive in ambiguity, and take full ownership of what you build. Key technical skills Strong back-end development experience (Python, Node.js ) Working knowledge of C++ and GPU computing (CUDA, OpenCL) Proven ability to design, build, and maintain robust APIs Proficiency with cloud platforms (e.g. AWS, GCP, or Azure), containerisation, and CI/CD pipelines Familiarity with scalable data More ❯
high-impact initiatives and push the boundaries of model performance. You'll also work on re-implementing models in an efficient manner by using PyTorch and underlying technologies like Cuda Kernels, Torch compilation techniques. This would include: Evaluating and optimising compute resource usage (e.g., Hopper GPUs) for cost and time efficiency at training and inference times. Driving the adoption More ❯
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
City of London, London, United Kingdom Hybrid / WFH Options
European Tech Recruit
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
South East London, England, United Kingdom Hybrid / WFH Options
European Tech Recruit
Experience deploying SLAM in industrial or embedded environments Proficient in modern C++ development Preferred Familiarity with machine learning for semantic/geometric inference. Experience in GPU computing, e.g. Vulkan, CUDA, OpenCL or Metal Exposure to embedded systems development. Feel free to also refer someone you may know who could be good for the role. If they are successfully placed … we offer a great referral scheme! Key words – Visual-inertial Odometry/SLAM/Computer Vision/Robotics/CUDA/Vulkan/OpenCL/Metal/Sensor Fusion/Embedded Systems/Semantic Inference/Geometric Inference/C++/Spatial AI By applying to this role, you understand that we may collect your personal data & store & process More ❯
in Computer Science, Engineering, Machine Learning, Artificial Intelligence, or a related field. Strong professional experience coding in modern C++ (advanced level). Practical expertise with GPU programming, specifically using CUDA or OpenCL. Solid background in debugging, optimization, and performance tuning. Clear communicator with a collaborative and proactive working style. Hands-on experience in image-based 3D reconstruction techniques such More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
to-end AI solutions at scale. Essential Requirements: Proven experience as an Engineering Manager delivering complex engineering projects. Expertise in developing GPU kernels and/or ML compilers (e.g., CUDA, OpenCL, TensorRT, MLIR, TVM). Experience optimizing systems to meet strict utilization and latency requirements. Excellent interpersonal and communication skills. Desirable: Experience with C++ and ML frameworks such as More ❯
to-end AI solutions at scale. Essential Requirements: Proven experience as an Engineering Manager delivering complex engineering projects. Expertise in developing GPU kernels and/or ML compilers (e.g., CUDA, OpenCL, TensorRT, MLIR, TVM). Experience optimizing systems to meet strict utilization and latency requirements. Excellent interpersonal and communication skills. Desirable: Experience with C++ and ML frameworks such as More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
and architecture of GPU IPs - Graphics Hardware Processors (5 - 10+ years' experience) Strong understanding of modern 3D graphics and/or compute APIs, such as Vulkan, D3D12 and OpenCL, CUDA etc. Definition of high-level GPU architecture/micro-architecture Confidence knowledge in the ASIC digital design flow Experience in R&D of the latest products and features Ability More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Techfellow Limited
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
that bear little resemblance to publicly available substitutes. Utmost integrity, confidentiality, and discretion in both internal and external interactions. What We Value Experience writing and optimizing compute kernels with CUDA or similar languages. History of developing creative approaches to drive high ML accuracy within an alloted computational budget. Competitive Compensation. We provide financial peace of mind with competitive base More ❯