City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
analysis to prototype quickly Desirable Experience Experience with TensorRT , Nvidia Deepstream , or other deployment frameworks Background in neural network design or edge inference Programming in C/C++ and CUDA Realtime or embedded vision applications Why Join AssetCool? Tackle some of the toughest challenges in robotics, vision, and infrastructure tech Join a growing team with global ambitions and a More ❯
PyTorch internals and other major ML frameworks. Experience optimizing deep learning performance on accelerator hardware. Solid knowledge of deep learning algorithms and compute patterns. Strong programming skills in C++, CUDA, or OpenCL. Background in performance profiling and optimization. BS/MS in Computer Science, Electrical Engineering, or a related field. Interested? Send your CV to to apply. More ❯
numerical calculation, compilation, algorithm and chip co-design, runtime, or shared memory Strong background in software development using C/C++ and Python Skilled with GPU compute APIs (e.g., CUDA, OpenCL), deep learning frameworks, and compilers Familiarity with AI models, algorithm trends, and translating application requirements into chip-level solutions Experience with GPU acceleration, inference backends, and frameworks such More ❯
as: Puppet, Chef, SMS, Satellite, etc. Knowledge of interpreted and compiled computer programming languages such as Python, Java, C, Objective C, C++, C Sharp, SQL, Tcl, Perl, PHP. Assembly, CUDA, and GPU language experience desirable. Knowledge of advanced computing technologies such as parallel processing, in-memory databases, graph databases and graph theory, machine learning, deep learning, and neural networks. More ❯
Basildon, Essex, United Kingdom Hybrid / WFH Options
leonardo company
looking for: Essential: C# software development Machine-to-machine networking, working to third-party interface definitions Test frameworks and test development (not test-driven development) Microservices architecture/containerisation CUDA integration (AI/ML) Development of new applications to meet user expectations and within formal constraints. HMI/GUI/UX experience needed. Familiarity with the tools and approaches More ❯
large language models, efficient computing based on low-precision arithmetic, deep learning models including large generative models for language, vision and other modalities . Experience writing C Triton/CUDA kernels for performance optimisation of ML models. Have contributed to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Good oral and written English skills "Nice To Have" Skills and Experience : Experience with ML software frameworks (e.g. PyTorch) Familiarity with ML hardware accelerators (e.g. NPUs, TPUs, GPUs with CUDA support) Knowledge of optimising and profiling software Experience with assembly programming Software development and integration on Linux, Android, or similar systems Knowledge of scripting languages, including Python In Return More ❯
large language models, efficient computing based on low-precision arithmetic, deep learning models including large generative models for language, vision and other modalities . Experience writing C Triton/CUDA kernels for performance optimisation of ML models. Have contributed to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish More ❯
Senior CFD Software Engineer London I'm currently supporting a Fortune 100 organisation and a global leader. In this role, you will design and enhance solver features for both CPU and GPU, as well as develop pre- and post-processing More ❯
Senior CFD Software Engineer London I'm currently supporting a Fortune 100 organisation and a global leader. In this role, you will design and enhance solver features for both CPU and GPU, as well as develop pre- and post-processing More ❯
Job Purpose This role is to provide operational and strategic leadership in the provision of research computing infrastructure services within the College of Medical, Veterinary and Life Sciences within the University of Glasgow, part of an overall portfolio of activity More ❯
Essential Skills Masters or higher degree in ML/AI, Computer Science/Engineering, or related disciplines Professional software development experience with modern C++ Experience with GPU compute in CUDA/OpenCL Excellent communication, teamwork and a results-oriented attitude Proficiency in problem-solving and debugging Expertise in image-based 3D reconstruction: Photogrammetry, Neural Radiance Fields (NERF) or Gaussian More ❯
technologies may that be with Frontend/Backend LLVM or MLIR. Strong programming language skills with C and/or C++. Familiarity with a GPGPU API such as SYCL, CUDA or OpenCL. Open Source code commits and reviews are beneficial. Experience of low level software or hardware development that require looking at computer architecture specifications like compilers, debuggers, models. … essential. Keywords: Compiler/Compilation/LLVM/GCC/OpenSource/Linux/C/C++/Low level/Hardware/debuggers/Fortran/OpenCL/CUDA/MLIR/Machine Learning/GPU/GPGPU By applying to this role you understand that we may collect your personal data and store and process it on More ❯
place recognition. Strong software engineering skills in C++ and Python, including performance critical CV/ML code on Linux or embedded platforms. Familiarity with GPU or edge AI acceleration (CUDA, TensorRT, Vulkan, or similar). Demonstrated ability to deliver production quality, well tested code in collaborative, fast moving environments. Preferred Qualifications Familiarity with GPU or edge AI acceleration (CUDAMore ❯
PERSON: Experienced in GPU kernel development and optimization for AI/HPC applications. Strong technical and analytical skills in GPU computing, hardware architecture, and deep understanding of HIP/CUDA/OpenCL/Triton development. Ability to work as part of a team, deliver to project scope, and communicate effectively to both technical and non-technical audiences. KEY RESPONSIBILITIES … driving AI operator performance (GEMM, Attention, Distributed scale-up/out communication, etc.). Apply your knowledge of software engineering best practices. PREFERRED EXPERIENCE: Knowledge of GPU computing (HIP, CUDA, OpenCL, Triton). Experience in optimizing GPU kernels. Proficiency with profiling and debugging tools. Core understanding of GPU hardware. Excellent C/C Python programming and software design skills More ❯
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
City of London, London, United Kingdom Hybrid / WFH Options
European Tech Recruit
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
with AFSIM, including plugin/module development and simulation integration. Hands-on experience with LIDAR and SAR data processing, including algorithm development and performance modeling. Proficiency in C++, Python, CUDA, and embedded systems programming. Strong background in machine learning, signal processing, and parallel computing. Proven track record as a Principal Investigator on SBIR/STTR or other government-funded More ❯
products with the latest machine learning advancements. Requirements include strong programming skills in Python, C, C++, experience with deployment platforms, and familiarity with NLP, computer vision, TensorFlow, PyTorch, JAX, CUDA, LLMs, and related technologies. A degree in a relevant field and a solid AI R&D track record are essential. More ❯
reason through quantitative problems and communicate effectively with trading researchers Reliable and predictable availability Bonus Points Experience with HPC and distributed large model training Experience with GPU performance optimization (CUDA or ROCm) Experience with end-to-end model development, especially in LLMs Prior academic publications and/or contributions to open-source AI research Strong opinions on best practices More ❯
Bristol, Avon, England, United Kingdom Hybrid / WFH Options
Adecco
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
Employment Type: Permanent
Salary: £70000 - £95000/annum pension, healthcare, life cover