Guildford, Surrey, United Kingdom Hybrid / WFH Options
Ecm Selection
Strong mathematical and analytical problem-solving skills. Desirable Skills: Experience with C# .NET, WinForms, WPF, or the Qt/QML framework, or HTML5. Experience in GPU programming (e.g., OpenCL, CUDA). Knowledge of AI and Machine Learning techniques. Expertise in graphics development (2D/3D) using technologies such as OpenGL, OpenGL Shaders, VTK, OSG, or Vulkan. Why apply for More ❯
London, England, United Kingdom Hybrid / WFH Options
PhysicsX
scaling and optimising ML models, training and serving foundation models at scale (federated learning a bonus); distributed computing frameworks (e.g., Spark, Dask) and high-performance computing frameworks (MPI, OpenMP, CUDA, Triton); cloud computing (on hyper-scaler platforms, e.g., AWS, Azure, GCP); building machine learning models and pipelines in Python, using common libraries and frameworks (e.g., NumPy, SciPy, Pandas, PyTorch More ❯
London, England, United Kingdom Hybrid / WFH Options
NVIDIA
/Digital Twins. Proficiency in deploying AI models and optimizing inference using TensorRT, ONNX Runtime, Triton, or TensorRT-LLM is a plus. Proven experience implementing and optimizing workloads with CUDA and Nsight Tools. Experience with high performance networking technologies, e.g. DPDK, DOCA, RMDA, RoCEv2 is a plus. Published record of thought leadership in a technical area or industry segment. More ❯
London, England, United Kingdom Hybrid / WFH Options
PhysicsX Ltd
scaling and optimising ML models, training and serving foundation models at scale (federated learning a bonus); distributed computing frameworks (e.g., Spark, Dask) and high-performance computing frameworks (MPI, OpenMP, CUDA, Triton); cloud computing (on hyper-scaler platforms, e.g., AWS, Azure, GCP); building machine learning models and pipelines in Python, using common libraries and frameworks (e.g., NumPy, SciPy, Pandas, PyTorch More ❯
London, England, United Kingdom Hybrid / WFH Options
InstaDeep
learning models across diverse hardware platforms (GPU/TPU) and optimising system performance under heavy load. Low-Level Optimisation: Write efficient Python, C/C++, XLA, Pallas, Triton, or CUDA code to achieve performance breakthroughs. ML Systems Design: Architect robust distributed systems for training, deployment, and monitoring, ensuring computational efficiency and scalability. Data Pipeline Automation: Develop automated pipelines for …/or PyTorch) Passion for profiling, identifying bottlenecks, and delivering efficient solutions. Fundamentals of modern Deep Learning Desired Skills Track record of successfully scaling ML models. Experience writing custom CUDA kernels or XLA operations. Understanding of GPU/TPU architectures and their implications for efficient ML systems. Representative projects Profile algorithm, identifying opportunities for custom XLA/CUDAMore ❯
Farnborough, Hampshire, United Kingdom Hybrid / WFH Options
Lenovo
/OSU, TensorFlow, PyTorch, MLperf. 1+ years of experience of using parallel and/or accelerated applications using more than one of OneAPI, Intel Gaudi software suite, OpenMP, MPI, CUDA, ROCm, OpenCL, SYCL. We will also consider candidates who lack some of the requirements but can demonstrate ability to learn quickly. The following competencies are welcome: Experience of running … performance tests and benchmarking codes. Experience of using or developing parallel and/or accelerated applications using more than one of OpenMP, MPI, CUDA, ROCm, OpenCL, SYCL paradigms. Experience of production HPC environment: large-scale filesystems (ideally Storage Scale), batch scheduling (ideally SLURM) as well as common HPC SW and management tools. Experience with analysis and profiling tools for More ❯
London, England, United Kingdom Hybrid / WFH Options
Canonical
internationally twice a year for company events up to two weeks long Nice-to-have skills Experience with LXC/LXD Experience with AI/ML and/or CUDA/OpenVINO Knowledge of system and language package managers internals What we offer colleagues We consider geographical location, experience, and performance in shaping compensation worldwide. We revisit compensation annually More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
Ecm Selection
Qt, QML); 3D graphics toolkits (OpenGL, Vulkan or shaders); CI experience (CMake, JIRA, Git, Jenkins); GIS development tools (GDAL API, MapBox API); multithreading/parallel computing (GPU programming or CUDA); MATLAB/Python scripting for mathematical/geology problems would be advantageous. Due to specific requirements, applicants without the relevant project experiences will not be considered (similarly exposure to More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
ECM Selection (Holdings) Limited
Qt, QML); 3D graphics toolkits (OpenGL, Vulkan or shaders); CI experience (CMake, JIRA, Git, Jenkins); GIS development tools (GDAL API, MapBox API); multithreading/parallel computing (GPU programming or CUDA); MATLAB/Python scripting for mathematical/geology problems would be advantageous. Due to specific requirements, applicants without the relevant project experiences will not be considered (similarly exposure to More ❯
London, England, United Kingdom Hybrid / WFH Options
InstaDeep Ltd
optimise state-of-the-art algorithms and architectures, ensuring compute efficiency and performance. Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs. Required Skills Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.) Expertise … with machine learning frameworks (JAX, Tensorflow, PyTorch etc.) Passion for profiling, identifying bottlenecks, and delivering efficient solutions. Highly Desirable Track record of successfully scaling ML models. Experience writing custom CUDA kernels or XLA operations. Understanding of GPU/TPU architectures and their implications for efficient ML systems. Fundamentals of modern Deep Learning Actively following ML trends and a desire … to push boundaries. Example Projects: Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development. Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects. Adapt algorithms for large-scale distributed architectures across HPC clusters. Employ memory-efficient techniques within models for increased parameter counts and longer context lengths. What We Offer: Real More ❯
London, England, United Kingdom Hybrid / WFH Options
Treecode
above in Machine Learning, Computer Science, Engineering, or a related technical discipline or equivalent experience Desirable Strong software engineering experience in Python and other relevant languages (e.g. C++ and CUDA) Direct experience working in at least one of computer vision, robotics, simulation, graphics, or large language models. MS, or above in Machine Learning, Computer Science, Engineering, or a related More ❯
London, England, United Kingdom Hybrid / WFH Options
Wayve
BSc above in Machine Learning, Computer Science, Engineering, or a related technical discipline or equivalent experience Strong software engineering experience in Python and other relevant languages (e.g. C++ and CUDA) Direct experience working in at least one of computer vision, robotics, simulation, graphics, or large language models. MS, or above in Machine Learning, Computer Science, Engineering, or a related More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
across diverse vendor platforms. Working with low-level system and memory management techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing … autonomous systems. Essential Requirements: 3+ years of experience in C++ programming, with a strong grasp of modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying More ❯
PyTorch internals and other major ML frameworks. Experience optimizing deep learning performance on accelerator hardware. Solid knowledge of deep learning algorithms and compute patterns. Strong programming skills in C++, CUDA, or OpenCL. Background in performance profiling and optimization. BS/MS in Computer Science, Electrical Engineering, or a related field. Interested? Send your CV to to apply. More ❯
Basildon, Essex, United Kingdom Hybrid / WFH Options
leonardo company
looking for: Essential: C# software development Machine-to-machine networking, working to third-party interface definitions Test frameworks and test development (not test-driven development) Microservices architecture/containerisation CUDA integration (AI/ML) Development of new applications to meet user expectations and within formal constraints. HMI/GUI/UX experience needed. Familiarity with the tools and approaches More ❯
Basildon, England, United Kingdom Hybrid / WFH Options
Leonardo
working to third-party interface definitions Test frameworks and test development (not test-driven development) Microservices architecture/containerisation (development of containerised plugins, with potential to replace existing components) CUDA integration (AI/ML) Development of new applications to meet user expectations within formal constraints, including HMI/GUI/UX experience Familiarity with embedded development tools and approaches More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Good oral and written English skills "Nice To Have" Skills and Experience : Experience with ML software frameworks (e.g. PyTorch) Familiarity with ML hardware accelerators (e.g. NPUs, TPUs, GPUs with CUDA support) Knowledge of optimising and profiling software Experience with assembly programming Software development and integration on Linux, Android, or similar systems Knowledge of scripting languages, including Python In Return More ❯
London, England, United Kingdom Hybrid / WFH Options
Annapurna
of end-to-end AI solutions at scale. Proven experience as an Engineering Manager delivering complex engineering projects. Expertise in developing GPU kernels and/or ML compilers (e.g., CUDA, OpenCL, TensorRT, MLIR, TVM). Experience optimizing systems to meet strict utilization and latency requirements. Excellent interpersonal and communication skills. Experience with C++ and ML frameworks such as PyTorch. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
European Tech Recruit
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
Industrial experience in deploying SLAM solutions. Proficiency in C++. Desirable experience: PhD in computer vision or robotics. Experience with machine learning techniques for geometric & semantic estimation. GPU programming skills (CUDA, OpenCL, Vulkan, Metal). Experience with embedded software development. If this role is of any interest please apply directly on LinkedIn or send a copy of your CV to More ❯
products with the latest machine learning advancements. Requirements include strong programming skills in Python, C, C++, experience with deployment platforms, and familiarity with NLP, computer vision, TensorFlow, PyTorch, JAX, CUDA, LLMs, and related technologies. A degree in a relevant field and a solid AI R&D track record are essential. More ❯
Bristol, Avon, England, United Kingdom Hybrid / WFH Options
Adecco
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
catastrophe modelling, actuarial science, quantitative finance, or data science. Statistical Modelling Expertise: Real-world commercial experience in statistical modelling and probability. Programming Skills: Proficiency in scientific Python (essential), Spark, CUDA, and SQL. Familiarity with Databricks is a plus! Problem Solver: Strong analytical skills with the ability to work independently and collaboratively. Communication Skills: Ability to present complex technical information More ❯
Employment Type: Permanent
Salary: £70000 - £95000/annum pension, healthcare, life cover