Essential Skills Masters or higher degree in ML/AI, Computer Science/Engineering, or related disciplines Professional software development experience with modern C++ Experience with GPU compute in CUDA/OpenCL Excellent communication, teamwork and a results-oriented attitude Proficiency in problem-solving and debugging Expertise in image-based 3D reconstruction: Photogrammetry, Neural Radiance Fields (NERF) or Gaussian More ❯
System Maintenance and Performance Optimization • Manage, maintain, and tune bare-metal HPC clusters running Linux-based operating systems (e.g., Fedora, Debian, Ubuntu). • Optimize Nvidia GPU compute environments, including CUDA, NCCL, and GPU resource management in multi-node HPC clusters. • Oversee high-speed networking configurations, including InfiniBand (Mellanox), RDMA, and Ethernet fabric tuning for low-latency HPC workloads. • Configure … Support • Serve as the lead technical resource for diagnosing and resolving complex software, networking, and hardware issues in large-scale GPU clusters. • Analyze logs, conduct performance profiling, and debug CUDA, MPI, and RDMA-related issues. • Work closely with AI/ML research teams, cloud engineers, and enterprise clients to optimize workload performance. Collaboration and Process Improvement • Support the ongoing More ❯
CUDA Developer | High-Performance Computing | Applied AI Location: UK-based Remote Type: Contract, Outside IR35, Remote Sector: Advanced Computing/Applied AI We’re partnering with a company building next-generation GPU-accelerated software for scientific and AI applications. We are recruiting for a CUDA Developer who’s passionate about getting every ounce of performance out of modern … opportunity to work with a small, expert team where your technical decisions will shape the foundation of an emerging AI technology. What You’ll Be Doing Designing and optimising CUDA kernels for high-performance workloads. Translating advanced algorithms into production-ready GPU-accelerated code. Profiling performance and reducing bottlenecks using Nsight, CUPTI, and custom tooling. Working with C++ engineers … and ML researchers to deliver scalable AI computation pipelines. Contributing to architecture decisions on parallelisation, data transfer, and memory efficiency. What We’re Looking For Deep experience with CUDA C/C++ and modern C++ (17/20) . Strong understanding of GPU architecture, memory management, and parallelism . Familiarity with OpenMP, MPI, or other HPC frameworks . Bonus More ❯
Bristol, Avon, South West, United Kingdom Hybrid/Remote Options
Certain Advantage
World Class Defence Organisation based in Bristol is currently looking to recruit an Embedded C++ Software Engineer subcontractor on an initial 6 month contract. The role can be worked on a 4 day week basis (Monday to Thursday) but due More ❯
architectures Develop low-latency inference systems providing real-time, high-accuracy predictions in production Optimise and extend machine learning frameworks to improve training and inference performance Leverage GPU programming (CUDA, cuDNN, TensorRT) to maximise efficiency Automate model experimentation, tuning and retraining in partnership with research teams Work with infrastructure specialists to optimise workflows and reduce compute costs Assess and … ML development and deployment Skills and Experience 5+ years’ experience in machine learning with a focus on training and inference systems Strong programming expertise in Python and C++ or CUDA Proficiency with PyTorch, TensorFlow or JAX Hands-on experience with GPU acceleration and distributed training (Horovod, NCCL or similar) Background in real-time, low-latency ML pipelines Familiarity with More ❯
architectures Develop low-latency inference systems providing real-time, high-accuracy predictions in production Optimise and extend machine learning frameworks to improve training and inference performance Leverage GPU programming (CUDA, cuDNN, TensorRT) to maximise efficiency Automate model experimentation, tuning and retraining in partnership with research teams Work with infrastructure specialists to optimise workflows and reduce compute costs Assess and … ML development and deployment Skills and Experience 5+ years’ experience in machine learning with a focus on training and inference systems Strong programming expertise in Python and C++ or CUDA Proficiency with PyTorch, TensorFlow or JAX Hands-on experience with GPU acceleration and distributed training (Horovod, NCCL or similar) Background in real-time, low-latency ML pipelines Familiarity with More ❯