Permanent CUDA Job Vacancies

76 to 79 of 79 Permanent CUDA Jobs

Staff AI Engineer

United Kingdom, UK
Nscale
models. Develop robust, fault-tolerant systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/ROCm kernels. Conduct performance testing and resolve bottlenecks across training, fine-tuning, and inference workflows. Document and build tooling to ensure successful use of Nscale’s training, fine … of transformer architectures, LLMs, and multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation, e.g. CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning, and adapter-based techniques, and experience improving model performance. Experience with containerised environments and Kubernetes for …

Neural Network Optimization Engineer Engineering London, UK

London, United Kingdom
Recraft, Inc
with the latest developments in model optimization, inference engines, quantization methods, and related technologies. Requirements: Proven professional experience optimizing neural network inference workloads. Strong expertise with TensorRT, the Triton language, and CUDA programming. Experience with neural network quantization techniques. Proficiency in Python and PyTorch. Deep understanding of GPU architectures and performance optimization. Excellent problem-solving skills and ability to analyze performance …
Employment Type: Permanent
Salary: GBP Annual

Machine Learning Engineer

Cambridge, Cambridgeshire, United Kingdom
Ecm Selection
deploying machine learning onto a range of hardware, from resource-constrained embedded systems through to edge computing, is desirable, as is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc.). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work …
Employment Type: Permanent
Salary: GBP Annual

Machine Learning Engineer

Cambridge, Cambridgeshire, United Kingdom
ECM Selection (Holdings) Limited
deploying machine learning onto a range of hardware, from resource-constrained embedded systems through to edge computing, is desirable, as is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc.). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work …
Employment Type: Permanent
Salary: £60000 - £90000/annum DoE + Benefits

CUDA
10th Percentile: £41,250
25th Percentile: £61,250
Median: £70,000
75th Percentile: £84,375
90th Percentile: £88,750