models. Develop robust, fault-tolerant systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/Rocm kernels. Conduct performance testing and resolve bottlenecks across training, fine-tuning, and inference workflows. Document and build tooling to ensure successful use of Nscale’s training, fine … of transformer architectures, LLMs, and multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation e.g CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning, and adapter-based techniques, and experience improving model performance. Experience with containerised environments and Kubernetes for More ❯
with the latest developments in model optimization, inference engines, quantization methods, and related technologies. Requirements Proven professional experience optimizing neural network inference workloads. Strong expertise with TensorRT, Triton language, CUDA programming. Experience with neural network quantization techniques. Proficiency in Python and PyTorch. Deep understanding of GPU architectures and performance optimization. Excellent problem-solving skills and ability to analyze performance More ❯
deploying machine learning onto a range of hardware from resource constrained embedded systems through to edge computing is desirable. As is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work More ❯
deploying machine learning onto a range of hardware from resource constrained embedded systems through to edge computing is desirable. As is any knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc). Your future colleagues will be similarly highly skilled, with experience across industry and the drive to innovate. You will find yourself in a low-management work More ❯