Staff AI Engineer
shrewsbury, midlands, United Kingdom
Nscale
About Nscale Nscale is taking on the hyperscalers by building a vertically integrated GenAI cloud platform. We own the data centres, software, and applications that power today's AI stack using sustainable technology solutions. We thrive on a culture of relentless innovation, ownership, and accountability, where every team … and implement advanced methodologies like LoRA, prefix-tuning, and adapter-based approaches for fine-tuning AI models. Develop robust, fault-tolerant systems for data ingestion, processing, and model customisation. Optimise GPU utilisation and system performance using frameworks like DeepSpeed, Triton Inference Server, TensorRT, and custom CUDA/Rocm … PyTorch, with a strong understanding of transformer architectures, LLMs, and multimodal generative models. Expertise in distributed training frameworks like DeepSpeed or Fully Sharded Data Parallel (FSDP). Experience with GPU programming and optimisation e.g CUDA, TensorRT, or ROCm. Knowledge of fine-tuning methods, such as LoRA, prefix-tuning More ❯
Posted: