Paying up to £1600 per day. Outside IR35 Experience and skills required for the Senior Machine Learning Engineer (AI), Quantization, PEFT, DeepSpeed, ONNX, TensorRT, PyTorch, multi-LoRa, LoRA Exchange, TitanML ️ Strong experience working with inference servers like multi-LoRa, LoRA Exchange, TitanML ️ Experience with HuggingFace Retrieval-augmented generation, embedding pipelines … language models - OpenAI, Mistral, Claude, LLaMA Large language model for GPU usage, scaling, and data movement Quantization, PEFT, DeepSpeed, ONNX, TensorRT Deep Learning with PyTorch Role and responsibilities for the Senior Machine Learning Engineer (AI), Quantization, PEFT, DeepSpeed, ONNX, TensorRT, PyTorch, multi-LoRa, LoRA Exchange, TitanML ️ Build and fine-tune ...