Machine Learning Engineer
- Hiring Organisation
- Client Server
- Location
- East London, London, United Kingdom
- Employment Type
- Permanent, Work From Home
real test outcomes and push long-context and MoE style architectures to their limits. You'll work hands-on across the full stack: custom PyTorch dataloaders, distributed training (DDP/FSDP), experiment tracking, debugging NCCL issues at 2am, and squeezing performance out of multi-GPU jobs. You'll help design … expectation to do more). About you: You have strong experience with training deep learning models in production You have an indepth knowledge of PyTorch including hands-on experience with torch.distributed (DDP/FSDP-style training, distributed data loading, gradient scaling, etc.) You have experience of training large sequence models ...