Senior Machine Learning Engineer
London, South East England, United Kingdom
Hybrid/Remote Options
Synthesia
… and inference times. Developing customized, efficient solutions for inference pipelines (CUDA/Triton kernels), as well as introducing or enhancing tooling for achieving optimal computational performance (e.g. DL compilers, ONNX, TensorRT). Driving the adoption of best practices for large-model training, including checkpointing, gradient accumulation, and memory optimisation, among others. Introducing or enhancing tooling for distributed training, performance monitoring, and …