Machine Learning Engineer - Generative AI
City of London, London, United Kingdom
Qubit Analytics
conferences (e.g., NeurIPS, CVPR, ICML, ICLR, SIGGRAPH, ECCV). Experience with GPU programming (CUDA) and model optimization for real-time inference (e.g., quantization, pruning, ONNX, TensorRT, custom CUDA kernels). Background in scalable algorithm design for real-time or interactive applications. Experience integrating machine learning models with complex production pipelines