City of London, London, United Kingdom Hybrid / WFH Options
Explore Group
and ability to work in a hybrid, collaborative environment. Nice to Have: Experience with transformer-based vision models (ViT, CLIP, SAM). Familiarity with real-time inference and optimisation (ONNX, TensorRT). Previous work on video analytics, 3D vision, or multi-modal ML projects.
London, South East England, United Kingdom Hybrid / WFH Options
Explore Group
and ability to work in a hybrid, collaborative environment. Nice to Have: Experience with transformer-based vision models (ViT, CLIP, SAM). Familiarity with real-time inference and optimisation (ONNX, TensorRT). Previous work on video analytics, 3D vision, or multi-modal ML projects.
Slough, South East England, United Kingdom Hybrid / WFH Options
Explore Group
and ability to work in a hybrid, collaborative environment. Nice to Have: Experience with transformer-based vision models (ViT, CLIP, SAM). Familiarity with real-time inference and optimisation (ONNX, TensorRT). Previous work on video analytics, 3D vision, or multi-modal ML projects.
London (City of London), South East England, United Kingdom Hybrid / WFH Options
Explore Group
and ability to work in a hybrid, collaborative environment. Nice to Have: Experience with transformer-based vision models (ViT, CLIP, SAM). Familiarity with real-time inference and optimisation (ONNX, TensorRT). Previous work on video analytics, 3D vision, or multi-modal ML projects.
the deployment methods for GPU-accelerated serving frameworks in the market, with reference implementations and best-practice recommendations for large-scale serving solutions (e.g., NVIDIA Triton Inference Server, TensorRT, ONNX Runtime). Develop repeatable and automated configuration templates for GPU resources. Implement active GPU monitoring, including review and analysis of all relevant metrics (utilization, memory bandwidth, power, temperature, etc.), and … NVIDIA Certified (Preferred). Required Skills: Direct experience with GPU services, including resource provisioning, scaling, and optimization. Demonstrable expertise in GPU-accelerated software development (CUDA, OpenCL, TensorRT, PyTorch, TensorFlow, ONNX, etc.). Strong background in performance benchmarking, profiling (Nsight, nvprof, or similar tools), and workload tuning. Experience with Infrastructure as Code (Terraform, Helm charts, or equivalent) for automated cloud resource …