proposal, etc. Prototype and deploy modern perception algorithms , including transformer-based models, across RGB-D, point cloud, and tactile modalities. Deliver real-time inference pipelines using PyTorch, TensorRT, and CUDA on embedded accelerators (e.g., Jetson). Integrate with ROS 2 : clean modular nodes, lifecycle management, deterministic scheduling, robust fallback behavior. Collaborate tightly with control, planning, and hardware to ensure … manipulation, SLAM, autonomous navigation). Strong proficiency in modern C++ (17/20) and Python for high-performance robotics software. Deep experience with PyTorch (training & deployment), and GPU optimisation (CUDA/TensorRT). Strong working knowledge of ROS 2 (rclcpp, lifecycle nodes, real-time QoS, DDS). Hands-on experience with transformer-based models (e.g., DETR, SAM, DINOv2, ViT More ❯
proposal, etc. Prototype and deploy modern perception algorithms , including transformer-based models, across RGB-D, point cloud, and tactile modalities. Deliver real-time inference pipelines using PyTorch, TensorRT, and CUDA on embedded accelerators (e.g., Jetson). Integrate with ROS 2 : clean modular nodes, lifecycle management, deterministic scheduling, robust fallback behavior. Collaborate tightly with control, planning, and hardware to ensure … manipulation, SLAM, autonomous navigation). Strong proficiency in modern C++ (17/20) and Python for high-performance robotics software. Deep experience with PyTorch (training & deployment), and GPU optimisation (CUDA/TensorRT). Strong working knowledge of ROS 2 (rclcpp, lifecycle nodes, real-time QoS, DDS). Hands-on experience with transformer-based models (e.g., DETR, SAM, DINOv2, ViT More ❯
proposal, etc. Prototype and deploy modern perception algorithms , including transformer-based models, across RGB-D, point cloud, and tactile modalities. Deliver real-time inference pipelines using PyTorch, TensorRT, and CUDA on embedded accelerators (e.g., Jetson). Integrate with ROS 2 : clean modular nodes, lifecycle management, deterministic scheduling, robust fallback behavior. Collaborate tightly with control, planning, and hardware to ensure … manipulation, SLAM, autonomous navigation). Strong proficiency in modern C++ (17/20) and Python for high-performance robotics software. Deep experience with PyTorch (training & deployment), and GPU optimisation (CUDA/TensorRT). Strong working knowledge of ROS 2 (rclcpp, lifecycle nodes, real-time QoS, DDS). Hands-on experience with transformer-based models (e.g., DETR, SAM, DINOv2, ViT More ❯
you will: Design and write high-performant and scalable software for training. Understand architectural modifications and design choices and their effects on training throughput and quality. Write low-level CUDA, triton kernels to squeeze every last bit of performance from our accelerators. Research, implement, and experiment with ideas on our supercompute and data infrastructure. Learn from and work with … if you have: Extremely strong software engineering skills. Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR. Experience writing kernels for GPUs using CUDA, triton, etc Experience using large-scale distributed training strategies. Familiarity with autoregressive sequence models, such as Transformers. Bonus : paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats More ❯