. A proactive ownership mindset and the ability to navigate ambiguity. Excellent collaboration and communication skills for working effectively with teams and stakeholders. Ideally Professional experience GPGPU programming (e.g., CUDA, Triton) for performance optimization. Experience building and maintaining widely-used internal or open-source libraries. Familiarity with the machine learning development lifecycle and core concepts (e.g., bias-variance tradeoff More ❯
code on Linux or embedded platforms. Demonstrated ability to deliver production quality, well tested code in collaborative, fast moving environments. Preferred Qualifications Familiarity with GPU or edge AI acceleration (CUDA, TensorRT, Vulkan, or similar). Experience deploying perception pipelines on resource constrained hardware. Publications in multimodal sensing/neural representations/SLAM for robotics or autonomous navigation in journals More ❯
models, building production systems with large language models, efficient computing with low-precision arithmetic, or large generative models for language, vision, and other modalities. Experience writing C++, Triton, or CUDA kernels for performance optimisation of ML models. Contributions to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish, and More ❯
Proficient using Git version control Strong problem-solving abilities and communication skills Desirable skills: Experience in UI development e.g. ImGui Understanding of multithreading techniques Experience with GPU programming e.g. CUDA Experience with a messaging framework, e.g. NATS, RabbitMQ Experience with low level graphics APIs such as OpenGL Experience working in and configuring cloud environments (e.g. AWS, Azure, GCP) Experience More ❯
large language models, efficient computing based on low-precision arithmetic, deep learning models including large generative models for language, vision and other modalities}. Experience writing C Triton/CUDA kernels for performance optimisation of ML models. Have contributed to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish More ❯
level Metal knowledge to guide Metal developers to tune their applications for maximum performance on Apple Silicon. Minimum Qualifications Understand the graphics pipeline GPU programming with Metal, DirectX, Vulkan, CUDA, Direct Compute, OpenGL, or OpenCL Programming knowledge of C/C++ Carry forward highly complex software debug efforts Preferred Qualifications Excellent written and oral communication skills including the ability More ❯
reason through quantitative problems and communicate effectively with trading researchers Reliable and predictable availability Bonus Points Experience with HPC and distributed large model training Experience with GPU performance optimization (CUDA or ROCm) Experience with end-to-end model development, especially in LLMs Prior academic publications and/or contributions to open-source AI research Strong opinions on best practices More ❯
Wandsworth, Greater London, UK Hybrid / WFH Options
Treecode
above in Machine Learning, Computer Science, Engineering, or a related technical discipline or equivalent experience Desirable Strong software engineering experience in Python and other relevant languages (e.g. C++ and CUDA) Direct experience working in at least one of computer vision, robotics, simulation, graphics, or large language models. MS, or above in Machine Learning, Computer Science, Engineering, or a related More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Platform Recruitment Limited
novel solutions. About you: C++ is your strongest language Ideally experience in Video or Audio Processing Experience writing performance-critical software Exposure to GPU technology (Vulkan API, OpenGL, OpenCL, CUDA etc.,) Relevant degree Full details are available. Please don't hesitate to get in touch with maxATplatform-recruitment.com to learn more. More ❯
other high-performance media/signal-processing experience (broadcast, streaming, game engines, AR/VR). SIMD/vectorization (SSE/AVX/NEON) and/or GPU compute (CUDA, Metal, Vulkan, DirectCompute) for acceleration. Cross-platform build & packaging (CMake, cross-compilation toolchains, SDK distribution). Please get in touch with to hear more about this incredible position More ❯
ML codebases Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed/FSDP/SLURM/K8s) Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops Strong software-design instincts: testing, code review, CI/CD Self-starter, low-ego, collaborative Benefits France Competitive cash salary and equity Food: Daily lunch More ❯
as PyTorch, TensorFlow, ONNX Knowledge of LLM architectures and inference optimization techniques (e.g., batching, quantization) Experience deploying scalable, reliable, real-time model serving systems (Optional) GPU architecture understanding or CUDA programming experience The compensation range for this role is $190,000 - $240,000. At Perplexity, we have experienced significant growth since launching the world's first conversational answer engine More ❯
and deployment pipeline to accelerate model iteration and improve performance. Qualifications PhD in CS/CE/EE, or equivalent, in industry experience Deep knowledge of PyTorch Experience with Cuda or Triton language for writing custom ops Knowledge of model training framework (e.g. PyTorch Lightning) In-depth knowledge of transformer architecture and ways to accelerate the training and inference More ❯
focus on training or inference systems Hands-on experience with real-time, low-latency ML pipelines in high-performance environments is a strong plus Strong engineering skills, including Python, CUDA, or C++ Knowledge of machine learning frameworks such as PyTorch, TensorFlow, or JAX Proficiency in GPU programming for training and inference acceleration (e.g., CuDNN, TensorRT) Experience with distributed training More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
Contribute to hiring additional talent to our rapidly growing team The role will be exposed to a broad tech stack (e.g. ReactJS, Python, REST & GraphQL, OpenCV, PyTorch, GCP, AWS & CUDA, Kubernetes) and the cutting edge of computer vision and deep learning. Qualifications The right candidate will have a proven track record of relevant publications and previous experience managing applied More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯