role. Experience in any of the following software engineering areas would be beneficial: High-performance computing techniques, such as parallel or distributed CPU/GPU implementations (e.g., SIMD vectorization, CUDA, Ray). Our Diversity, Equity, and Inclusion commitments: We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we're More ❯
language, vision and other modalities, machine learning for molecules and proteins (ideally with some background in chemistry and biological sciences). Lower-level programming for hardware efficiency, e.g. C CUDA/Triton. Practical familiarity with hardware capabilities for deep learning - threads, caches, vector & matrix engines, data dependencies, bus widths and throttling. Practical familiarity with software stacks for deep learning More ❯
large language models, efficient computing based on low-precision arithmetic, deep learning models including large generative models for language, vision and other modalities. Experience writing C Triton/CUDA kernels for performance optimisation of ML models. Have contributed to open-source projects or published research papers in relevant fields. Knowledge of cloud computing platforms. Keen to present, publish More ❯
or create insights, that's a plus. Deeper systems knowledge. Extra experience with any of the following would be an asset: developing GPU kernels and/or ML compilers (e.g. CUDA, OpenCL, TensorRT Plugins, MLIR, TVM, etc.); optimizing systems to meet strict utilization and latency requirements with tools such as NVIDIA Nsight; and/or you've worked with embedded More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Platform Recruitment Limited
novel solutions. About you: C++ is your strongest language; ideally, experience in video or audio processing; experience writing performance-critical software; exposure to GPU technology (Vulkan API, OpenGL, OpenCL, CUDA, etc.); relevant degree. Full details are available. Please don't hesitate to get in touch with maxATplatform-recruitment.com to learn more. More ❯
other high-performance media/signal-processing experience (broadcast, streaming, game engines, AR/VR). SIMD/vectorization (SSE/AVX/NEON) and/or GPU compute (CUDA, Metal, Vulkan, DirectCompute) for acceleration. Cross-platform build & packaging (CMake, cross-compilation toolchains, SDK distribution). Please get in touch to hear more about this incredible position More ❯
including differentiable systems and backpropagation techniques, beyond just neural networks. Strong mathematical background. Proficiency in programming languages and frameworks such as: PyTorch or TensorFlow; Python; C/C++ and CUDA (ideally). Fluent in English. Minimum of 2 years of AI development experience. Preferably, experience applying AI to 3D graphics. Parallaxter is part of the V-Nova Group, a London More ❯
/CD pipelines using GitHub Actions. Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
London (City of London), South East England, United Kingdom
Ultralytics
/CD pipelines using GitHub Actions. Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
as PyTorch, TensorFlow, ONNX. Knowledge of LLM architectures and inference optimization techniques (e.g., batching, quantization). Experience deploying scalable, reliable, real-time model serving systems. (Optional) GPU architecture understanding or CUDA programming experience. The compensation range for this role is $190,000 - $240,000. At Perplexity, we have experienced significant growth since launching the world's first conversational answer engine More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (e.g. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
London (City of London), South East England, United Kingdom
Ultralytics
of model architectures like transformers and CNNs. Hands-on experience with model optimization (e.g. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
image or video captioning, speech-to-text generation. Bonus: Publications in top-tier venues demonstrating your expertise in multimodal AI research. Bonus: Experience in writing efficient GPU kernels using CUDA, optimising performance for multimodal tasks. This role is perfect for you if you: Have a deep passion for machine learning and its potential to impact various industries through multimodal More ❯