functions, evaluation metrics, and training procedures Contribute to model selection, architecture design, and technology stack Evaluate model performance Drive innovation initiatives and proof-of-concept projects Export models to ONNX and deploy and integrate them into our C++ environment using TensorRT Optimise existing models for improved accuracy, efficiency, and scalability Build and maintain machine learning infrastructure and deployment pipelines Implement More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
from recent ML papers to solve practical challenges using retail image datasets Benchmark multiple approaches through rigorous experimental pipelines Build scalable inference pipelines using frameworks such as PyTorch, TensorRT, ONNX, and TensorFlow Lite (cloud and edge) Collaborate with engineers and product stakeholders to integrate models into end-to-end systems Perform error analysis and iterate on models continuously in production … models such as diffusion models and vision-language models (VLMs) Proficient in Python and familiar with ML frameworks like PyTorch Familiarity with model optimization and inference acceleration tools (e.g., ONNX, TensorRT, OpenVINO, vLLM) Demonstrated ability to ship ML models in production settings, including performance monitoring and failure analysis Excellent communication skills and proven ability to collaborate across cross-functional teams More ❯
Winchester, Hampshire, United Kingdom Hybrid / WFH Options
Bolt6
implement state-of-the-art models for computer vision problems including object detection, key-point estimation, segmentation; using Python, PyTorch, Ignite, OpenCV, AWS Evaluate model performance Export models to ONNX and deploy and integrate them into our C++ environment using TensorRT Optimise existing models for improved accuracy, efficiency, and scalability Research and evaluate new machine learning techniques and frameworks Build More ❯
implementation. Extensive experience with common machine learning Python frameworks such as TensorFlow and PyTorch; and Python libraries such as pandas, and computer vision libraries such as OpenCV. Experience in ONNX and TensorRT. Very comfortable working in Linux environment. Familiarity with software development tools and agile development practices. 6 years experience in developing, optimizing, and testing deep learning in computer vision More ❯
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we More ❯
for neuralnetwork deployment and optimization. Benchmark, analyze, and improve AI workload performance. Collaborate with the hardware team to guide architectural decisions. Extend support to additional frameworks (e.g., TensorFlow, ONNX). Produce developer documentation and resources. Requirements: 5+ years of experience in AI/ML software development. Deep understanding of PyTorch internals and other major ML frameworks. Experience optimizing deep More ❯
bottlenecks, and optimizing them. Good understanding of Docker and containerization. (Good to have) Experience with Pytorch and Python3, and comfortable with C++. (Good to have) Understanding of Torch script, ONNX runtime, TensorRT. (Good to have) Understanding of half-precision inference and int8 quantization. What we offer Company equity % in an early-stage startup. More ❯
of two of the following compiler areas: Front-end - handle the handshaking of common Deep Learning Frameworks with Gensyn's IR for internal IR usage. Write transformation passes in ONNX to alter IR for middle-end consumption. Middle-end - write compiler passes for training-based compute graphs, integrate reproducible Deep Learning kernels into the code generation stage, and debug compilation More ❯
assist the team in building and developing cutting-edge digital systems for a customer in the public sector. Key Responsibilities Design, develop and optimise Machine Learning solutions using YOLO, ONNX, and NVIDIA GPU/vGPU environments. Lead the ML pipeline for object recognition to meet operational objectives, including tuning accuracy for specific scenarios (eg, detecting containers too high/low … background in software engineering which includes machine learning components, and expertise in using YOLO (You Only Look Once) or similar object detection frameworks. Experience with NVIDIA GPU optimisation or ONNX runtime environments. Strong understanding of computer vision, OCR, and image/video processing Proficiency in SOAP API development. Working knowledge of DevOps technologies including OpenShift (OCP 4.x), Kubernetes, and Cilium. More ❯
a multiplatform inference backend. Our remote team is committed to advancing AI technology and making it broadly accessible. Responsibilities: Deploy machine learning models on edge devices using llama.cpp, ggml, onnx frameworks. Collaborate with researchers to code, train, and transition models to production. Integrate AI features into existing products, leveraging latest ML advancements. Requirements: Proficiency in Python, C, and C++ programming. More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
collaborating with remote teams across different time zones. "Nice To Have" Skills and Experience : Experience of Windows WDDM or Linux kernel driver development. Experience of AI frameworks (TensorFlow, PyTorch, ONNX). Knowledge of multimedia use cases including video, camera, display, and GPU. In Return: You will have the opportunity to demonstrate technical expertise and leadership skill to engage with GPU More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Senior GPU Software Applications Engineer Do you want to help Arm's partners build real-world solutions based on Arm GPU technology? This is an excellent opportunity to join Arm's Sales and Partner Enablement group as an Applications Engineer. More ❯
deployment of models (classification, object detection, causal inference) Partnering with: ESA, EO data providers, hardware partners, client field teams The Successful CTO will have: Proficiency in Python, PyTorch, OpenCV, ONNX, TorchScript/TensorRT Experience deploying production-grade AI in edge environments (Jetson/NVIDIA) Comfortable leading projects from concept to field deployment Detail-oriented and execution-driven Mission-aligned: You … UK - EMEA/LATAM To discuss this opportunity in further detail please call James on . Keywords: CTO, Chief Technology Officer, AI Engineer, Software Engineering Director, Python, PyTorch, OpenCV, ONNX, TorchScript/TensorRT, Agriculture, Technology, York, Leeds, Manchester, Midlands, North East, England More ❯
deployment of models (classification, object detection, causal inference) Partnering with: ESA, EO data providers, hardware partners, client field teams The Successful CTO will have: Proficiency in Python, PyTorch, OpenCV, ONNX, TorchScript/TensorRT Experience deploying production-grade AI in edge environments (Jetson/NVIDIA) Comfortable leading projects from concept to field deployment Detail-oriented and execution-driven Mission-aligned: You … LATAM To discuss this opportunity in further detail please call James on (phone number removed). Keywords: CTO, Chief Technology Officer, AI Engineer, Software Engineering Director, Python, PyTorch, OpenCV, ONNX, TorchScript/TensorRT, Agriculture, Technology, York, Leeds, Manchester, Midlands, North East, England More ❯