Permanent ONNX Jobs in London

15 of 15 Permanent ONNX Jobs in London

Head of AI (London)

London, UK
Scrumconnect Consulting
Hands-on experience with large-scale language models (LLMs) and prompt engineering (e.g., GPT, BERT, T5 family). Familiarity with on-device or edge-AI deployments (e.g., TensorFlow Lite, ONNX, mobile/embedded inference). Knowledge of MLOps tooling (MLflow, Weights & Biases, Kubeflow, or similar) for experiment tracking and model governance. Open-source contributions or published papers in top-tier More ❯
Employment Type: Full-time
Posted:

Solutions Engineer/Site Reliability Engineer

London, United Kingdom
Hybrid / WFH Options
Zefr
Workflow Orchestration: Apache Airflow, Ray Machine Learning Stack: Serving: Triton Inference Server MLOps/Experiment Tracking: Weights and Biases, DVC Libraries/Frameworks: Transformers, HuggingFace Model Optimization/Formats: Onnx, TensorRT Data Stores & Databases: Relational Databases: PostgreSQL (including managed versions like AWS Aurora, GCP Cloud SQL) NoSQL Databases: DynamoDB Search Databases: OpenSearch, Elasticsearch Vector Databases: Qdrant Caching: Redis Data Warehousing More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer

London Area, United Kingdom
Ultralytics
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
Posted:

Machine Learning Engineer

City of London, London, United Kingdom
Ultralytics
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
Posted:

Computer Vision Engineer

London Area, United Kingdom
Ultralytics
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
Posted:

Computer Vision Engineer

City of London, London, United Kingdom
Ultralytics
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
Posted:

Machine Learning Engineer

London, United Kingdom
Woven
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer, End-to-end Autonomy

London, United Kingdom
Woven by Toyota
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer (London)

London, UK
Woven
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Employment Type: Full-time
Posted:

Machine Learning Engineer Wearable Health Algorithms (London)

London, UK
all.health
environments Preferred Qualifications: Experience building ML models with wearable data (e.g., continuous heart rate, motion, respiration) Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX) Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR) Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks Seniority level Seniority level Not Applicable Employment type Employment More ❯
Employment Type: Full-time
Posted:

Sr. ML Engineer

London, United Kingdom
Veritone, Inc
implementation. Extensive experience with common machine learning Python frameworks such as TensorFlow and PyTorch; and Python libraries such as pandas, and computer vision libraries such as OpenCV. Experience in ONNX and TensorRT. Very comfortable working in Linux environment. Familiarity with software development tools and agile development practices. 6 years experience in developing, optimizing, and testing deep learning in computer vision More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Remote Machine Learning Compiler Engineer - Gensyn (London)

Surbiton, Greater London, UK
Hybrid / WFH Options
Blockchain Works
of two of the following compiler areas: Front-end - handle the handshaking of common Deep Learning Frameworks with Gensyn's IR for internal IR usage. Write transformation passes in ONNX to alter IR for middle-end consumption. Middle-end - write compiler passes for training-based compute graphs, integrate reproducible Deep Learning kernels into the code generation stage, and debug compilation More ❯
Employment Type: Full-time
Posted:

Remote Machine Learning Compiler Engineer - Gensyn

London, United Kingdom
Hybrid / WFH Options
Blockchain Works
of two of the following compiler areas: Front-end - handle the handshaking of common Deep Learning Frameworks with Gensyn's IR for internal IR usage. Write transformation passes in ONNX to alter IR for middle-end consumption. Middle-end - write compiler passes for training-based compute graphs, integrate reproducible Deep Learning kernels into the code generation stage, and debug compilation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer - Wearable Health Algorithms (London)

London, UK
all.health
Preferred Qualifications: Experience building ML models with wearable data (e.g., continuous heart rate, motion, respiration). Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX). Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR). Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks. #J-18808-Ljbffr More ❯
Employment Type: Full-time
Posted:

Machine Learning Scientist

London, United Kingdom
Symphony Industrial AI, Inc
from recent ML papers to solve practical challenges using retail image datasets Benchmark multiple approaches through rigorous experimental pipelines Build scalable inference pipelines using frameworks such as PyTorch, TensorRT, ONNX, and TensorFlow Lite (cloud and edge) Collaborate with engineers and product stakeholders to integrate models into end-to-end systems Perform error analysis and iterate on models continuously in production … models such as diffusion models and vision-language models (VLMs) Proficient in Python and familiar with ML frameworks like PyTorch Familiarity with model optimization and inference acceleration tools (e.g., ONNX, TensorRT, OpenVINO, vLLM) Demonstrated ability to ship ML models in production settings, including performance monitoring and failure analysis Excellent communication skills and proven ability to collaborate across cross-functional teams More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
ONNX
London
25th Percentile
£92,500
Median
£95,000
75th Percentile
£97,500