Hands-on experience with large-scale language models (LLMs) and prompt engineering (e.g., GPT, BERT, T5 family). Familiarity with on-device or edge-AI deployments (e.g., TensorFlow Lite, ONNX, mobile/embedded inference). Knowledge of MLOps tooling (MLflow, Weights & Biases, Kubeflow, or similar) for experiment tracking and model governance. Open-source contributions or published papers in top-tier More ❯
functions, evaluation metrics, and training procedures Contribute to model selection, architecture design, and technology stack Evaluate model performance Drive innovation initiatives and proof-of-concept projects Export models to ONNX and deploy and integrate them into our C++ environment using TensorRT Optimise existing models for improved accuracy, efficiency, and scalability Build and maintain machine learning infrastructure and deployment pipelines Implement More ❯
functions, evaluation metrics, and training procedures Contribute to model selection, architecture design, and technology stack Evaluate model performance Drive innovation initiatives and proof-of-concept projects Export models to ONNX and deploy and integrate them into our C++ environment using TensorRT Optimise existing models for improved accuracy, efficiency, and scalability Build and maintain machine learning infrastructure and deployment pipelines Implement More ❯
NumPy, Jupyter Notebooks, Matplotlib, Seaborn, Flask (for building APIs), FastAPI, Docker, MLflow, DVC (Data Version Control), AWS SageMaker, Azure Machine Learning, Google Cloud AI Platform, TensorFlow Serving, ONNX (OpenNeuralNetworkExchange) We have several exciting new positions available for Machine Learning Engineers to join a forward-thinking AI company. This is your opportunity to collaborate with top talent in … NumPy, Jupyter Notebooks, Matplotlib, Seaborn, Flask (for building APIs), FastAPI, Docker, MLflow, DVC (Data Version Control), AWS SageMaker, Azure Machine Learning, Google Cloud AI Platform, TensorFlow Serving, ONNX (OpenNeuralNetworkExchange) This is your chance to contribute to an innovative, technically demanding project focused on developing a cutting-edge machine learning application from the ground up. The field of More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
computer vision and a strong understanding of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent More ❯
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
model robustness on diverse datasets. Leading the full lifecycle of model development, from research and training to validation and performance benchmarking. Mastering model export to various formats such as ONNX, OpenVINO, and TensorRT to support a wide range of hardware. Working on model deployment strategies, including optimizing models for high-performance inference on both cloud and edge devices. Contributing to More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
Experience working with a modern cloud service (AWS, GCP, Azure etc.) Nice to Have Hands-on experience with autonomous driving systems Experience with model deployment with NVIDIA stack (e.g. ONNX graphs, TensorRT, profiling) Familiarity with recent breakthroughs in ML (e.g. foundation models, pre-training and efficient fine-tuning, multimodal Transformer architectures) Knowledge of autonomous driving, large-scale data curation pipelines More ❯
from recent ML papers to solve practical challenges using retail image datasets Benchmark multiple approaches through rigorous experimental pipelines Build scalable inference pipelines using frameworks such as PyTorch, TensorRT, ONNX, and TensorFlow Lite (cloud and edge) Collaborate with engineers and product stakeholders to integrate models into end-to-end systems Perform error analysis and iterate on models continuously in production … models such as diffusion models and vision-language models (VLMs) Proficient in Python and familiar with ML frameworks like PyTorch Familiarity with model optimization and inference acceleration tools (e.g., ONNX, TensorRT, OpenVINO, vLLM) Demonstrated ability to ship ML models in production settings, including performance monitoring and failure analysis Excellent communication skills and proven ability to collaborate across cross-functional teams More ❯
implementation. Extensive experience with common machine learning Python frameworks such as TensorFlow and PyTorch; and Python libraries such as pandas, and computer vision libraries such as OpenCV. Experience in ONNX and TensorRT. Very comfortable working in Linux environment. Familiarity with software development tools and agile development practices. 6 years experience in developing, optimizing, and testing deep learning in computer vision More ❯
environments Preferred Qualifications: Experience building ML models with wearable data (e.g., continuous heart rate, motion, respiration) Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX) Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR) Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks Seniority level Seniority level Not Applicable Employment type Employment More ❯
Preferred Qualifications: Experience building ML models with wearable data (e.g., continuous heart rate, motion, respiration). Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX). Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR). Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks. #J-18808-Ljbffr More ❯
Preferred Qualifications: Experience building ML models with wearable data (e.g., continuous heart rate, motion, respiration). Exposure to embedded AI or edge model deployment (e.g., TensorFlow Lite, Core ML, ONNX). Knowledge of healthcare data privacy and security (e.g., HIPAA, GDPR). Familiarity with GMLP (Good Machine Learning Practice) and clinical evaluation frameworks. The successful candidate’s starting pay will More ❯
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we More ❯
for neuralnetwork deployment and optimization. Benchmark, analyze, and improve AI workload performance. Collaborate with the hardware team to guide architectural decisions. Extend support to additional frameworks (e.g., TensorFlow, ONNX). Produce developer documentation and resources. Requirements: 5+ years of experience in AI/ML software development. Deep understanding of PyTorch internals and other major ML frameworks. Experience optimizing deep More ❯
bottlenecks, and optimizing them Good understanding of docker and containerization (Good to have) experience with Pytorch and Python3, and comfortable with C++ (Good to have) Understanding of Torch script, ONNX runtime, TensorRT (Good to have) Understanding of half-precision inference and int8 quantization What we offer Company equity % in an early-stage startup #J-18808-Ljbffr More ❯