Stevenage, Hertfordshire, South East, United Kingdom
Guidant Global
with MATLAB/Simulink models. The focus is on optimising GPU and CPU code to enhance simulation speed and runtime efficiency. Key Responsibilities: Proficient in C/C++ and CUDA for high-performance application development Strong foundation in Object-Oriented Programming principles and design Skilled in GPU/CPU performance tuning and optimisation techniques Experienced in managing memory across … GPU and CPU architectures Capable of producing clear, concise, and technically sound documentation What do you need?: Proficient in C/C++ and CUDA for high-performance application development Strong foundation in Object-Oriented Programming principles and design Skilled in GPU/CPU performance tuning and optimisation techniques Experienced in managing memory across GPU and CPU architectures Capable of More ❯
Employment Type: Contract
Rate: Up to £75 per hour PAYE and Umbrella pay option available
no checklist, but you’ll likely thrive in this role if you have: Technical Experience Strong engineering skills in Python, C++, or Rust Proven experience with GPU performance engineering: CUDA, PTX/SASS, Tensor Cores, memory hierarchy, warp-level primitives Familiarity with ML frameworks like PyTorch, and their internals Proficiency in profiling and debugging tools like NSight, CUDAMore ❯
no checklist, but you’ll likely thrive in this role if you have: Technical Experience Strong engineering skills in Python, C++, or Rust Proven experience with GPU performance engineering: CUDA, PTX/SASS, Tensor Cores, memory hierarchy, warp-level primitives Familiarity with ML frameworks like PyTorch, and their internals Proficiency in profiling and debugging tools like NSight, CUDAMore ❯
Stevenage, Hertfordshire, South East, United Kingdom
Morson Talent
The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Key Skillset Essential: C/C++ and CUDA programming Object-Oriented Programming GPU/CPU optimisation GPU/CPU Memory Management Technical report writing Desirable: Network Programming Configuration control and model release processes Continuous Integration and Testing More ❯
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
South East London, England, United Kingdom Hybrid / WFH Options
Techfellow Limited
as-code mindset Hands-on experience resolving GPU workload issues across compute clusters and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication More ❯
Contribute to hiring additional talent to our rapidly growing team The role will be exposed to a broad tech stack (e.g. ReactJS, Python, REST & GraphQL, OpenCV, PyTorch, GCP, AWS & CUDA, Kubernetes) and the cutting edge of computer vision and deep learning. Qualifications The right candidate will have a proven track record of relevant publications and previous experience managing applied More ❯
Software Engineer with graphics processing experience (GPU, CUDA) is re quired for a long term contract assignment based in Stevenage or Bristol - full time on site. Essential experience: C/C++ and CUDA programming Object-Oriented Programming GPU/CPU optimisation GPU/CPU Memory Management Technical report writing Desirable experience Network Programming Configuration control and model release More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
of model architectures like transformers and CNNs. Hands-on experience with model optimization (i.e. quantization, pruning) and model deployment frameworks such as TensorRT, ONNX Runtime, and OpenVINO. Proficiency with CUDA programming and optimizing code for GPU acceleration. Strong background in MLOps practices, including CI/CD using GitHub Actions and containerization with Docker. Excellent problem-solving skills and the More ❯
level position 5+ years' experience in software development. Good development skills in cloud visualization applications where knowledge is key. Computer Graphics WebGL OpenGL HTML5 MEAN stack. Java/C++ CUDA Augmented/Virtual Reality Game Engines Video streaming a plus. Please, get in touch to discuss and apply for this exciting role More ❯
using Docker and orchestrating them with Kubernetes for scalable model serving. Optimizing the performance of our Ultralytics YOLO11 models for various deployment targets, from high-performance cloud GPUs with CUDA to edge devices using frameworks like TensorRT and OpenVINO. Implementing robust systems for model monitoring and maintenance to track performance and detect data drift. Collaborating closely with our AI … experience with at least one major cloud provider ( GCP , Azure, AWS). Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible. Familiarity with GPU acceleration using CUDA and model optimization for inference. Knowledge of MLOps tools for experiment tracking, and model serving such as MLflow, Kubeflow, or Weights & Biases. Excellent problem-solving skills and the ability More ❯
using Docker and orchestrating them with Kubernetes for scalable model serving. Optimizing the performance of our Ultralytics YOLO11 models for various deployment targets, from high-performance cloud GPUs with CUDA to edge devices using frameworks like TensorRT and OpenVINO. Implementing robust systems for model monitoring and maintenance to track performance and detect data drift. Collaborating closely with our AI … experience with at least one major cloud provider ( GCP , Azure, AWS). Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible. Familiarity with GPU acceleration using CUDA and model optimization for inference. Knowledge of MLOps tools for experiment tracking, and model serving such as MLflow, Kubeflow, or Weights & Biases. Excellent problem-solving skills and the ability More ❯
Bristol, Somerset, United Kingdom Hybrid / WFH Options
Certain Advantage
models. The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Essential experience: : C/C++ and CUDA programming : Object-Oriented Programming : GPU/CPU optimisation : GPU/CPU Memory Management : Technical report writing Desirable experience : Network Programming : Configuration control and model release processes : Continuous Integration and More ❯
Stevenage, Hertfordshire, United Kingdom Hybrid / WFH Options
Certain Advantage
models. The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Essential experience: : C/C++ and CUDA programming : Object-Oriented Programming : GPU/CPU optimisation : GPU/CPU Memory Management : Technical report writing Desirable experience: : Network Programming : Configuration control and model release processes : Continuous Integration and More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Certain Advantage
models. The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Essential experience: : C/C++ and CUDA programming : Object-Oriented Programming : GPU/CPU optimisation : GPU/CPU Memory Management : Technical report writing Desirable experience : Network Programming : Configuration control and model release processes : Continuous Integration and More ❯
Stevenage, Hertfordshire, South East, United Kingdom Hybrid / WFH Options
Certain Advantage
models. The focus of this work is efficiency and run-time improvements of the simulations through the optimisation of GPU and CPU code. Essential experience: : C/C++ and CUDA programming : Object-Oriented Programming : GPU/CPU optimisation : GPU/CPU Memory Management : Technical report writing Desirable experience: : Network Programming : Configuration control and model release processes : Continuous Integration and More ❯
and hands-on experience with frameworks like PyTorch, TensorFlow, or JAX Experience building ML models in unique settings (e.g., constrained hardware or novel data) Familiarity with GPU processing (e.g., CUDA) and a range of ML techniques Bonus if you’ve worked in a client-facing or consultancy setting Next Steps This Machine Learning Consultant role is generating a lot More ❯
with the latest developments in model optimization, inference engines, quantization methods, and related technologies. Requirements Proven professional experience optimizing neural network inference workloads. Strong expertise with TensorRT, Triton language, CUDA programming. Experience with neural network quantization techniques. Proficiency in Python and PyTorch. Deep understanding of GPU architectures and performance optimization. Excellent problem-solving skills and ability to analyze performance More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
/CD pipelines using GitHub Actions . Experience with analytics platforms like Google Analytics and business intelligence tools like Tableau or Power BI. Knowledge of GPU-accelerated computing with CUDA is highly desirable. Excellent problem-solving skills and the ability to thrive in a fast-paced, high-intensity startup environment. 🌟 Cultural Fit - Intensity Required Ultralytics is a high-performance More ❯
that bear little resemblance to publicly available substitutes. Utmost integrity, confidentiality, and discretion in both internal and external interactions. What We Value Experience writing and optimizing compute kernels with CUDA or similar languages. History of developing creative approaches to drive high ML accuracy within an alloted computational budget. Competitive Compensation. We provide financial peace of mind with competitive base More ❯
top degree in a STEM subject UK nationality Experience deploying machine learning on hardware, from embedded systems to edge computing, is desirable. Knowledge of GPU programming languages and frameworks (CUDA, ROCm, etc.) is also a plus. Your future colleagues are highly skilled professionals from diverse industry backgrounds, fostering a low-management, team-oriented environment that values individual expertise. Benefits More ❯