slough, south east england, united kingdom Hybrid / WFH Options
microTECH Global LTD
to deploy GPU and ML workloads at scale. Provision and optimise GPU cloud infrastructure (AWS, GCP, Azure) using Terraform/Ansible. Collaborate with GPU engineers and researchers to integrate CUDA, SYCL, Vulkan, and ML kernels into production workflows. Support secure packaging, deployment, and distribution of GPU-accelerated software to partners and clients. Evolve infrastructure to support hybrid AI/… GitLab CI, etc.). Proficiency in containerisation and orchestration (Docker, Kubernetes). Experience with cloud GPU infrastructure (AWS, Azure, GCP) and IaC (Terraform, Ansible). Familiarity with GPU workflows (CUDA, SYCL, Vulkan, OpenCL) or HPC performance optimisation. Strong scripting and programming skills (Python, Bash, C/C++ exposure a plus). Knowledge of monitoring, logging, and performance testing for More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Certain Advantage
World Class Defence Organisation based in Bristol is currently looking to recruit an Embedded C++ Software Engineer subcontractor on an initial 6 month contract. The role can be worked on a 4 day week basis (Monday to Thursday) but due More ❯
architectures Develop low-latency inference systems providing real-time, high-accuracy predictions in production Optimise and extend machine learning frameworks to improve training and inference performance Leverage GPU programming (CUDA, cuDNN, TensorRT) to maximise efficiency Automate model experimentation, tuning and retraining in partnership with research teams Work with infrastructure specialists to optimise workflows and reduce compute costs Assess and … ML development and deployment Skills and Experience 5+ years’ experience in machine learning with a focus on training and inference systems Strong programming expertise in Python and C++ or CUDA Proficiency with PyTorch, TensorFlow or JAX Hands-on experience with GPU acceleration and distributed training (Horovod, NCCL or similar) Background in real-time, low-latency ML pipelines Familiarity with More ❯
architectures Develop low-latency inference systems providing real-time, high-accuracy predictions in production Optimise and extend machine learning frameworks to improve training and inference performance Leverage GPU programming (CUDA, cuDNN, TensorRT) to maximise efficiency Automate model experimentation, tuning and retraining in partnership with research teams Work with infrastructure specialists to optimise workflows and reduce compute costs Assess and … ML development and deployment Skills and Experience 5+ years’ experience in machine learning with a focus on training and inference systems Strong programming expertise in Python and C++ or CUDA Proficiency with PyTorch, TensorFlow or JAX Hands-on experience with GPU acceleration and distributed training (Horovod, NCCL or similar) Background in real-time, low-latency ML pipelines Familiarity with More ❯