16 of 16 vLLM Jobs in the UK excluding London

DevOps Engineer

Hiring Organisation: Aveni
Location: Birmingham, UK
Employment Type: Full-time

scripting skills in Python or Bash Familiar with ML lifecycle tools, model monitoring, and versioning Exposure to tools like KServe, Ray Serve, Triton, or vLLM a big plus Bonus Points: Experience with observability frameworks like Prometheus or OpenTelemetry Knowledge of ML libraries: TensorFlow, PyTorch, HuggingFace Exposure to Azure ...

DevOps Engineer

Hiring Organisation: Aveni
Location: Cheltenham, Gloucestershire, UK
Employment Type: Full-time

DevOps Engineer

Hiring Organisation: Aveni
Location: Oxford, Oxfordshire, UK
Employment Type: Full-time

DevOps Engineer

Hiring Organisation: Aveni
Location: Exeter, Devon, UK
Employment Type: Full-time

DevOps Engineer

Hiring Organisation: Aveni
Location: Wakefield, West Yorkshire, UK
Employment Type: Full-time

DevOps Engineer

Hiring Organisation: Aveni
Location: Hemel Hempstead, Hertfordshire, UK
Employment Type: Full-time

DevOps Engineer

Hiring Organisation: Aveni
Location: Bolton, Greater Manchester, UK
Employment Type: Full-time

Senior PyTorch Engineer

Hiring Organisation: Advanced Micro Devices
Location: East Anglia, United Kingdom
Employment Type: Permanent

test design to ensure high-quality, maintainable software solutions. AI Framework & Deep Learning: Strong understanding of AI frameworks such as PyTorch, Triton and vLLM, with applied knowledge across domains such as Natural Language Processing, Vision, Audio and Recommendation Systems. GPU Computing: Strong experience with GPU Programming models (CUDA, HIP). ...

AI Systems Research Engineer

Hiring Organisation: microTECH Global LTD
Location: Edinburgh, Scotland, United Kingdom

Strong knowledge of distributed systems, operating systems, machine learning systems architecture, Inference serving, and AI Infrastructure. · Hands-on experience with LLM serving frameworks (e.g., vLLM, Ray Serve, TensorRT-LLM, TGI) and distributed KV cache optimization. · Proficiency in C/C++, with additional experience in Python for research prototyping. · Solid grounding ...

esearch (Systems) Engineer

Hiring Organisation: Microtech Global Ltd
Location: Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type: Permanent

field. Strong knowledge of distributed systems, operating systems,machine learning systems architecture, Inference serving, and AI Infrastructure. Hands-on experience withLLM serving frameworks(e.g.,vLLM,Ray Serve,TensorRT-LLM,TGI) anddistributed KV cache optimization. Proficiency inC/C++, with additional experience inPythonfor research prototyping. Solid grounding insystems research methodology,distributed ...

AI Infrastructure Architect

Hiring Organisation: Microtech Global Ltd
Location: Dunfermline, Fife, UK
Employment Type: Full-time

operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

AI Infrastructure Architect

Hiring Organisation: Microtech Global Ltd
Location: Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type: Permanent

AI Infrastructure Architect

Hiring Organisation: Microtech Global Ltd
Location: Broughton, Scottish Borders, UK
Employment Type: Full-time

AI Infrastructure Architect

Hiring Organisation: Microtech Global Ltd
Location: Livingston, West Lothian, UK
Employment Type: Full-time

Infrastructure Architect

Hiring Organisation: microTECH Global LTD
Location: Livingston, West Lothian, UK
Employment Type: Full-time

Infrastructure Architect

Hiring Organisation: microTECH Global LTD
Location: Edinburgh, Scotland, United Kingdom