12 of 12 vLLM Jobs in the UK

AI Engineer

Hiring Organisation
Noxus
Location
City of London, London, United Kingdom
deep learning. Experience with evaluation frameworks and observability tools. Contributions to open-source AI projects or published research. Familiarity with MLOps, model serving (vLLM, TGI), and infrastructure for AI at scale. Background in fine-tuning, RLHF, or working with open-weight models. ...

Senior PyTorch Engineer

Hiring Organisation
Advanced Micro Devices
Location
East Anglia, United Kingdom
Employment Type
Permanent
test design to ensure high-quality, maintainable software solutions. AI Framework & Deep Learning: Strong understanding of AI frameworks such as PyTorch, Triton and vLLM, with applied knowledge across domains such as Natural Language Processing, Vision, Audio and Recommendation Systems. GPU Computing: Strong experience with GPU Programming models (CUDA, HIP). ...

Lead Python Engineer

Hiring Organisation
Harrington Starr
Location
London Area, United Kingdom
exposure Proven experience leading projects or mentoring engineers Strong academic background (Computer Science or related discipline) Nice to have: Production model deployment (HuggingFace/vLLM/Ollama) Model fine-tuning experience Open-source contributions Why Join Competitive salary + equity Direct influence on product and engineering roadmap Work on applied ...

AI/ML Tech Lead

Hiring Organisation
DGC Search
Location
England, United Kingdom
content moderation architecture and NSFW classifiers Strong proficiency in Python and PyTorch Comfortable thriving in fast-paced, remote environments 🛠️ Tech Stack Python • PyTorch • vLLM • HuggingFace • SFT • RLHF • DPO • Generative AI • Content Moderation • Inference Optimization 💰 What's On Offer Salary: £120,000 - £160,000 Location: Fully Remote (must be based ...

AI Systems Research Engineer

Hiring Organisation
microTECH Global LTD
Location
Edinburgh, Scotland, United Kingdom
Strong knowledge of distributed systems, operating systems, machine learning systems architecture, Inference serving, and AI Infrastructure. · Hands-on experience with LLM serving frameworks (e.g., vLLM, Ray Serve, TensorRT-LLM, TGI) and distributed KV cache optimization. · Proficiency in C/C++, with additional experience in Python for research prototyping. · Solid grounding ...

AI Infrastructure Architect

Hiring Organisation
Microtech Global Ltd
Location
Dunfermline, Fife, UK
Employment Type
Full-time
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

AI Infrastructure Architect

Hiring Organisation
Microtech Global Ltd
Location
Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type
Permanent
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

AI Infrastructure Architect

Hiring Organisation
Microtech Global Ltd
Location
Broughton, Scottish Borders, UK
Employment Type
Full-time
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

AI Infrastructure Architect

Hiring Organisation
Microtech Global Ltd
Location
Livingston, West Lothian, UK
Employment Type
Full-time
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

Infrastructure Architect

Hiring Organisation
microTECH Global LTD
Location
Edinburgh, Scotland, United Kingdom
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...

esearch (Systems) Engineer

Hiring Organisation
Microtech Global Ltd
Location
Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type
Permanent
field. Strong knowledge of distributed systems, operating systems,machine learning systems architecture, Inference serving, and AI Infrastructure. Hands-on experience withLLM serving frameworks(e.g.,vLLM,Ray Serve,TensorRT-LLM,TGI) anddistributed KV cache optimization. Proficiency inC/C++, with additional experience inPythonfor research prototyping. Solid grounding insystems research methodology,distributed ...

DevSecOps Engineer

Hiring Organisation
Formula Recruitment
Location
England, United Kingdom
shared AI platform running multiple LLM-based projects Owning and maintaining Linux-based infrastructure and containerised services Operating and troubleshooting Dockerised services (PostgreSQL, Qdrant, vLLM, LLDAP) Managing NGINX reverse proxies to securely expose internal tools and front ends Supporting and evolving GitLab CI/CD pipelines Deploying and managing infrastructure … using Terraform Supporting open-source LLM deployments (Hugging Face models, vLLM, LangChain) Required skills: Proven experience in DevOps/Platform/Infrastructure Engineering Strong Linux experience (VMs, shell scripting) Hands-on Docker experience in production environments CI/CD experience, ideally GitLab Experience with NGINX or similar reverse proxies Infrastructure ...