4 of 4 Permanent vLLM Jobs in the UK excluding London

Senior Software Engineer

Hiring Organisation: Jobleads-UK
Location: Cambridge, England, United Kingdom

related field. Desirable: Exposure to machine learning frameworks such as PyTorch, JAX, Triton, TensorFlow Experience with distributed workload management systems such as Kubernetes, VLLM, Keras or MLOps pipelines Experience working with hardware simulators or emulators (e.g. QEMU). Experience developing for or working with FPGA-based systems. Experience with people ...

Software Inference Deployment Engineer

Hiring Organisation: Jobleads-UK
Location: Oxford, England, United Kingdom

PyTorch in particular) Practical experience with model deployment workflows - loading, format conversion, quantisation, or framework integration Comfortable working with inference serving stacks (for example vLLM, TensorRT‐LLM, or similar) Familiarity with Linux, containerisation (Docker), and cluster environments Comfortable in a customer‐facing role, able to communicate clearly with ...

Lead Site Reliability Engineer - Operations Excellence

Hiring Organisation: Jobleads-UK
Location: Glasgow, Scotland, United Kingdom

reliability, performance, and cost‐efficiency of the LLM inference platform end to end. You will operate large language model serving stacks (such as vLLM and llm‐d) in production at scale, with deep instrumentation and strong operational rigor. You will partner across engineering to deliver secure software, improve stability … infrastructure Build backend services and APIs that enable reliable operation of AI infrastructure in production Operate and scale LLM serving infrastructure (such as vLLM and llm‐d), including model hosting, request routing, continuous batching, and KV‐cache optimization Deploy, host, and lifecycle‐manage open‐source and proprietary LLMs on Amazon ...

Project Technical Lead - AI Systems Simulation

Hiring Organisation: Jobleads-UK
Location: Cambridge, England, United Kingdom

infrastructure, ML systems, or computer architecture. Familiarity with Agile or other modern technical project management frameworks. Knowledge of modern inference‐serving frameworks (e.g., vLLM). Background in statistics, operations research, or large‐scale datacenter infrastructure. Contributions to open‐source AI or systems projects. Benefits High‐impact role in a rapidly ...