12 of 12 vLLM Jobs in the North of England

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: Chester, Cheshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: Sheffield, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: Liverpool, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: Leeds, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: Doncaster, South Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: Wakefield, West Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: York, North Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: Hull, East Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation: A1
Location: Newcastle upon Tyne, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: Newcastle upon Tyne, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: Hull, East Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Lead Machine Learning Engineer

Hiring Organisation: A1
Location: York, North Yorkshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...