51 to 75 of 105 vLLM Jobs

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Slough, Berkshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Northampton, Northamptonshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Norwich, Norfolk, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Chesterfield, Derbyshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Woking, Surrey, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Dartford, Kent, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Plymouth, Devon, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Bath, Somerset, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Lincoln, Lincolnshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Peterborough, Cambridgeshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding AI/ML Research Engineer

Hiring Organisation
A1
Location
Bournemouth, Dorset, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
United Kingdom
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Luton, Bedfordshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Bournemouth, Dorset, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Basildon, Essex, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Cheltenham, Gloucestershire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Chester, Cheshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Slough, Berkshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Plymouth, Devon, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...