101 to 105 of 105 Permanent vLLM Jobs

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Stockport, Greater Manchester, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Newcastle upon Tyne, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
A1
Location
Stoke-on-Trent, Staffordshire, UK
Employment Type
Full-time
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation
Bjak
Location
München, Bayern, Germany
Employment Type
Permanent
Salary
EUR 80,000 - 110,000 Annual
open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...