101 to 105 of 105 Permanent vLLM Jobs

Founding Machine Learning Engineer

Hiring Organisation: A1
Location: Stockport, Greater Manchester, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation: A1
Location: Newport, Isle of Wight, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation: A1
Location: Newcastle upon Tyne, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation: A1
Location: Stoke-on-Trent, Staffordshire, UK
Employment Type: Full-time

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...

Founding Machine Learning Engineer

Hiring Organisation: Bjak
Location: München, Bayern, Germany
Employment Type: Permanent
Salary: EUR 80,000 - 110,000 Annual

open-source frontier models Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation) Architect scalable inference systems using vLLM/TensorRT-LLM/DeepSpeed Build data systems for high-quality synthetic and real-world training data Develop alignment, safety, and guardrail strategies Design evaluation … memory efficiency, quantization, mixed precision Comfortable owning ambiguous, zero-to-one technical problems end-to-end Nice to Have Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer) Contributions to open-source ML libraries Background in scientific computing, compilers, or GPU kernels Experience with RLHF pipelines (PPO, DPO, ORPO) Experience ...