1 of 1 vLLM Jobs in London

Machine Learning Engineer

Hiring Organisation
Valent
Location
Greater London, England, United Kingdom
techniques (e.g., PEFT, LoRA, QLoRA) on open-source models (e.g., Llama, Mistral), alignment-tuning (e.g. DPO, ORPO). APIs: OpenAI, Anthropic, Gemini, etc. Inference: vLLM, llama.cpp, SGLang, etc. MLOps & Deployment Containerization: Docker Orchestration: Kubernetes (K8s) (for scalable inference) Cloud Platform: AWS, GCP, or Azure (experience with S3, EC2/ ...