1 of 1 vLLM Jobs in Livingston

AI Infrastructure Architect

Hiring Organisation
Microtech Global Ltd
Location
Livingston, West Lothian, UK
Employment Type
Full-time
operating systems, and runtime environments; Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling vLLM, SGLang, Ray Serve, etc.); understand common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation Proficient in using ...