AI Engineer
London, United Kingdom
83zero Ltd
- Inference experience for high-throughput model serving
- Proven ability to work on air-gapped systems with no external package repositories
- Experience with GPU orchestration (NVIDIA A100/H100) and CUDA optimisation
- Python expertise with offline dependency management and local package mirrors

Technical Stack (All On-Premises)
Models: Llama 3, Mistral, Qwen (locally hosted)
Vector Stores: Chroma, FAISS, Milvus
Orchestration
Employment Type: Contract
Rate: £625 - £650/day
Posted: