Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for classified materials with appropriate data sanitisation - Build monitoring and evaluation systems that operate within air-gapped infrastructure Essential Requirements - Active SC … Python expertise with offline dependency management and local package mirrors Technical Stack (All On-Premises) Models: Llama 3, Mistral, Qwen (locally hosted) Vector Stores: Chroma, FAISS, Milvus Orchestration: LangChain, LangGraph for agents Hosting: vLLM, TGI, Ollama on bare metal/private cloud Infrastructure: Air-gapped Kubernetes, local container registries Desirable Skills - Experience with defence/government IT security protocols - Knowledge More ❯