and modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS More ❯
and modern web frameworks Deep experience with AI/ML frameworks (PyTorch, TensorFlow, Transformers, LangChain) Mastery of prompt engineering and fine-tuning Large Language Models Proficient in vector databases (Pinecone, Weaviate, Milvus) and embedding technologies Expert in building RAG (Retrieval-Augmented Generation) systems at scale Strong experience with MLOps practices and model deployment pipelines Proficient in cloud AI services (AWS More ❯
Cambridge, Cambridgeshire, England, United Kingdom Hybrid / WFH Options
Ascent Sourcing Ltd
Practical experience with memory frameworks like Mem0 or Letta. Understanding and production experience with traditional RAG and agentic RAG architectures. Strong grasp of embedding systems and vector databases (e.g., Pinecone, Weaviate, FAISS). Production experience with LiveKit or similar audio/video platforms. What You’ll Be Doing Building and deploying scalable AI features, agents, and services powered by LLMs. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
and reliability Essential Skills Strong Python skills and experience with Hugging Face Transformers Familiarity with LLM fine-tuning and inference optimisation Experience with vector search and embeddings (e.g. FAISS, Pinecone) Understanding of prompt engineering and few-shot learning Ability to work independently in a hybrid, agile environment Nice to Have Experience with LangChain, LlamaIndex, or similar orchestration tools Exposure to More ❯