RAG (Retrieval-Augmented Generation), Graph RAG, Agentic RAG, and multi-agent systems. Vector Databases & Embeddings: Expertise in working with various embedding models and vector databases (e.g., Pinecone, Weaviate, Chroma, FAISS). Advanced AI Concepts: Strong grasp of advanced techniques such as complex task decomposition for agents, reasoning engines, knowledge graphs, autonomous agent design, and evaluation methodologies for complex AI systems.
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI …
Cambridge, Cambridgeshire, England, United Kingdom Hybrid / WFH Options
Ascent Sourcing Ltd
with memory frameworks like Mem0 or Letta. Understanding and production experience with traditional RAG and agentic RAG architectures. Strong grasp of embedding systems and vector databases (e.g., Pinecone, Weaviate, FAISS). Production experience with LiveKit or similar audio/video platforms. What You’ll Be Doing: Building and deploying scalable AI features, agents, and services powered by LLMs. Prototyping and …
fine-tuning large-scale transformer models (BERT, GPT) and prompt engineering for sentiment tasks; Background building active-learning and annotation pipelines to bootstrap training data; Familiarity with semantic search or vector databases (Elasticsearch, FAISS, Pinecone) for topic modeling and similarity queries; Familiarity with crypto markets, order books, and risk-management frameworks; Familiarity with anomaly-detection methods for streaming text and time-series data; Experience developing EVM …
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
East London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Central London / West End, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
Ashton-Under-Lyne, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
and scaling LLMs for real-world applications. Key Skills: Strong Python engineering background; experience with LLMs (e.g. Hugging Face, OpenAI, LangChain); model fine-tuning, RAG pipelines, vector databases (e.g. FAISS, Pinecone); cloud (AWS/GCP), CI/CD, Docker. Bonus: Knowledge of model optimization, quantization, or open-source contributions. 📩 If interested, send your CV to adeeb.rahman@opusrs.com
large language models and enthusiasm for solving real-world product challenges; Clear communication and ability to thrive in a distributed team. Nice to Have: Experience with vector databases (e.g., FAISS, Weaviate, pgvector); Knowledge of survey data structures or market research workflows; Familiarity with statistics (e.g., weighting, significance testing); Experience with Docker, Hugging Face Transformers, or cloud-based deployment; Awareness of …
London, South East, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
quality and reliability. Essential Skills: Strong Python skills and experience with Hugging Face Transformers; Familiarity with LLM fine-tuning and inference optimisation; Experience with vector search and embeddings (e.g. FAISS, Pinecone); Understanding of prompt engineering and few-shot learning; Ability to work independently in a hybrid, agile environment. Nice to Have: Experience with LangChain, LlamaIndex, or similar orchestration tools; Exposure …
Develop and maintain AI microservices using Docker, Kubernetes, and FastAPI, ensuring smooth model serving and error handling; Vector Search & Retrieval: Implement retrieval-augmented workflows: ingest documents, index embeddings (Pinecone, FAISS, Weaviate), and build similarity-search features. Rapid Prototyping: Create interactive AI demos and proofs-of-concept with Streamlit, Gradio, or Next.js for stakeholder feedback; MLOps & Deployment: Implement CI/CD … fine-tuning LLMs via OpenAI, HuggingFace or similar APIs; Strong proficiency in Python; Deep expertise in prompt engineering and tooling like LangChain or LlamaIndex; Proficiency with vector databases (Pinecone, FAISS, Weaviate) and document embedding pipelines; Proven rapid-prototyping skills using Streamlit or equivalent frameworks for UI demos. Familiarity with containerization (Docker) and at least one orchestration/deployment platform; Excellent …