specific to RAG (Retrieval-Augmented Generation), Graph RAG, Agentic RAG, and multi-agent systems. Vector Databases & Embeddings: Expertise in working with various embedding models and vector databases (e.g., Pinecone, Weaviate, Chroma, FAISS). Advanced AI Concepts: Strong grasp of advanced techniques such as complex task decomposition for agents, reasoning engines, knowledge graphs, autonomous agent design, and evaluation methodologies for complex More ❯
to detail, and a collaborative, growth-focused mindset. Experience working in agile, product-driven engineering teams. Preferred Qualifications: Exposure to Retrieval-Augmented Generation (RAG) pipelines, vector databases (e.g., Pinecone, Weaviate, Milvus), and knowledge bases, with familiarity in integrating them with LLMs. Experience with advanced model monitoring, observability, and governance of LLMs and generative AI systems. Experience with data engineering or More ❯
/services. Strong skills in MLOps: containerisation (Docker, Kubernetes), cloud deployment (AWS, GCP, Azure), and CI/CD pipelines. Experience with prompt engineering, LLM evaluation, and vector databases (Pinecone, Weaviate, FAISS). Excellent communication skills and cross-functional collaboration experience. Benefits: Our team at OneClickComply is central to our 100% customer satisfaction, and we offer some of the best benefits More ❯
building with generative AI applications in production environments. Expertise with microservices architecture and RESTful APIs. Solid understanding of database technologies such as PostgreSQL and vector databases as Elastic, Pinecone, Weaviate, or similar. Familiarity with cloud platforms (AWS, GCP, etc.) and containerized environments (Docker, Kubernetes). You are committed to writing clean, maintainable, and scalable code, following best practices in software More ❯
Cambridge, Cambridgeshire, England, United Kingdom Hybrid / WFH Options
Ascent Sourcing Ltd
experience with memory frameworks like Mem0 or Letta. Understanding and production experience with traditional RAG and agentic RAG architectures. Strong grasp of embedding systems and vector databases (e.g., Pinecone, Weaviate, FAISS). Production experience with LiveKit or similar audio/video platforms. What You’ll Be Doing Building and deploying scalable AI features, agents, and services powered by LLMs. Prototyping More ❯
cycles with weekly releases Prompt Engineering: Able to craft, test, and refine prompts for outputs like chat, image, or video LLM + Vector Tools: Experience with embeddings, Pinecone/Weaviate, and building RAG-style pipelines Why Join Magic Group? No red tape. Just build, launch, and scale A small, senior team with high trust and even higher impact If you More ❯
and maintain AI microservices using Docker, Kubernetes, and FastAPI, ensuring smooth model serving and error handling; Vector Search & Retrieval: Implement retrieval-augmented workflows: ingest documents, index embeddings (Pinecone, FAISS, Weaviate), and build similarity search features. Rapid Prototyping: Create interactive AI demos and proofs-of-concept with Streamlit, Gradio, or Next.js for stakeholder feedback; MLOps & Deployment: Implement CI/CD pipelines … tuning LLMs via OpenAI, HuggingFace or similar APIs; Strong proficiency in Python; Deep expertise in prompt engineering and tooling like LangChain or LlamaIndex; Proficiency with vector databases (Pinecone, FAISS, Weaviate) and document embedding pipelines; Proven rapid-prototyping skills using Streamlit or equivalent frameworks for UI demos. Familiarity with containerization (Docker) and at least one orchestration/deployment platform; Excellent communication More ❯
Our benefits Share Options (EMI) scheme 25 days annual leave, plus flexible bank holidaysand the opportunity to buy additional days Enhanced workplace Pension scheme - opt in salary sacrifice scheme Life Insurance (3x annual salary) Employee Assistance Programme (EAP) and workplace More ❯