London (City of London), South East England, United Kingdom
Liberty Towers
… using frameworks like Hugging Face Transformers, LangChain, OpenAI APIs, or other LLM orchestration tools. A solid understanding of tokenization, embedding models, vector databases (e.g., Pinecone, Weaviate, FAISS), and retrieval-augmented generation (RAG) pipelines. Experience designing and evaluating LLM-powered systems such as chatbots, summarization tools, content generation workflows, or intelligent …
London (City of London), South East England, United Kingdom
The Portfolio Group
… SageMaker, Bedrock, Lambda, S3, ECS, EKS. Full-Stack Integration: Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js. Vector Search: Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch. Required skills & experience: 3–5+ years of experience in ML engineering and software development. Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face …
London (City of London), South East England, United Kingdom
WeBuild-AI
… use cases and ML, AI opportunities. Experience with containerisation technologies (Docker, Kubernetes) for scalable data solutions. Experience with vector databases and graph databases (e.g., Pinecone, Neo4j, AWS Neptune). Understanding of data mesh-fabric approaches and modern data architecture patterns. Familiarity with AI/ML workflows and their data …
City of London, Greater London, UK Hybrid / WFH Options
twentyAI
Develop and optimise RAG pipelines using LangChain, LlamaIndex, or Haystack. Build ingestion workflows (OCR, chunking, embedding, semantic search) and integrate with vector databases (FAISS, Pinecone, Qdrant). Ensure seamless integration of GenAI services into business workflows, prioritising security, scalability, and compliance. Collaborate with cross-functional teams (data scientists, architects, engineers …
City of London, London, United Kingdom Hybrid / WFH Options
twentyAI
Develop and optimise RAG pipelines using LangChain, LlamaIndex, or Haystack. Build ingestion workflows (OCR, chunking, embedding, semantic search) and integrate with vector databases (FAISS, Pinecone, Qdrant). Ensure seamless integration of GenAI services into business workflows, prioritising security, scalability, and compliance. Collaborate with cross-functional teams (data scientists, architects, engineers …
London (City of London), South East England, United Kingdom Hybrid / WFH Options
twentyAI
Develop and optimise RAG pipelines using LangChain, LlamaIndex, or Haystack. Build ingestion workflows (OCR, chunking, embedding, semantic search) and integrate with vector databases (FAISS, Pinecone, Qdrant). Ensure seamless integration of GenAI services into business workflows, prioritising security, scalability, and compliance. Collaborate with cross-functional teams (data scientists, architects, engineers …
London (City of London), South East England, United Kingdom
Techmunity
… LLM orchestration and prompt engineering frameworks such as LangChain or LangGraph, plus designing retrieval-augmented generation (RAG) pipelines. Familiarity with vector databases like Qdrant, Pinecone, or Redis for low-latency AI retrieval. Experience deploying, monitoring, and scaling AI workloads on cloud platforms such as AWS, GCP, or BigQuery. Bonus points …
London (City of London), South East England, United Kingdom
Harnham
… driven products. Comfortable working in an agile, fast-paced startup environment. DESIRABLE SKILLS: Experience with media processing tools (FFmpeg, WebRTC). Familiarity with vector databases (Pinecone, Weaviate) and embedding workflows. Background in generative audio/video models or multimodal AI systems. HOW TO APPLY: Please register your interest by sending your …
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
… Do: Build and scale backend services (Python/FastAPI) for ingesting, indexing, and serving medical content. Develop retrieval infrastructure using vector databases (e.g. PGVector, Pinecone). Deploy on GCP (Cloud Run, Vertex AI) with Terraform, CI/CD, and observability tools. Collaborate across product, mobile, and clinical teams to ship …
London (City of London), South East England, United Kingdom
Morpheus Talent Solutions
… for future modular agent deployment. Stack: AI: Claude API (Anthropic), Exa (search/enrichment). Workflow Orchestration: n8n (webhooks, branching, logic). Memory & DB: PostgreSQL, Redis, Pinecone. Front-End: React, Tailwind. Infra: Replit, Make.com (light no-code layer), AWS/GCP optional. Requirements: Required: 7+ years in full-stack development (strong back …