up. What You’ll Do: Build scalable backend microservices in Python (FastAPI) to support RAG workflows and user queries. Develop and optimise vector search pipelines using tools like PGVector, Pinecone, or Weaviate. Design embedding orchestration and hybrid retrieval mechanisms. Implement evaluation frameworks (BLEU, ROUGE, hallucination checks) to monitor answer quality. Deploy production systems on GCP (Cloud Run, Vertex AI, BigQuery
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate
South East London, England, United Kingdom Hybrid / WFH Options
Anson McCade
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate
developer tools, open-source culture, and improving developer workflows. Excellent communication and collaboration skills in a remote-first environment. Experience contributing to open-source AI projects. Experience with LangChain, Pinecone, or similar AI frameworks/infrastructure. Past experience building AI features into developer platforms or tools. Benefits Our entire company is distributed, so we take remote work seriously. If you
building complex architectures from MVP to production. Solid hands-on experience with AI/LLM applications and model deployment. Comfortable across front-end (HTML, CSS, Tailwind) and back-end (Pinecone, microservices, serverless). Brownie points: SEO know-how; deeper AI/ML chops (Colab, Streamlit, FastAPI, PyTorch, etc.); entrepreneurial streak and previous startup exposure; ability to pivot quickly and learn on
ready to justify your pick. • Experience extending an *AWS stack (Terraform, ECS Fargate, ALB, Secrets Manager, KMS)*. • Hands-on with *LLM APIs* and at least one *vector database* (Pinecone, Weaviate, OpenSearch, etc.). • Multi-tenant data design with GDPR awareness. • CI/CD and automated-testing mindset. Nice-to-Haves • *Solution-architect background*—ability to map future services and
Liverpool, Lancashire, United Kingdom Hybrid / WFH Options
TEKsystems, Inc
management using frameworks such as LangChain, CrewAI, and Autogen. Engineer and tune prompts to enhance the performance and reliability of generative tasks. Design RAG systems using vector databases like Pinecone, Chroma, and PostgreSQL for contextual retrieval. Incorporate semantic search and embedding strategies for more relevant and grounded LLM responses. Utilize Guardrails to implement applications that adhere to responsible AI guidelines.
knowledge platform. What You’ll Do: Build and scale backend services (Python/FastAPI) for ingesting, indexing, and serving medical content. Develop retrieval infrastructure using vector databases (e.g. PGVector, Pinecone). Deploy on GCP (Cloud Run, Vertex AI) with Terraform, CI/CD, and observability tools. Collaborate across product, mobile, and clinical teams to ship weekly. Ensure secure, compliant data
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
knowledge platform. What You’ll Do: Build and scale backend services (Python/FastAPI) for ingesting, indexing, and serving medical content. Develop retrieval infrastructure using vector databases (e.g. PGVector, Pinecone). Deploy on GCP (Cloud Run, Vertex AI) with Terraform, CI/CD, and observability tools. Collaborate across product, mobile, and clinical teams to ship weekly. Ensure secure, compliant data
workflows. Strong experience in prompt engineering and chaining logic for AI-based tasks. Background in backend engineering (Python, NodeJS, or similar). Familiarity with tools like LangChain, OpenAI API, Pinecone, or similar. A problem-solver mindset, focused on delivering tangible outcomes using AI. Comfortable working in a fast-paced, agile environment. Excellent communication and collaboration skills. Nice to Have
LLM/agent-based prototypes (e.g., copilots, chatbots, A2A agents). Implement multi-step reasoning, memory modules, and RAG pipelines. Use frameworks like LangChain, LangGraph, CrewAI, and tools like Pinecone, FAISS. Optimize performance and ensure responsible AI practices. Deploy via cloud platforms (AWS Bedrock, Azure AI, Google Vertex). Build UIs (Streamlit, Gradio, React) and integrate APIs and databases. Preferred
Cheltenham, England, United Kingdom Hybrid / WFH Options
Eutopia Solutions
Prompt Engineer (AI & Data) – Could suit a Graduate/Junior or Experienced professional Location: Hybrid (within reach of the M4/M5/M50 ideally - my client has offices in West London & the Wye Valley on the Wales/England
Gloucestershire, England, United Kingdom Hybrid / WFH Options
Eutopia Solutions
Prompt Engineer (AI & Data) – Could suit a Graduate/Junior or Experienced professional 📍 Location: Hybrid (within reach of the M4/M5/M50 ideally - my client has offices in West London & the Wye Valley on the Wales/England