Retrieval-Augmented Generation Jobs in East London

17 of 17 Retrieval-Augmented Generation Jobs in East London

Generative AI Engineer

South East London, England, United Kingdom
Bayezian
architecture of LLMs. Foundational knowledge of diffusion models for image generation. Can display and present completed project/s using LLMs with a focus on any of the following: RAG, Agentic-RAG, fine-tuning. Some experience or familiarity with deploying applications in the Cloud using services such as AWS or Azure. Proven track record in securing web/API applications.

Senior Software Engineer

South East London, England, United Kingdom
Stealth AI Startup
and US investors. Our founders have delivered cutting-edge AI at world-class research labs and high-growth technology companies. Now, operating in stealth, we apply next-generation agentic AI to overhaul mission-critical enterprise workflows that still depend on error-prone, manual processes. Our vision is to bring these high-value operations into the modern era … event buses (Kafka, Pulsar). Wrangle large, heterogeneous data sets: model, transform, and index multi-modal, multi-terabyte enterprise datasets for advanced AI workloads. Develop enterprise-level next-generation AI systems with the support of our AI specialists. Ship complete customer features - from architecture and code to CI/CD, infra-as-code (Terraform), rollout, and user training. … contract. Thrive in an early-stage, high-ownership environment: prototype today, deploy tomorrow, iterate next week. Bonus Points: Experience deploying or consuming LLM-powered services (OpenAI, open-source models, RAG, vector stores) can be a bonus. However, we consider many great candidates without previous AI experience. What we're offering: Base salary from £115,000 - £135,000. .. plus meaningful

GCP AI Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Anson McCade
varied use cases. Build agentic workflows and reasoning pipelines using frameworks such as LangChain, LangGraph, CrewAI, Autogen, and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term … cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will directly shape how next-generation systems interact, reason, and assist.

Founding Backend Engineer – AI/ML

South East London, England, United Kingdom
Harper Russo
healthcare and cutting-edge LLM technology, shipping fast and solving meaningful problems every day. What You’ll Own: Architect and develop backend microservices (Python/FastAPI) that power our RAG pipelines and analytics. Build scalable infrastructure for retrieval and vector search (PGVector, Pinecone, Weaviate). Design evaluation frameworks to improve search accuracy and reduce hallucinations. Deploy and manage services … LlamaIndex. What We’re Looking For: 5+ years building production-grade backend systems (preferably in Python). Strong background in search, recommender systems, or ML infrastructure at scale. Experience with RAG architectures, embeddings, and vector search. Confidence working across GCP (or AWS/Azure) and infrastructure-as-code. Familiarity with observability, performance tuning, and secure data practices. A growth mindset, startup

Artificial Intelligence Engineer

South East London, England, United Kingdom
Explore Group
clinicians to make faster, evidence-based decisions at the bedside. We’re developing a cutting-edge platform that combines LLMs, retrieval-augmented generation (RAG), and vector search infrastructure to deliver real-time clinical insights. As the Founding Backend Engineer, you’ll play a critical role in shaping our backend systems, architecture, and engineering culture … from the ground up. What You’ll Do: Build scalable backend microservices in Python (FastAPI) to support RAG workflows and user queries. Develop and optimise vector search pipelines using tools like PGVector, Pinecone, or Weaviate. Design embedding orchestration and hybrid retrieval mechanisms. Implement evaluation frameworks (BLEU, ROUGE, hallucination checks) to monitor answer quality. Deploy production systems on GCP

Full Stack Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Futuria
infrastructure. Working knowledge of Kubernetes, security best practices, and cloud platforms (AWS, GCP, or Azure). Desirable: Experience with prompt engineering, Retrieval-Augmented Generation (RAG), and graph databases. Familiarity with multi-agent LLM systems and agentic platforms (e.g., AutoGen, CrewAI), and experience deploying LLM-based applications. Experience with tools such as LangChain, LangSmith, or Chainlit

Software Architect

South East London, England, United Kingdom
Zensar Technologies
integrated into enterprise applications to enhance user experience, decision-making, and automation. Exposure to modern AI application patterns such as: Retrieval-Augmented Generation (RAG) for augmenting LLMs with domain-specific knowledge. Prompt engineering and fine-tuning for tailoring model behavior to business-specific contexts. Use of embedding stores and vector databases (e.g., Pinecone, Redis

Data Scientist

South East London, England, United Kingdom
Hybrid / WFH Options
Albert Bow
and maintaining long-lived systems. Bonus Points for: Familiarity with financial services or private equity environments. Experience with Azure, Postgres, Streamlit, or similar tools. Practical experience with document-based RAG, Q&A, or fact extraction tasks. Thriving in small teams with fast feedback and iteration loops. Please apply to learn more.

Machine Learning Engineer | £50k–£70k + Equity | Remote (UK)

East London, London, United Kingdom
Hybrid / WFH Options
Tellme
object detection (e.g. MobileNet, YOLO). Either way, we’re looking for someone who can help our app understand what the visitor is looking at – reliably and at scale. RAG Systems, Data Pipelines & Internal Agents: You'll design the data pipelines that power our AI features, including retrieval-augmented generation (RAG), internal LLM-based

Machine Learning Engineer – Founding Team (Computer Vision / GenAI)

South East London, England, United Kingdom
Hybrid / WFH Options
Brio Digital
time using image embeddings, similarity search (e.g. CLIP, vector search), and traditional CV approaches (e.g. YOLO, MobileNet). LLM & RAG Systems: Design and implement pipelines that support retrieval-augmented generation, internal AI tools, and scalable content delivery. Experience with vector databases, agent frameworks, or data workflows is highly relevant. Deployment & MLOps: Own model deployment

Machine Learning Engineer | £50k–£70k + Equity | Remote (UK)

South East London, England, United Kingdom
Hybrid / WFH Options
Tellme
object detection (e.g. MobileNet, YOLO). Either way, we’re looking for someone who can help our app understand what the visitor is looking at – reliably and at scale. RAG Systems, Data Pipelines & Internal Agents: You'll design the data pipelines that power our AI features, including retrieval-augmented generation (RAG), internal LLM-based

Software Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Uniting Cloud
Engineer The role is building AI-based automation into back-office administration tasks. You'll be working with GenAI and Agentic AI, lots of AWS, NodeJS, Python, HuggingFace, LangChain, RAG techniques, interfacing with diverse data sets. The opportunity: Work at the forefront of the industry. It's exciting, competitive, fast-paced and challenging of course! You'll have the support

Senior Artificial Intelligence Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
MBN Solutions
Vision/NLP. Strong Software Engineering skills (3+ years). Developed LLM architecture and deployed LLM applications. Up to date with current trends in AI. Some experience applying the latest techniques such as RAG architecture, GenAI, parallel training, etc. The role is hybrid, with ad hoc requirements to be on client premises (London); this could be between 1-5 days a week, so we would

Software Engineer

South East London, England, United Kingdom
Seer
AI Research Engineer to help pioneer next-generation language model systems at the frontier of applied AI. In this role, you’ll help build foundational agent and RAG infrastructure, shape internal research initiatives, and accelerate the delivery of LLM-powered features to end users. You’ll collaborate cross-functionally with engineering and product teams to experiment, evaluate, and … System Development: Research, prototype, and build systems powered by large language models, focusing on reliability, efficiency, and relevance. Agent & RAG Architectures: Design and refine agentic workflows and retrieval-augmented generation pipelines that improve performance, accuracy, and cost-efficiency. Evaluation & Alignment: Develop metrics and tools to measure model performance, groundedness, and behaviour; explore fine-tuning

AI Fullstack Product Engineers - Userled - 80-100K + Bens + Equity - London Hybrid (3 days in the of

South East London, England, United Kingdom
Hybrid / WFH Options
Userled
teams reach high-intent buyers through hyper-personalised, multi-channel campaigns. We’re building an AI-first platform from the ground up, with LLMs powering everything from content generation to strategic recommendations. We’re looking for hands-on AI Fullstack Product Engineers who are excited to build, ship, and fully own production-grade AI features. You’ll have … ll be responsible for architecting and shipping AI-powered features using tools like PydanticAI, LangGraph, FastAPI, and OpenAI/Anthropic APIs. You’ll build and orchestrate intelligent agents and RAG pipelines, moving quickly to prototype, test, and launch in days, not months. Working closely with product, design, and go-to-market teams, you’ll build features that solve real customer

Artificial Intelligence Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Franklin Bates
platform like Kubeflow. Demonstrated ability to transition models from prototype to production. Experience assessing various AI/ML technologies and models for fit to problem space, including scenarios where RAG is applicable. Incident response experience, and ability to work with large, noisy, and rapidly evolving threat datasets. Strong background in cloud engineering and containerisation (Docker, Kubernetes) with experience deploying AI

Technical Co-Founder (AI Startup)

South East London, England, United Kingdom
Stealth Startup
our agent framework. Lead engineering team post-seed. Shape product vision and strategy. Qualifications: Insatiable desire to build the future & create highly impactful products. Deep expertise in LLMs, RAG, and production ML. Experience with agent architectures and fine-tuning. Strong API integration skills. Builder mentality - you ship fast and iterate. Previous startup experience preferred. Excited to solve hard technical