1 to 25 of 37 Permanent Retrieval-Augmented Generation Jobs in Central London

Principle AI / ML Engineer

Hiring Organisation
Prolo
Location
City of London, London, United Kingdom
Agentic AI & LLM Engineering Agentic Workflow Design: Develop multi-agent systems using frameworks like LangGraph or CrewAI for autonomous research and automated coding. Advanced RAG & Memory: Build sophisticated Retrieval-Augmented Generation (RAG) pipelines for long-term model memory. Model Optimization: Fine-tune open-source … with AWS Bedrock/SageMaker; Docker/Kubernetes for scaling agent instances. Proven experience building Retrieval-Augmented Generation (RAG) systems; Graph-RAG experience is a plus. Practical experience with model adaptation techniques, including prompting, fine-tuning, instruction tuning, or knowledge distillation. Solid understanding ...

GenAI Architect

Hiring Organisation
HCLTech
Location
City of London, London, United Kingdom
knowledge graphs, and episodic/audit memory. Ensure coherent retrieval strategies across layers. Retrieval-Augmented Generation (RAG) and CAG (Cache Augmented) Architecture : Define architectural patterns for end-to-end RAG pipelines, including chunking, embedding, vector search (e.g., Azure Cognitive Search … least one other. • GenAI & LLM Depth : Demonstrated experience architecting and guiding solutions using GenAI platforms (e.g., Azure OpenAI, Vertex AI, or AWS Bedrock). • RAG & Orchestration : Proven experience designing complex RAG pipelines. • Model Fine-tuning : Experience with instruction tuning or fine-tuning strategies for LLMs. • Leadership & Advisory Skills : Exceptional communication ...

GenAI Full Stack Engineer - Consultant / Senior Consultant

Hiring Organisation
83zero Limited
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£70,000
with the team to develop GenAI proof-of-concepts (POCs) for clients using technologies like Retrieval-Augmented Generation (RAG) and intelligent agents. Scale existing POCs to production-ready solutions for customer use. Design and develop Full Stack applications for both GenAI and non-GenAI ...

Lead GenAI Full Stack Engineer / Managing Consultant

Hiring Organisation
83zero Limited
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£85,000
with the team to develop GenAI proof-of-concepts (POCs) for clients using technologies like Retrieval-Augmented Generation (RAG) and intelligent agents. Scale existing POCs to production-ready solutions for customer use. Design and develop Full Stack applications for both GenAI and non-GenAI ...

Senior AI Scientist

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
help drive innovation across cutting-edge AI initiatives — from chatbots and voice assistants to advanced retrieval-augmented generation (RAG) systems and agentic workflows. The Role You’ll work closely with the AI Engineering and Data Science teams to: Develop and prototype AI-driven solutions … across customer-facing and internal applications. Build and optimise LLM-based assistants , RAG pipelines , and agentic AI workflows . Collaborate on the architecture and deployment of scalable AI solutions (with support from engineering). Partner with stakeholders to translate business needs into practical, intelligent systems. Mentor junior team members ...

Staff AI Engineer - GenAI (Dubai based)

Hiring Organisation
oryxsearch.io
Location
City of London, London, United Kingdom
scale machine learning algorithms and pipelines to production using big data technologies. Develop and deploy retrieval-augmented generation (RAG) systems and LLM-based applications. Design and evaluate A/B tests and communicate results across cross-functional teams. Define, implement, and monitor key performance ...

LLM Engineer

Hiring Organisation
Ultralytics
Location
City of London, London, United Kingdom
managing multi-API workflows using tools like LiteLLM to ensure flexibility and resilience. Building sophisticated Retrieval-Augmented Generation (RAG) systems, leveraging advanced techniques like embeddings with Voyage AI, rerankers , and query enrichment . Designing and maintaining efficient data pipelines and vector storage solutions using … PyTorch. Proven experience building and deploying applications with LLM APIs such as OpenAI , Anthropic , Gemini , and DeepSeek . Hands-on experience with the full RAG pipeline, including vector embeddings , rerankers , and data indexing in databases like MongoDB. Practical knowledge of LLM fine-tuning, prompt engineering, and performance optimization. Familiarity with ...

Staff Software Engineer (Fullstack / Remote UK)

Hiring Organisation
TalentCo
Location
Central London / West End, London, United Kingdom
intuitive user experiences. Work closely with AI researchers and ML engineers to integrate LLMs, Retrieval-Augmented Generation (RAG), and automation into production-ready applications. Ship robust, minimal-dependency code that performs efficiently in enterprise environments. Continuously iterate and refine AI-driven products, balancing user ...

Server Operation Engineer

Hiring Organisation
Centific
Location
City of London, London, United Kingdom
create high-quality pre-trained datasets, fine-tuned industry-specific Large Language Models(LLMs), and Retrieval-Augmented Generation (RAG) pipelines supported by vector databases. Our innovations can reduce Generative Artificial Intelligence(Gen AI) costs by up to 80% and bring Gen AI solutions ...

AI Engineer

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
fundamentals across the full lifecycle. Hands‐on experience with ML frameworks (scikit‐learn, XGBoost, PyTorch, TensorFlow, LightGBM). Strong exposure to LLMs, prompt engineering, RAG, and evaluation frameworks. Familiarity with MLOps tooling: CI/CD, experiment tracking, model registries, monitoring, containers. Please note: This role cannot provide VISA sponsorship . ...

Manager of Artificial Intelligence

Hiring Organisation
Fimador
Location
City of London, London, United Kingdom
LangChain or Semantic Kernel. Solid understanding of ML concepts (supervised/unsupervised learning, transformers, CNNs/RNNs, model evaluation). Experience with prompt engineering, RAG pipelines, and model fine-tuning. The judgement to identify where AI adds value — and where it doesn’t. Experience deploying and operationalising LLMs (exposure ...

AI Engineer

Hiring Organisation
trg.recruitment
Location
City of London, London, United Kingdom
Role 2: Applied AI Engineer Strong backend engineering experience (Python preferred) Deep knowledge of CI/CD, Docker, Kubernetes, and Git workflows Experience building RAG pipelines and AI agent orchestration systems Minimum 5 years of experience 📩 Interested? Message me here or email mmatysik@trg-uk.com ...

Solution Engineer

Hiring Organisation
Anson Mccade
Location
Central London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£65,000
both technical and senior stakeholders. Comfortable working across multiple teams and translating business needs into technical solutions. Desirable Experience: Exposure to Generative AI, LLMs, RAG techniques, or related frameworks. Experience with Microsoft Power Platform technologies. Front-end development using modern frameworks such as React, Angular, or Vue. Awareness of Edge ...

Senior Platform Engineer

Hiring Organisation
Harrington Starr
Location
City of London, London, United Kingdom
architecture, frontend/backend patterns, and platform onboarding Mentoring engineers and driving best practices across teams Integrating modern AI tooling (LLMs, vector databases, RAG workflows) into internal platforms What we’re looking for 6+ years experience as a software or platform engineer in complex environments Strong Python engineering background (compiled ...

Artificial Intelligence Engineer

Hiring Organisation
Space Executive
Location
City of London, London, United Kingdom
autonomous AI What You’ll Need: A strong foundation in data science and machine learning Hands-on use of modern AI tools (LLMs, RAG, LangChain, co-pilots, agentic workflows) Curiosity and eagerness to learn in a fast-moving AI landscape Experience collaborating with stakeholders and translating business problems into technical ...

Founding Engineer

Hiring Organisation
Squirrel
Location
City of London, London, United Kingdom
Next.js, React, TypeScript, Python, FastAPI, Postgres, Redis Experience with cloud infrastructure (AWS, GCP) Exposure to or experience with foundation models or Generative AI (LLMs, RAG, fine-tuning, agents, evals, AI IDE, etc.) Experience working directly with users to iterate on products Benefits Early equity in a YC-backed, high-growth ...

Product Engineer

Hiring Organisation
Affinity Labs
Location
City of London, London, United Kingdom
unified decisions. Own operability: CI/CD, monitoring/alerting, performance optimisation, safe rollbacks. Build and integrate AI features (LLM/agent workflows, RAG/vector DB integration, eval hooks, cost/safety guardrails). Help evolve our reusable product templates and shared architecture across the portfolio. Essential Requirements ...

Research Engineer

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
shipping reliable systems used by real customers. Key responsibilities Build and deploy LLM-powered features across drafting, prosecution, and litigation workflows Design and maintain RAG pipelines over large, complex legal and technical corpora Own systems from prototype through to production Focus on robustness, evaluation, and reliability in precision-critical ...

AI Engineer

Hiring Organisation
Ethiq
Location
City of London, London, United Kingdom
logic that allows an LLM to detect its own hallucinations in a workflow and force a re-plan based on ingested technical documentation. Shave RAG query times down from "noticeable lag" to "instant" across massive, unstructured enterprise datasets. Why a Senior Engineer would join this team You’ll work directly ...

AI Architect - Consulting Innovation Team

Hiring Organisation
Harnham
Location
City of London, London, United Kingdom
Details • Salary: £80k–£100k base (flexible for strong profiles) • Working model: Hybrid, 3 days per week in London office • Tech stack: Python, JavaScript, GenAI (RAG, LangChain, LlamaIndex), agentic systems, Azure/AWS • Visa: No sponsorship available Interested? Please apply below. ...

AI Engineer

Hiring Organisation
Bloc Recruitment
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
assessments, and delight users Work with industry experts to turn messy claim processes into AI-driven workflows Deploy solutions using LLMs, LangChain, and RAG at scale What we're looking for 2+ years in AI/ML engineering or research Proven track record taking projects from idea → production → impact Bias ...

Lead Software Engineer

Hiring Organisation
Dex
Location
City of London, London, United Kingdom
development, along with experience in Docker , PostgreSQL , or DSL/Compilers . Professional or enthusiastic personal experience with LLMs and their ecosystems (e.g., Langchain, RAG, or observability tooling). You are a "creative experimentalist" who can balance the need for speed with long-term engineering maintenance. A commitment to building ...

Senior Software Delivery & Quality Manager - Generative AI

Hiring Organisation
The Portfolio Group
Location
City of London, London, Castle Baynard, United Kingdom
Employment Type
Permanent
ambiguity, managing risk, and maintaining delivery momentum. Exposure to regulated or high-trust domains (Legal, HR, Finance, Healthcare) strongly preferred. Familiarity with Generative AI, RAG, or ML systems advantageous. Why Join? You'll play a pivotal role in ensuring Generative AI systems are delivered coherently, responsibly, and at scale-working ...

AI Engineer

Hiring Organisation
Granola
Location
City of London, London, United Kingdom
production (using e.g. OpenAI, Anthropic, Google, etc.) Proficiency with LLM infra platforms (prompt management, logging/tracing, evals) Experience designing large-context LLM systems (RAG, knowledge graphs, hybrid search, memory) Building features end-to-end with TypeScript, React.js, and Node.js As a person, you... Are first and foremost a builder. ...

Forward Deployed Engineer - SC Cleared

Hiring Organisation
Burns Sheehan
Location
City of London, London, United Kingdom
similar engineering languages is a must Experience working with APIs, cloud infrastructure and data processing. Previous experience within AI tech such as LLMs, RAG, embedding, vector databases is a big plus as is any previous experience within an AI, Data or B2B SaaS company. This is a great time ...