Retrieval-Augmented Generation Jobs in the UK

101 to 125 of 125 Retrieval-Augmented Generation Jobs in the UK

Full Stack AI Engineer

LS22, Wetherby, City and Borough of Leeds, West Yorkshire, United Kingdom
Handshaik
its business milestones. Responsibilities (including but not limited to): Backend APIs (Python/FastAPI): Build reliable, secure services that power AI features and data retrieval at scale. RAG & vector search: Design, implement and iterate retrieval pipelines (chunking, embeddings, hybrid search, ranking, feedback loops). Own pgvector/Vector DB schemas, latency, relevance and cost. LLM integration … hardworking and dedicated, with an entrepreneurial/ownership mindset, strong communication skills and a team player 5+ years of professional experience in full-stack development. Hands-on experience with RAG systems, vector databases (pgvector/FAISS/Weaviate/ES k-NN), embeddings, and hybrid search (BM25 + vectors). Strong grasp of chunking strategies, metadata, indexing, recall/precision More ❯
Employment Type: Permanent
Posted:

Director of AI

Manchester, Lancashire, England, United Kingdom
The Portfolio Group
strategic roadmap to hands-on implementation? Join a fast-scaling, international SaaS company that's transforming its industry through relentless innovation, advanced product development and investment in next-generation AI solutions. This is a rare, high-impact opportunity to define and drive the end-to-end AI agenda of a multi-award-winning business backed by a world … roadmap to technical architecture, delivery, optimisation, and governance. Build and lead cross-functional AI teams, ensuring alignment between technical execution and strategic business goals. Evaluate emerging technologies (e.g. LLMs, RAG, vector search, knowledge graphs) and make evidence-based recommendations to stakeholders. Establish best practices for responsible AI development, including risk management, compliance, and explainability. Partner with senior leadership to integrate … in AI, ML, Data Science, Computer Science or a related STEM field. Demonstrated hands-on expertise in building and deploying advanced ML and Generative AI models in production (including RAG Architecture) Deep technical proficiency with LLMs, NLP, Python, SQL, and major AI/ML frameworks (e.g., PyTorch, TensorFlow). Strong understanding of AI engineering fundamentals including DevOps, CI/CD More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

Director of AI

Manchester, United Kingdom
The Portfolio Group
strategic roadmap to hands-on implementation? Join a fast-scaling, international SaaS company that's transforming its industry through relentless innovation, advanced product development and investment in next-generation AI solutions. This is a rare, high-impact opportunity to define and drive the end-to-end AI agenda of a multi-award-winning business backed by a world … roadmap to technical architecture, delivery, optimisation, and governance. Build and lead cross-functional AI teams, ensuring alignment between technical execution and strategic business goals. Evaluate emerging technologies (e.g. LLMs, RAG, vector search, knowledge graphs) and make evidence-based recommendations to stakeholders. Establish best practices for responsible AI development, including risk management, compliance, and explainability. Partner with senior leadership to integrate … in AI, ML, Data Science, Computer Science or a related STEM field. Demonstrated hands-on expertise in building and deploying advanced ML and Generative AI models in production (including RAG Architecture) Deep technical proficiency with LLMs, NLP, Python, SQL, and major AI/ML frameworks (e.g., PyTorch, TensorFlow). Strong understanding of AI engineering fundamentals including DevOps, CI/CD More ❯
Employment Type: Permanent
Posted:

AI Engineer

United Kingdom
Hybrid / WFH Options
WebLife Labs
that form the core of our user-facing features. Your mission will be to translate complex business problems into cutting-edge AI solutions, including sophisticated LLM-powered agents, robust RAG systems for data interaction, and scalable, observable AI workflows. Working closely with product and data engineering, you will own the end-to-end lifecycle of AI model development and deployment … and reliability through comprehensive logging, tracing, and performance monitoring. Agentic System Development Build and deploy LLM and agent-based systems for customer interaction, product enrichment, and decision-making. Implement RAG pipelines using vector databases for context-aware responses. Apply prompt engineering, fine-tuning, and model optimization techniques to improve accuracy and efficiency. Develop conversational interfaces and tools that enable safe More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Legal Content AI Engineer

London, UK
Hybrid / WFH Options
Disability Solutions
learning and innovation Requirements: Experience with at least one programming language such as Python, TypeScript, React, or C sharp Familiarity with large language models, APIs, prompt engineering, retrieval-augmented generation, or vector databases Understanding of software deployment pipelines and continuous integration and continuous delivery tools Ability to troubleshoot and resolve AI-related issues. Experience More ❯
Posted:

Full Stack AI Software Engineer

London, South East, England, United Kingdom
Ada Meher
Full Stack AI Software Engineer - Full Remote UK - £90,000 + Equity This role requires a software engineer with experience in implementing RAG pipelines and Vector Search (and hybrid AI searches, preferably). The client I am working with is an AI focused start-up backed by a £1.7M pre-seed investment. They are on a mission to streamline the … an early stage. What you'll work on: Backend APIs (Python/FastAPI): Build and maintain secure, high-performance services that drive AI features and data access at scale. RAG & vector search: Design and improve retrieval pipelines (embeddings, chunking, hybrid search, ranking, feedback loops), owning schema design, latency, and relevance across vector databases. LLM integration: Connect and orchestrate … AI development. Requirements: A motivated, hands-on engineer with an ownership mindset, strong communication skills, and a collaborative approach. 5+ years’ experience in full-stack development. Strong background in RAG systems , vector databases (pgvector, FAISS, Weaviate, Elasticsearch k-NN), embeddings, and hybrid search methods. Practical knowledge of chunking strategies, indexing, precision/recall trade-offs, reranking, and evaluation techniques. Proficient More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

AI Automation & Solutions Engineer

Sevenoaks, Kent, England, United Kingdom
Hybrid / WFH Options
Searchability
BE DOING Build intelligent workflows using n8n, Microsoft Power Automate, Flowable, and similar tools. Design and implement agentic AI solutions, integrating LLMs and frameworks such as LangChain, AutoGen, and RAG pipelines with platforms like OpenAI. Collaborate with teams across the business to identify automation opportunities and enhance efficiency. Contribute to the company's AI and workflow automation architecture, ensuring scalability … Engineer - Essential Skills Hands-on experience with workflow automation tools such as n8n, Microsoft Power Automate, or Flowable. Strong understanding of LLM integration and agentic AI frameworks (LangChain, AutoGen, RAG). Proficiency in Python or JavaScript for scripting and workflow logic. Familiarity with APIs, event-driven architectures, and cloud platforms (Azure, AWS, or GCP). TO BE CONSIDERED... Please either More ❯
Employment Type: Full-Time
Salary: £45,000 - £75,000 per annum
Posted:

Senior Analytics Developer

London, UK
Disability Solutions
Senior Analytics Developer Are you ready to elevate your analytics career to new heights? Would you thrive in a dynamic environment, developing solutions that drive insights and innovation? About Team: The role will report to the Analytics Manager. You will More ❯
Posted:

AI Architect

South West London, London, United Kingdom
Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language More ❯
Employment Type: Contract
Posted:

AI Architect

london, south east england, united kingdom
Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language More ❯
Posted:

AI Architect

south west london, south east england, united kingdom
Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language More ❯
Posted:

AI Architect

London, UK
Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language More ❯
Posted:

Senior Full Stack & AI Automation Engineer

London, South East, England, United Kingdom
Salt Search
Team, you'll design and build next-generation automation platforms, blending Python, React, Postgres, and modern DevOps practices with cutting-edge AI/ML techniques such as RAG, vector databases, and multi-agent systems. You'll work across the full software lifecycle: from system design and test automation, to deployment and performance monitoring, collaborating with some of the … in the industry. What You'll Work On Designing and developing automation solutions aligned with Apple's product requirements. Building agentic AI/ML systems (MCP servers/clients, RAG pipelines, vector DBs). Developing both front-end (React, JS) and back-end (Python, APIs, Postgres) features. Driving test automation with Selenium and iOS functional frameworks. Implementing CI/CD … experience with Selenium and iOS functional automation. Strong React/JavaScript UI development background. Back-end engineering with Python + Postgres. AI/ML expertise in LLMs, embeddings, and RAG systems. Experience with ETL pipelines, vector databases, and modern DevOps (Jenkins). Bonus points for: Multi-agent system design. Graph database experience. Familiarity with Charles Proxy, Git, Jenkins. Role Details More ❯
Employment Type: Contractor
Rate: Salary negotiable
Posted:

Digital Product Owner - AI

Manchester, North West, United Kingdom
Travel Counsellors
independent travel entrepreneurs worldwide, weve always believed in the power of personal service. Now, were using AI to scale care, creativity, and connection like never before. Our next-generation travel platform is being transformed by intelligent features, with TC Co-Pilot at the heart of it. As AI Product Owner, youll lead the charge in embedding smart systems … e-commerce, or hospitality Conversational interfaces, semantic search, or recommendation engines Ethical AI frameworks, GDPR, and trust-by-design principles Building AI features using tools like LangChain, Pinecone, or RAG architectures More ❯
Employment Type: Contract
Posted:

Head of Data Science

Birmingham, West Midlands, England, United Kingdom
Hybrid / WFH Options
Robert Half
how data science drives commercial outcomes. Establish an AI Centre of Excellence to embed data-driven thinking across all functions. Deliver enterprise-scale GenAI solutions (e.g. copilots, virtual assistants, RAG systems) and lead classical ML initiatives. Define governance and ethical standards for responsible AI adoption. What We're Looking For Extensive senior leadership experience leading large, enterprise data science functions. … A strong track record of delivering production-grade AI/ML solutions and driving adoption at scale. Deep expertise in GenAI (LLMs, prompt engineering, RAG) as well as classical ML and experimentation. Hands-on knowledge of MLOps, cloud (AWS/Azure), and modern data science tools. Exceptional stakeholder skills - able to translate complex technical work into board-level strategy and More ❯
Employment Type: Full-Time
Salary: £80,000 - £95,000 per annum
Posted:

Software Engineer - Applied ML (UK/EU)

London, United Kingdom
Cohere
serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Lead

South West London, London, United Kingdom
Hybrid / WFH Options
Purview Consultancy Services Ltd
Role: AI Lead Location: London, UK Hybrid: 3 days a week from office JD: AI Lead to drive the development and deployment of next-generation agentic AI solutions using Azure OpenAI GPT-5, LangGraph frameworks, and intelligent document processing. Lead technical workstreams in building production-ready AI systems for financial automation with hands-on development approach. Required Qualifications … Agent evals, DeepEval) Key Responsibilities Design and develop agentic AI applications using LangGraph and LangMem frameworks Build intelligent document processing pipelines using LlamaParse and Azure Document Intelligence Implement advanced RAG systems with text-embedding-3-large and Azure DB for Postgres Lead hands-on development using Claude Code for rapid agentic workflow creation Establish AI observability and monitoring using Arize More ❯
Employment Type: Contract
Posted:

AI Engineer

London, United Kingdom
83zero Ltd
AI Engineer - Defence RAG Systems ( Security Clearance Essential ) Clearance: Active SC Essential | Sector: Defence Role Overview Defence client requires an SC Cleared AI Engineer to build fully on-premises RAG systems using open-source technologies. You'll develop classified AI capabilities on air-gapped infrastructure with zero external dependencies. Key Responsibilities - Build end-to-end RAG pipelines on isolated defence … if required - Demonstrable experience deploying open-source LLMs (Llama, Mistral, Falcon) on-premises - Expertise with local vector databases (Chroma, FAISS, Weaviate) in offline deployments - Strong vLLM/Text Generation Inference experience for high-throughput model serving - Proven ability to work on air-gapped systems with no external package repositories - Experience with GPU orchestration (NVIDIA A100/H100) and More ❯
Employment Type: Contract
Rate: £625 - £650/day
Posted:

AI Engineer

London, United Kingdom
83zero Ltd
AI Engineer - Defence RAG Systems ( Security Clearance Essential ) On Site 2 X Days a week, Plymouth Clearance: Active SC Essential | Sector: Defence Role Overview Defence client requires an SC Cleared AI Engineer to build fully on-premises RAG systems using open-source technologies. You'll develop classified AI capabilities on air-gapped infrastructure with zero external dependencies. Key Responsibilities - Build … end-to-end RAG pipelines on isolated defence networks using open-source LLMs (Llama 3, Mistral, Qwen) - Deploy local vector stores (Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for … if required - Demonstrable experience deploying open-source LLMs (Llama, Mistral, Falcon) on-premises - Expertise with local vector databases (Chroma, FAISS, Weaviate) in offline deployments - Strong vLLM/Text Generation Inference experience for high-throughput model serving - Proven ability to work on air-gapped systems with no external package repositories - Experience with GPU orchestration (NVIDIA A100/H100) and More ❯
Employment Type: Permanent
Salary: £75000 - £80000/annum
Posted:

AI Engineer ( SC CLEARANCE )

Plymouth, Devon, United Kingdom
83zero Ltd
AI Engineer - Defence RAG Systems ( Security Clearance Essential ) On Site 2 X Days a week Plymouth Clearance: Active SC Essential | Sector: Defence Role Overview Defence client requires an SC Cleared AI Engineer to build fully on-premises RAG systems using open-source technologies. You'll develop classified AI capabilities on air-gapped infrastructure with zero external dependencies. Key Responsibilities - Build … end-to-end RAG pipelines on isolated defence networks using open-source LLMs (Llama 3, Mistral, Qwen) - Deploy local vector stores (Chroma, FAISS, Milvus) with sensitive document ingestion pipelines - Host and optimise LLMs using vLLM/TGI on local GPU clusters without internet connectivity - Implement agent orchestration using LangChain/LangGraph in completely offline environments - Design secure document processing for … if required - Demonstrable experience deploying open-source LLMs (Llama, Mistral, Falcon) on-premises - Expertise with local vector databases (Chroma, FAISS, Weaviate) in offline deployments - Strong vLLM/Text Generation Inference experience for high-throughput model serving - Proven ability to work on air-gapped systems with no external package repositories - Experience with GPU orchestration (NVIDIA A100/H100) and More ❯
Employment Type: Permanent
Salary: £75000 - £80000/annum
Posted:

Principal Cloud/AI Engineer

Cambridge, Cambridgeshire, United Kingdom
So Code Limited
next-generation applications. What you'll work on Designing and scaling cloud-native services on Azure Building secure, developer-friendly APIs & SDKs Working hands-on with LLMs, RAG, and AI orchestration tools Deploying and training AI models in real-world environments Collaborating with a cross-functional team in a start-up style setup What you'll bring Strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Member of Technical Staff, Training Performance Engineer

London, United Kingdom
Cohere
serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to … will work on identifying and removing performance bottlenecks, develop cutting-edge training and profiling tools to help Cohere's mission of providing efficient and reliable language understanding and generation capabilities and drive innovation in the field of natural language processing. Please Note: We have offices in London, Toronto, San Francisco, New York but also embrace being remote-friendly More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Research Scientist, Cohere Labs

London, United Kingdom
Cohere
serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to … Have solid machine learning and NLP fundamentals, with experience in shaping and executing original research. Enjoy collaborating across disciplines and geographies. Care about mentoring and supporting the next generation of researchers. At Cohere Labs, we don't just work on today's problems - we're building foundations for the future of AI research. If you want to explore More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Prompt Engineer AI & Data - Could suit Graduate / Junior or Experienced

Gloucester, Gloucestershire, England, United Kingdom
Eutopia Solutions ltd
Experience working with LLMs and prompt design (ideally OpenAI or similar) An understanding of structured data (SQL, JSON, CSV) and unstructured text (PDFs Exposure to RAG architectures and vector stores (such as FAISS, Pinecone...) Prompt Engineer (AI & Data) – Could suit a Graduate/Junior or Experienced professional Location: Remote (M4/M5/M50 area preferred but flexible) Salary … the join the R&D team and design, test and refine prompts , clean and map unstructured text and structured fields into usable metadata and semantic indexes for use in RAG systems and collaborate with product and design teams to embed LLM-powered assistants into customer facing healthcare support tools. What You'll Do Work across LLM design, structured data integration … Bring Experience working with LLMs and prompt design (ideally OpenAI or similar) An understanding of structured data (SQL, JSON, CSV) and unstructured text (PDFs, documents, form data) Exposure to RAG architectures and vector stores (such as FAISS, Pinecone.....) The ability to work with and communicate with multi-disciplinary teams at all levels of an organization Any experience in a professional More ❯
Employment Type: Full-Time
Salary: £30,000 - £60,000 per annum
Posted:

AI Solutions Engineer

London, South East, England, United Kingdom
Hays Specialist Recruitment Limited
AI platform. We're building a new team in London and hiring for a Senior Forward Deployed Engineer to help enterprise clients unlock the full power of LLM + RAG technology What you'll do Work directly with enterprise clients to turn business needs into live AI use cases Lead proof of concepts (pre-sales) and guide customers through onboarding … directly with enterprise customers (consulting, workshops, exec comms) Development skills are desirable (Python/JavaScript/TypeScript) for prototypes and integrations Strong understanding of AI/LLM concepts (prompting, RAG) Track record in digital experience or enterprise SaaS Highly desirable but not a showstopper A significant background in the Digital Experience Platform (DXP) On offer OTE of circa 115k, plus More ❯
Employment Type: Full-Time
Salary: £90,000 - £100,000 per annum, Negotiable, OTE
Posted:
Retrieval-Augmented Generation
10th Percentile
£51,250
25th Percentile
£62,500
Median
£75,000
75th Percentile
£100,000
90th Percentile
£122,500