ensuring robust versioning, monitoring, and adherence to best practices. Drive the integration of external knowledge bases and retrieval systems to augment LLM capabilities. Research and Development: Effective RAG architectures and technologies for organizing complex domain-specific data (e.g. vector databases, knowledge graphs) and effective knowledge extraction. Explore and benchmark state-of-the-art LLMs, tuning, adaptation, and training … technical projects to successful completion in agile environments. Strong communication skills to align technical solutions with business goals. Ability to mentor and foster innovation within the team. LLM and RAG Expertise: Strong expertise in building Retrieval-Augmented Generation (RAG) architectures and integrating with vector and graph databases. Transformer and LLM Architectures: In-depth experience …
London, South East, England, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
personal and professional development! Requirements: Strong Python scripting skills Strong understanding of LLMs Experience delivering Gen-AI projects Experience with Retrieval-Augmented Generation (RAG) Experience with Microsoft data technologies would be beneficial Experience with Cloud platforms - ideally Azure Strong communication, stakeholder management and problem-solving skills Benefits: Salary of up to around …
with enterprise partners. No two weeks will look the same. Fine-tune and privately deploy LLMs - with a focus on Retrieval-Augmented Generation (RAG) pipelines Build and scale computer vision systems - from object detection to image segmentation Apply NLP to real-world business problems - summarisation, entity recognition, information extraction, and more Train and deploy …
London, South East, England, United Kingdom Hybrid / WFH Options
Clear IT Recruitment Limited
time management skills. Bonus skills • Experience deploying and managing applications with Azure and Docker. • Familiarity with frameworks like LangChain and Retrieval-Augmented Generation (RAG) models for AI-driven applications. • Experience with pandas for data manipulation. What’s on offer • Regular salary reviews recognising performance and contribution. • Generous annual leave: 25 days plus three days …
London, South East, England, United Kingdom Hybrid / WFH Options
Morgan McKinley
third-party services. Develop front-end interfaces using JavaScript and frameworks such as React. Build and deploy Agentic systems and Retrieval-Augmented Generation (RAG) systems with Large Language Models (LLMs). Participate in privacy automation programs, including workflows for user approvals and reviews. Implement and maintain CI/CD pipelines in Jenkins to streamline …
or instruction tuning models for specific tasks Embedding-based augmentation (using vector search to give LLMs access to external knowledge) Retrieval-Augmented Generation (RAG) or similar techniques You've taken messy, real-world data and turned it into useful, structured outputs You can show how your work had impact - whether that's a model …
East London, London, United Kingdom Hybrid / WFH Options
BroadbandUK
Ofcom, Openreach, independent networks and ISPs to uncover problems that directly impact how the UK connects. Advance the use of retrieval-augmented generation (RAG), combining LLMs with vector search, to push the boundaries of how AI can be applied to connectivity challenges. Our stack Backend: PHP (Laravel), Python (for AI/ML workflows) Frontend …
London, South East, England, United Kingdom Hybrid / WFH Options
Salt Search
designing and managing databases in Postgres. AI/ML Expertise: Build and deploy Agentic systems, MCP servers/clients, and Retrieval-Augmented Generation (RAG) systems with Large Language Models (LLMs). Extract, transform, and load (ETL) data to build vector databases with embedding models. DevOps and Maintenance: Implement CI/CD pipelines to streamline …
North London, London, United Kingdom Hybrid / WFH Options
VERTECH GROUP (UK) LTD
experience Solid expertise in Node.js Experience integrating LLM APIs and open-source models into production systems Hands-on work implementing Retrieval-Augmented Generation (RAG) pipelines Experience designing and optimizing agentic AI workflows Tremendous opportunity offering plenty of scope for career progression in a friendly, innovative environment where you'll be able to bring ideas …
and managing databases in Postgres. Optional Qualifications: Built LLM applications and familiar with the concepts of MCP servers, Agents, and Retrieval-Augmented Generation (RAG) systems. Experience with extract, transform, and load (ETL) of data using Python libraries (e.g. Pandas). Strong understanding of version control systems and CI/CD pipelines to streamline development …
Crofton Park, Greater London, UK Hybrid / WFH Options
Randstad Digital
and agentic applications. Are you ready to apply? Make sure you understand all the responsibilities and tasks associated with this role before proceeding. You'll work on LLMs, RAG pipelines, model deployment, and cloud-native solutions that drive real-world impact. What you'll bring: Strong coding skills in Java/TypeScript/Python Experience with LLMs, NLP, and …
Own and deliver the roadmap end to end. Drive discovery to iteration, aligning cross-functional squads and keeping initiatives on time and aligned to strategy. Integrate and optimize LLM, RAG, and tool-calling pipelines for text. Define requirements and run evaluations to optimize our technology stack for text-based agents. Launch new channels into production. Work with clients and deployment …
productionize generative AI models. Develop scalable GenAI pipelines that generate high-quality content, from product descriptions, reviews, titles, and other product content. Design and evaluate prompt tuning strategies and RAG systems to ensure factual and engaging outputs. Fine-tune foundation models and develop domain-specific adapters using techniques like LoRA, PEFT, and instruction tuning. Define best practices for model monitoring …
and liaise closely with downstream teams using the platform to build agentic applications. These applications range from consumer-facing chatbots to intelligent backend systems, and use techniques such as RAG and multimodal input/output. Your team has independence and the platform enables some of the most difficult problems at the intersection of our three-sided marketplace (riders, consumers, and …
NLP/ML/Knowledge Graph/GenAI systems for commercial applications Practical experience with traditional and state-of-the-art NLP methods, Knowledge Graph algorithms, and GenAI (including RAG and agentic frameworks) Experience writing production code Outstanding communication, problem-solving, and analysis skills Excellent coding skills in Python Proficiency with Git, AWS, Azure for remote model development and deployment …
Alexander Mann Solutions - Public Sector Resourcing
across the team to ensure informed decisions are made on the application design and testing approaches. Contributing to technical decision making alongside colleagues, including Generative AI solutions around RAG and prompt engineering. Contributing to the required documentation and Agile project maintenance responsibilities. Helping to design and develop incoming features around sentiment analysis, data ingestion and synthetic audience …
customer problems and technical possibilities. Have a deep understanding of the customer and new technical breakthroughs to truly challenge the future of natural language interfaces. Integrate and optimize LLM, RAG, and tool-calling pipelines for text. Define requirements and run evaluations to optimize our technology stack for text-based agents. Launch new channels into production. Work with clients and deployment …
goals into clear technical execution through strong communication and cross-team alignment. Nice to Have Experience with orchestration frameworks (LangChain, LangGraph) and multi-agent workflows Knowledge of vector databases, RAG pipelines, and lightweight model hosting Ability to design data pipelines and feedback loops for improving AI-driven features Awareness of emerging AI areas such as multimodal, edge AI, or AI …
Sunbury-On-Thames, London, United Kingdom Hybrid / WFH Options
BP Energy
for AI engineering across the enterprise, ensuring alignment with organizational priorities and technology roadmaps. Architect for Scale: Drive architectural decisions for platforms and applications that leverage LLMs, retrieval-augmented generation, AI agents, and unstructured data processing at enterprise scale. Influence & Align: Partner with senior engineering leaders, product managers, business stakeholders, and governance teams to … Expertise in modern software architecture, distributed systems, and cloud infrastructure (AWS, Azure, or GCP). Deep hands-on experience integrating AI/ML systems into production environments, including LLMs, RAG, vector search, and AI agents. Strong ability to communicate technical strategy to executives, engineers, and business partners alike. Proven ability to set and enforce engineering standards for quality, security, and …
City of London, London, England, United Kingdom Hybrid / WFH Options
Ada Meher
Senior Gen AI Engineer – London (Hybrid) - £90-120k Prompt Engineering | RAG Pipelines/Vector DB | OpenAI & LangChain | Flexible Working Ada Meher is currently working with an equity-backed SaaS software vendor in the construction space who are recruiting a Senior AI Engineer with skills in LLM, RAG pipeline builds & fine-tuning and optimisation of prompts & models to join their … Node/TypeScript or similar Experience with relevant technologies such as OpenAI, LangChain/LangGraph, LlamaIndex Experience with Hugging Face and LoRA/QLoRA for fine-tuning Experience with RAG & Vector DBs, e.g. FAISS, Weaviate, Pinecone Any experience of MLOps with MLflow, AWS (SageMaker), CI/CD (GitHub Actions) or similar would be a benefit to an application The employer …
learning and innovation Requirements: Experience with at least one programming language such as Python, TypeScript, React, or C# Familiarity with large language models, APIs, prompt engineering, retrieval-augmented generation, or vector databases Understanding of software deployment pipelines and continuous integration and continuous delivery tools Ability to troubleshoot and resolve AI-related issues. Experience …
South West London, London, United Kingdom Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language …
London, South East England, United Kingdom Hybrid / WFH Options
Purview Consultancy Services Ltd
distributed systems, and enterprise architecture Experience with Claude Code for agentic coding and AI-powered development Proven track record in financial services or regulatory compliance environments Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP) Proven ability to architect and scale … frameworks using LangGraph, LangMem, and custom agent orchestration Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) Architect Small Language …