1 to 25 of 270 Retrieval-Augmented Generation Jobs in London

AI Engineer (RAG) - Contract - 6 months - London - Outside IR35

Hiring Organisation: Robson Bale Ltd
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: GBP Daily

Engineer (RAG) - Contract - 6 months - London - Outside IR35 Contract Length: 6 months Rate: £500 per day - Outside IR35 Location: London/Hybrid working Overview We are seeking an experienced AI Engineer to join on a 6-month contract, supporting the design and development of AI-driven solutions with a strong … improving system reliability, performance, and scalability through continuous improvement. Required Skills & Experience Proven experience in AI Engineering, specifically working with RAG (Retrieval-Augmented Generation) approaches. Strong Python development experience in a production environment. Solid software engineering background, with experience designing and building reliable systems. ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation: IT Graduate Recruitment
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £45,000 - £75,000 per annum, OTE

optimise pipelines for data collection, preprocessing, and model evaluation. Collaborate with product engineers to deploy models into scalable production systems. Experiment with prompt engineering, RAG architectures, and multimodal models. Contribute to internal tools for monitoring, testing, and improving AI performance. Stay on the edge of ML/AI research … Intelligence, Deep Learning, NLP, Natural Language Processing, Large Language Models, Generative AI, GenAI, Neural Networks, PyTorch, TensorFlow, Hugging Face, OpenAI, LangChain, RAG, Retrieval-Augmented Generation, Python, Data Science, AI Research, MLOps, Data Pipelines, Prompt Engineering, Model Fine-Tuning, Cloud Computing, AWS, Azure, Google Cloud ...

AI Solutions Architect

Hiring Organisation: Tadaweb
Location: London Area, United Kingdom

will leverage managed AI services across GCP and Azure to deliver features like semantic search, retrieval-augmented generation (RAG), and intelligent data enrichment. Working closely with Data and ML Engineers, you will define patterns, select technologies, and ensure our solutions are secure, performant … Search with embeddings. Cost & Performance : Establish strategies for latency optimization and cost control across clouds. AI Enablement & Delivery RAG System Design : Implement retrieval-augmented generation patterns for small, volatile data chunks; ensure grounding and factuality. Evaluation & Observability : Define quality metrics (precision/recall, groundedness ...

AI Solutions Architect

Hiring Organisation: Tadaweb
Location: City of London, London, United Kingdom

AI Developer/Engineer

Hiring Organisation: Damia Group Ltd
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: £500 - £650 per day

Experience with: AWS: boto3, Bedrock, SageMaker, Lambda, S3, EC2 Azure: Azure OpenAI Service, Cosmos DB Retrieval-Augmented Generation (RAG), Graph RAG Embedding models and LLM training fundamentals Damia Group Limited acts as an employment agency for permanent recruitment and employment business for the supply ...

Senior AI Engineer

Hiring Organisation: HCLTech
Location: London Area, United Kingdom

implement robust, goal-driven AI agents using leading frameworks like LangChain, LangGraph, and the Google Agent Development Kit (ADK). Develop and Evaluate RAG Pipelines: Engineer and optimize end-to-end Retrieval-Augmented Generation (RAG) systems, including data ingestion, chunking strategies, and implementing rigorous … Prompt Engineering and hands-on experience with model fine-tuning techniques including PEFT and QLoRA. Proven experience with models like Gemini, and Llama 3. RAG & Vector Databases: Deep expertise in RAG architecture and evaluation metrics. Proven experience with Vector Databases such as Milvus, Pinecone, or Chroma. Software & Cloud Engineering: Programming ...

Senior AI Engineer

Hiring Organisation: HCLTech
Location: City of London, London, United Kingdom

C# .Net Developer

Hiring Organisation: Damia Group Ltd
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: £500 - £650/day

Senior Data Scientist (Based in Dubai)

Hiring Organisation: Property Finder
Location: London, UK
Employment Type: Full-time

techniques. Drive innovation in Large Language Models (LLMs), Generative AI, and Agentic AI—pioneering new applications such as enhanced personalization, lead qualification, content generation, and workflow automation. Own the end-to-end ML lifecycle: from hypothesis generation, experimentation, evaluation, and explainability, to scalable deployment in production … libraries (PyTorch or TensorFlow). Hands-on experience with LLMs, prompt engineering, fine-tuning, and retrieval-augmented generation (RAG) pipelines. Experience integrating ML models via APIs and embedding AI in enterprise systems. Deep knowledge of cloud platforms (AWS/GCP/Azure) and containerized ...

Senior AI/ML Engineer

Hiring Organisation: Kainos
Location: City of London, London, United Kingdom

field. Demonstrable experience of deploying modern AI/ML solutions into production, including prompt engineering, retrieval-augmented generation (RAG), model evaluation, and monitoring using metrics (e.g. precision, recall, NDCG and drift detection). Strong Python skills with a grounding in software engineering best practices ...

Senior AI/ML Engineer

Hiring Organisation: Kainos
Location: London Area, United Kingdom

Senior ML Engineer

Hiring Organisation: Oscar
Location: City of London, London, United Kingdom

design, build, and deliver next-generation AI systems. You will work across LLMs, retrieval-augmented generation (RAG), and modern agent frameworks to transform large, unstructured data into meaningful insights and production-ready capabilities. This is a hands-on role within a growing … entity extraction, and intelligent automation. Design scalable data and streaming pipelines capable of handling large, heterogeneous datasets. Build and optimize vector search, embeddings, and RAG systems to support high-quality retrieval. Deliver production-ready APIs, services, and model inference systems. Manage deployment, monitoring, observability, and continual improvement of ML models. ...

Senior ML Engineer

Hiring Organisation: Oscar Associates (UK) Limited
Location: London, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £90,000

Senior ML Engineer

Hiring Organisation: Oscar Technology
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £70,000 - £90,000 per annum

GenAI Engineer

Hiring Organisation: Luxoft
Location: London Area, United Kingdom

Chatbots - Document Q&A systems - Report generators - Code assistants - Summarization tools Apply prompt engineering, Retrieval-Augmented Generation (RAG), and context-aware pipelines to enhance model accuracy and relevance. Integrate AI models with enterprise systems, APIs, and data stores using Python, Java, or Node.js. Collaborate … LLMs (e.g., GPT, LLaMA, Claude, Mistral), Transformers, and Diffusion models. Experience with Hugging Face Transformers, LangChain, LLM orchestration frameworks, and prompt tuning. Familiarity with RAG pipelines, embedding models, and vector databases. Experience with cloud platforms (AWS, GCP, Azure) and AI/ML services. Knowledge of MLOps tools and practices (e.g. ...

GenAI Engineer

Hiring Organisation: Luxoft
Location: City of London, London, United Kingdom

Machine Learning Engineer

Hiring Organisation: Anson Mccade
Location: Central London, London, United Kingdom
Employment Type: Permanent

Data Version Control Practical experience building LLM/GenAI applications, including prompt engineering and retrieval-augmented generation (RAG) Familiarity with LLMOps frameworks such as LangChain, LangSmith or LangGraph Understanding of model validation, evaluation techniques and production monitoring Experience working in cross-functional teams from ...

AI Engineer

Hiring Organisation: E.ON
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: Competitive salary

that power all internal GenAI product squads. You will be instrumental in designing and implementing our award winning Generative AI Platform Model Orchestration Layer, RAG infrastructure, communications layer, agentic layer and centralised governance/safety guardrails etc. This is a hybrid role, typically working 1 day per week … access to LLMs and other generative models. Engineer and maintain production-ready Vector Database and Retrieval-Augmented Generation (RAG) infrastructure, including high-throughput indexing pipelines and efficient retrieval strategies for enterprise data. Develop and manage a standardised, secure Agent Framework/ ...

Mid Senior GenAI Platform Engineer

Hiring Organisation: E.ON
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: Competitive salary

Senior AI Scientist

Hiring Organisation: Harnham
Location: City of London, London, United Kingdom

help drive innovation across cutting-edge AI initiatives — from chatbots and voice assistants to advanced retrieval-augmented generation (RAG) systems and agentic workflows. The Role You’ll work closely with the AI Engineering and Data Science teams to: Develop and prototype AI-driven solutions … across customer-facing and internal applications. Build and optimise LLM-based assistants , RAG pipelines , and agentic AI workflows . Collaborate on the architecture and deployment of scalable AI solutions (with support from engineering). Partner with stakeholders to translate business needs into practical, intelligent systems. Mentor junior team members ...

Senior AI Scientist

Hiring Organisation: Harnham
Location: London Area, United Kingdom

AI Engineer - Elite Sports Tech - London

Hiring Organisation: Oho Group Ltd
Location: City of London, London, United Kingdom

Responsibilities: Design, build, and deploy Large Language Model (LLM)-driven applications into production Develop retrieval-augmented generation (RAG) systems for real-time insights from performance and wearable data Optimise and fine-tune LLMs for efficiency, scalability, and accuracy in production environments Build and maintain … software engineering fundamentals Proven experience working with LLMs in production (fine-tuning, prompt engineering, or API integrations) Proficiency in Python Experience designing and deploying RAG pipelines (vector databases, embeddings, retrieval optimisation) Academic Excellence: BSc or MSc in Computer Science, AI, Data Science, or related technical field from ...

AI Engineer - Elite Sports Tech - London

Hiring Organisation: Oho Group Ltd
Location: London Area, United Kingdom

Gen AI Engineer

Hiring Organisation: Investigo Change Solutions
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: GBP 650 - 700 Daily

models for production use Build, maintain and enhance prompt engineering strategies and retrieval-augmented generation (RAG) pipelines Collaborate with architects, cloud teams and business stakeholders to ensure secure, scalable and robust deployments Conduct unit and integration testing, troubleshoot issues and support smooth releases into …/CD pipelines and MLOps practices for AI solutions Hands-on experience in model development and fine-tuning Deep expertise in prompt engineering and RAG solution design Performance monitoring and optimisation of AI models Experience developing and deploying APIs using API Gateway Familiarity with Azure cloud services for LLM deployments ...

Senior Software Engineer - AI & ML (Based in Dubai)

Hiring Organisation: Property Finder
Location: London, UK
Employment Type: Full-time

workloads using languages such as Python, Golang, or Node.js. Productionize ML/GenAI solutions, including: Retrieval-Augmented Generation (RAG) for support, content, and internal tools Recommendation and ranking services Classification, quality scoring, and enrichment pipelines Implement and evolve the AI/ML platform, including … cross-functional Agile teams. Hands-on experience with AI/ML or GenAI in production, such as: Fine-tuning or integrating transformer models Building RAG pipelines and semantic search Working with vector databases (e.g., Pinecone, Weaviate, Milvus, OpenSearch, etc.) Self-motivated and proactive, able to work independently where needed while ...