7 of 7 Retrieval-Augmented Generation Jobs in South London

Machine Learning Engineer

Hiring Organisation: Brio Digital
Location: South London, UK
Employment Type: Full-time

/Vertex AI, including fine-tuning, vector search, and low-latency inference. Build end-to-end LLM applications, leveraging RAG (Retrieval-Augmented Generation), agentic workflows, and prompt engineering. Implement robust evaluation frameworks to monitor LLM quality, hallucinations, token usage, and content safety. Develop … Looking For Essential 5+ years' experience in machine learning engineering or applied AI roles. Recent, demonstrable experience with LLMs, Generative AI, and/or RAG-based systems. Strong Python skills using frameworks such as PyTorch, TensorFlow, Hugging Face, or Google GenAI. Experience with vector databases and retrieval-based ...

Senior Data Scientist SME & AI Architect

Hiring Organisation: Information Tech Consultants
Location: South London, UK
Employment Type: Full-time

Developing and optimizing models for image recognition, object detection, and video analytics. NLP: Building sophisticated systems for sentiment analysis, entity extraction, semantic search, and RAG architectures leveraging LLMs. Generative AI: Exploring and implementing cutting-edge GenAI techniques for content creation, data augmentation, and innovative product features. SME Consulting & Mentorship … with Computer Vision tasks (e.g., CNNs, object detection models like YOLO). NLP: Expert practical experience in NLP techniques, including transformer models, embedding generation, and building complex text-based applications. 3. Leadership & Soft Skills Technical Leadership: Proven track record of leading complex data science projects from research ...

Data and AI Consultant - Full time

Hiring Organisation: Staffworx
Location: South London, UK
Employment Type: Full-time

technical stakeholders GenAI work You will work with GenAI in ways that are grounded in real use cases and business value: Building RAG systems that improve search, content discovery or productivity rather than existing for their own sake Implementing guardrails so models do not leak PII or generate harmful … Experience with PyTorch or TensorFlow GenAI specific Hands on experience with LLM APIs or open source models such as Llama or Mistral Experience building RAG systems with vector databases such as FAISS, Pinecone or Weaviate Ability to evaluate and improve prompts and retrieval quality using clear metrics Understanding ...

AI Application Security Architect

Hiring Organisation: Covenant HR
Location: South London, UK
Employment Type: Full-time

Have Skills: Strong background in application security, cloud security, or security architecture Hands-on experience with securing AI/ML systems, including LLMs, APIs, RAG pipelines, and vector stores Deep familiarity with AI security risks and threat modeling methodologies Working knowledge of modern DevSecOps practices and tools Excellent communication ...

AI Practice Lead

Hiring Organisation: Hancock & Parsons Ltd
Location: South London, UK
Employment Type: Full-time

Centre of Excellence within a consultancy or Microsoft Partner environment. Deep expertise in enterprise AI solution design, including generative AI, Azure OpenAI, Copilot Studio, RAG, cognitive services, and data orchestration patterns. Strong background in the Microsoft ecosystem, especially Dynamics 365, Power Platform, and Azure services. Hands‐on experience shaping ...

Artificial Intelligence Engineer

Hiring Organisation: Innova Recruitment
Location: South London, UK
Employment Type: Full-time

deployment. Deep knowledge of RAG Experience processing high volumens of documentation/text Strong knowledge of NLP and LLMs (transformers, fine-tuning, retrieval-augmented methods, agents). Experience conducting applied research and converting experimental outcomes into production systems. Familiarity with retraining workflows and performance monitoring. Understanding ...

AI Engineer

Hiring Organisation: numi
Location: South London, UK
Employment Type: Full-time

User-Defined Agent Logic: Creating a \"no-code\" engine where users can verbally define workflows and guardrails that the system executes deterministically. Hierarchical & Auditable RAG: Routing queries through complex layers of patient history, clinical guidelines, and organizational policies with full traceability. Resilient Orchestration: Managing long-running conversation states, tool usage … executor patterns and manage shared memory across agents. Prompt Engineering & Optimization: utilize programmatic approaches to compile and iteratively improve prompts based on evaluation metrics. RAG Optimization: Enhance retrieval signal through hybrid search, re-ranking, and query rewriting, ensuring high context precision and recall. Observability & Evals: Build robust tracing ...