Retrieval-Augmented Generation Jobs in the City of London

101 to 125 of 140 Retrieval-Augmented Generation Jobs in the City of London

Full Stack developer, Software Product (AI)

london (city of london), south east england, united kingdom
Nothing
function Comfort working in a fast-moving, ambiguous environment An eye for quality and a bias toward shipping Bonus: experience in building AI-powered tools or creative applications (e.g. RAG systems, assistants, agents, etc. More ❯
Posted:

Principal AI Engineer - TWE 43569

City of London, London, United Kingdom
Hybrid / WFH Options
twentyAI
expertise and strategic thinking. What You’ll Do Design, build, and deploy AI-powered systems and features that enhance product functionality and user experience. Develop and integrate agentic frameworks, RAG pipelines, and vector database architectures to support natural language and intelligent automation use cases. Fine-tune and customise LLMs (e.g. OpenAI, Anthropic, and other models) for specific business applications. Collaborate … and scaling AI models in Azure and/or AWS environments. Skilled in using LangChain, LangGraph, vector databases, and prompt engineering. Understanding of LLM optimisation, including context extension and RAG-based frameworks. Proven ability to collaborate cross-functionally and communicate complex AI concepts to non-technical teams. A proactive, creative mindset — comfortable operating with autonomy to build scalable, production-grade … Why This Role Play a central role in defining and executing an organisation-wide AI roadmap. Work directly with senior leadership on strategy and implementation. Shape the next generation of intelligent, data-driven products. Hybrid working environment with flexibility for deep technical focus and collaboration. Competitive compensation and the opportunity to build at scale in a high-growth More ❯
Posted:

Principal AI Engineer - TWE 43569

london (city of london), south east england, united kingdom
Hybrid / WFH Options
twentyAI
expertise and strategic thinking. What You’ll Do Design, build, and deploy AI-powered systems and features that enhance product functionality and user experience. Develop and integrate agentic frameworks, RAG pipelines, and vector database architectures to support natural language and intelligent automation use cases. Fine-tune and customise LLMs (e.g. OpenAI, Anthropic, and other models) for specific business applications. Collaborate … and scaling AI models in Azure and/or AWS environments. Skilled in using LangChain, LangGraph, vector databases, and prompt engineering. Understanding of LLM optimisation, including context extension and RAG-based frameworks. Proven ability to collaborate cross-functionally and communicate complex AI concepts to non-technical teams. A proactive, creative mindset — comfortable operating with autonomy to build scalable, production-grade … Why This Role Play a central role in defining and executing an organisation-wide AI roadmap. Work directly with senior leadership on strategy and implementation. Shape the next generation of intelligent, data-driven products. Hybrid working environment with flexibility for deep technical focus and collaboration. Competitive compensation and the opportunity to build at scale in a high-growth More ❯
Posted:

Senior Machine Learning Engineer

City of London, London, United Kingdom
fierlo
AI capabilities at scale. We’re hiring an experienced, hands-on Machine Learning Engineer to architect and deliver production-grade AI system — with a strong focus on MCP, RAG, and real-world deployment . This project is a green field build out of AI capabilities. We are starting with RAG use cases as a low bar, but are targeting … the VP of Engineering Architect, build and ship advanced AI/ML & Generative AI solutions — scalable, secure, production-ready Design data ingestion pipelines, integrate vector databases and retrieval-augmented systems Ship models via APIs, containers, or cloud-native services Own engineering excellence — Git, CI/CD, automated ML testing, IaC Influence technical direction and mentor other … translate AI strategy into measurable business outcomes What you bring Expert Python — LangChain, Semantic Kernel, PyTorch, TensorFlow Hands-on cloud delivery — Azure/AWS, Terraform, ECS Proven experience building RAG/MCP architectures 5+ years in applied ML or AI engineering roles Send us your profile now for immediate consideration. November start. More ❯
Posted:

Senior AI Software Engineer

City of London, London, United Kingdom
Tempest Vane Partners
researchers, engineers, and portfolio specialists, solving complex real-world problems. Competitive compensation with a performance-linked bonus. What You’ll Do Take ownership of designing and building next-generation AI systems that make complex financial data instantly accessible and actionable. Partner closely with data scientists, ML researchers, and frontend engineers to turn research concepts into robust, scalable production … deployment practices using Docker, Kubernetes, and CI/CD pipelines. Comfort working in cloud-based environments (AWS preferred), including data connectivity and infrastructure-as-code principles. Experience integrating LLMs, RAG pipelines, or vector databases into production workflows is a major plus. A strong communicator who can translate technical complexity into business value and thrives in collaborative, cross-functional environments. Passion More ❯
Posted:

Senior AI Software Engineer

london (city of london), south east england, united kingdom
Tempest Vane Partners
researchers, engineers, and portfolio specialists, solving complex real-world problems. Competitive compensation with a performance-linked bonus. What You’ll Do Take ownership of designing and building next-generation AI systems that make complex financial data instantly accessible and actionable. Partner closely with data scientists, ML researchers, and frontend engineers to turn research concepts into robust, scalable production … deployment practices using Docker, Kubernetes, and CI/CD pipelines. Comfort working in cloud-based environments (AWS preferred), including data connectivity and infrastructure-as-code principles. Experience integrating LLMs, RAG pipelines, or vector databases into production workflows is a major plus. A strong communicator who can translate technical complexity into business value and thrives in collaborative, cross-functional environments. Passion More ❯
Posted:

GenAI Engineer

City of London, London, United Kingdom
Clarity (formerly Anecdote)
feedback loops. 25% Architect & scale Own reliability, latency, and cost. Design online/offline eval harnesses, canaries, and SLAs; operate GPUs/accelerators where needed. Stand up and harden RAG pipelines (indexing, retrieval policies, grounding, guardrails) and agent frameworks. Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost … building production ML/back‐end systems; 2+ years leading while coding. Expert Python ; strong back‐end chops (e.g., FastAPI, gRPC, Postgres, pub/sub/streams). Agents & RAG: Fluency with at least one agent framework ( ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval More ❯
Posted:

GenAI Engineer

london (city of london), south east england, united kingdom
Clarity (formerly Anecdote)
feedback loops. 25% Architect & scale Own reliability, latency, and cost. Design online/offline eval harnesses, canaries, and SLAs; operate GPUs/accelerators where needed. Stand up and harden RAG pipelines (indexing, retrieval policies, grounding, guardrails) and agent frameworks. Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost … building production ML/back‐end systems; 2+ years leading while coding. Expert Python ; strong back‐end chops (e.g., FastAPI, gRPC, Postgres, pub/sub/streams). Agents & RAG: Fluency with at least one agent framework ( ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval More ❯
Posted:

Agentic Engineer

City Of London, England, United Kingdom
Hybrid / WFH Options
Digital Waffle
their product, data, and software teams to build adaptive solutions that enhance productivity, automate workflows, and enable smarter business processes. This is an opportunity to shape how next-generation AI is applied in a fast-moving SME environment—balancing hands-on technical work with strategic innovation. Key Responsibilities Design, build, and deploy AI agents using frameworks such as … tools. Familiarity with APIs, databases, and cloud infrastructure (AWS, Azure, or GCP). Desirable: Experience in small, agile teams or start-up/SME environments. Knowledge of vector databases, RAG systems, or AI orchestration platforms. Interest in autonomous systems, cognitive architectures, or workflow automation. More ❯
Posted:

Agentic Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Digital Waffle
their product, data, and software teams to build adaptive solutions that enhance productivity, automate workflows, and enable smarter business processes. This is an opportunity to shape how next-generation AI is applied in a fast-moving SME environment—balancing hands-on technical work with strategic innovation. Key Responsibilities Design, build, and deploy AI agents using frameworks such as … tools. Familiarity with APIs, databases, and cloud infrastructure (AWS, Azure, or GCP). Desirable: Experience in small, agile teams or start-up/SME environments. Knowledge of vector databases, RAG systems, or AI orchestration platforms. Interest in autonomous systems, cognitive architectures, or workflow automation. More ❯
Posted:

AI Engineer

City of London, London, United Kingdom
Electric Twin
ll Do Architecture & Development : Design and implement the cognitive systems that give AI agents consistent personalities, memory, and reasoning capabilities, using advanced LLM techniques like chain-of-thought prompting, RAG systems, and agentic tool use. Modeling & Experimentation : Design and run systematic experiments to evaluate agent behavior, test hypotheses about behavioral patterns, and iterate on model architectures based on empirical results … principles Experience working in fast-paced environments where requirements evolve rapidly Technical Skills LLM & Agent Development : Hands-on experience building applications with large language models, implementing advanced prompting techniques, RAG systems, and agentic workflows Backend Engineering : Proficient in Python and backend frameworks (e.g. FastAPI, Django, Flask); understanding of distributed systems and scalable architectures AI/ML Frameworks : Experience with PyTorch More ❯
Posted:

AI Engineer

london (city of london), south east england, united kingdom
Electric Twin
ll Do Architecture & Development : Design and implement the cognitive systems that give AI agents consistent personalities, memory, and reasoning capabilities, using advanced LLM techniques like chain-of-thought prompting, RAG systems, and agentic tool use. Modeling & Experimentation : Design and run systematic experiments to evaluate agent behavior, test hypotheses about behavioral patterns, and iterate on model architectures based on empirical results … principles Experience working in fast-paced environments where requirements evolve rapidly Technical Skills LLM & Agent Development : Hands-on experience building applications with large language models, implementing advanced prompting techniques, RAG systems, and agentic workflows Backend Engineering : Proficient in Python and backend frameworks (e.g. FastAPI, Django, Flask); understanding of distributed systems and scalable architectures AI/ML Frameworks : Experience with PyTorch More ❯
Posted:

Senior Full Stack Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
MyPocketSkill
directly on several key product enhancements. You’ll work with Python/Django, Javascript and AWS. Our platform is also increasingly AI-powered, so familiarity with implementing AI solutions (RAG etc) is an advantage. You’ll be working on a project where everything is hosted in AWS and we have a lightweight automated deployment process. You’ll work alongside a …/or ReactJS. Working understanding of data capture and performance tracking, a willingness to contribute to design and UX decisions. AI familiarity with working on Gen AI projects, including RAG and API integration. Ability to work within project timelines, proactively communicate any delays and contribute to task re-prioritisation to keep things on track Experience in developing websites, web applications More ❯
Posted:

Senior Full Stack Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
MyPocketSkill
directly on several key product enhancements. You’ll work with Python/Django, Javascript and AWS. Our platform is also increasingly AI-powered, so familiarity with implementing AI solutions (RAG etc) is an advantage. You’ll be working on a project where everything is hosted in AWS and we have a lightweight automated deployment process. You’ll work alongside a …/or ReactJS. Working understanding of data capture and performance tracking, a willingness to contribute to design and UX decisions. AI familiarity with working on Gen AI projects, including RAG and API integration. Ability to work within project timelines, proactively communicate any delays and contribute to task re-prioritisation to keep things on track Experience in developing websites, web applications More ❯
Posted:

Data science: AI Reporting Lead

City of London, London, United Kingdom
Clarity (formerly Anecdote)
extraction, sentiment analysis, and qualitative insights Define reporting standards: Set quality bars for what makes a great dashboard (agent performance, app store trends, CSAT drivers, topic evolution, etc.) Optimize RAG pipelines: Design retrieval strategies and grounding approaches for report generation to ensure factual, relevant outputs Customer-Facing Analytics & Enablement (30%) Deliver bespoke insights: Partner with key … for accuracy and relevance, ensure quality before launch Operations & Optimization (10%) Monitor report health: Track delivery, engagement, and quality metrics; debug when outputs degrade Performance tuning: Optimize report generation costs and latency; balance API usage with quality Document everything: Maintain clear documentation for prompts, templates, and best practices What Makes You a Great FitTechnical Foundation 5+ years in … building customer insights, product analytics, or data-driven reporting AI-native experience: Hands-on work building reports or products using LLMs—prompt engineering, structured output generation, embeddings, RAG, summarization pipelines Python proficiency: Comfortable with pandas, OpenAI library, API integrations, and data manipulation for prototyping and analysis SQL fluency: Can write complex queries and understand data modeling Analytics tools More ❯
Posted:

Data science: AI Reporting Lead

london (city of london), south east england, united kingdom
Clarity (formerly Anecdote)
extraction, sentiment analysis, and qualitative insights Define reporting standards: Set quality bars for what makes a great dashboard (agent performance, app store trends, CSAT drivers, topic evolution, etc.) Optimize RAG pipelines: Design retrieval strategies and grounding approaches for report generation to ensure factual, relevant outputs Customer-Facing Analytics & Enablement (30%) Deliver bespoke insights: Partner with key … for accuracy and relevance, ensure quality before launch Operations & Optimization (10%) Monitor report health: Track delivery, engagement, and quality metrics; debug when outputs degrade Performance tuning: Optimize report generation costs and latency; balance API usage with quality Document everything: Maintain clear documentation for prompts, templates, and best practices What Makes You a Great FitTechnical Foundation 5+ years in … building customer insights, product analytics, or data-driven reporting AI-native experience: Hands-on work building reports or products using LLMs—prompt engineering, structured output generation, embeddings, RAG, summarization pipelines Python proficiency: Comfortable with pandas, OpenAI library, API integrations, and data manipulation for prototyping and analysis SQL fluency: Can write complex queries and understand data modeling Analytics tools More ❯
Posted:

Chief Software Architect

City Of London, England, United Kingdom
Harrington Starr
Python preferred; additional exposure to JavaScript/TypeScript or Go beneficial). Strong background in AI systems, cloud-native architectures, and modern ML tooling (LangChain, OpenAI API, vector stores, RAG pipelines). Experience designing distributed, high-availability platforms deployed on AWS or similar cloud environments. Excellent communication and stakeholder management skills, with the ability to translate technical direction into business … to the property, SaaS, or workflow automation space is advantageous. Summary A rare opportunity for a technically strong, execution-focused engineering leader to shape and scale a next-generation AI automation platform from the ground up — with full ownership of the engineering strategy, team, and architecture. More ❯
Posted:

Chief Software Architect

london (city of london), south east england, united kingdom
Harrington Starr
Python preferred; additional exposure to JavaScript/TypeScript or Go beneficial). Strong background in AI systems, cloud-native architectures, and modern ML tooling (LangChain, OpenAI API, vector stores, RAG pipelines). Experience designing distributed, high-availability platforms deployed on AWS or similar cloud environments. Excellent communication and stakeholder management skills, with the ability to translate technical direction into business … to the property, SaaS, or workflow automation space is advantageous. Summary A rare opportunity for a technically strong, execution-focused engineering leader to shape and scale a next-generation AI automation platform from the ground up — with full ownership of the engineering strategy, team, and architecture. More ❯
Posted:

Senior Machine Learning Engineer

City of London, London, United Kingdom
NearTech Search
doing: • Owning the design and build of LLM-based systems end-to-end • Fine-tuning and adapting models (LoRA, instruction tuning, PEFT etc) rather than just prompt-engineering • Building RAG workflows, embedding strategies, memory layers and domain grounding • Working out how to measure output quality, reduce hallucination risk and improve robustness • Optimising inference performance (quantisation, distillation, pruning, batching, caching) • Deploying … prototyping/notebooks • Strong Python and experience with PyTorch/Hugging Face/similar tooling • Experience deploying models into production (not just training them) • Familiarity with vector stores and RAG patterns • Comfortable operating in a hybrid environment and speaking with stakeholders where required • Someone who enjoys solving real problems end-to-end: from understanding the domain → designing the approach → shipping More ❯
Posted:

Senior Machine Learning Engineer

london (city of london), south east england, united kingdom
NearTech Search
doing: • Owning the design and build of LLM-based systems end-to-end • Fine-tuning and adapting models (LoRA, instruction tuning, PEFT etc) rather than just prompt-engineering • Building RAG workflows, embedding strategies, memory layers and domain grounding • Working out how to measure output quality, reduce hallucination risk and improve robustness • Optimising inference performance (quantisation, distillation, pruning, batching, caching) • Deploying … prototyping/notebooks • Strong Python and experience with PyTorch/Hugging Face/similar tooling • Experience deploying models into production (not just training them) • Familiarity with vector stores and RAG patterns • Comfortable operating in a hybrid environment and speaking with stakeholders where required • Someone who enjoys solving real problems end-to-end: from understanding the domain → designing the approach → shipping More ❯
Posted:

Applied AI Engineer - GenAI Consulting

City of London, London, United Kingdom
Harnham
Are you an AI engineer with strong software engineering fundamentals? Have you deployed AI models into production—not just trained them? Want to work on agentic systems, RAG pipelines, and LLM-powered tools? We're working with a high-growth GenAI consultancy building real-world, production-grade AI applications. With a team of 50+, the company partners with global clients … deployment, collaborating with cross-functional teams and mentoring other engineers. Key Responsibilities Build and deploy scalable AI tools using Python and LLM APIs Design end-to-end agentic and RAG-based solutions for enterprise use Own delivery from proof-of-concept through to production Work with cloud-native architectures (e.g. AWS Lambda, Step Functions) Lead and guide junior engineers within More ❯
Posted:

Applied AI Engineer - GenAI Consulting

london (city of london), south east england, united kingdom
Harnham
Are you an AI engineer with strong software engineering fundamentals? Have you deployed AI models into production—not just trained them? Want to work on agentic systems, RAG pipelines, and LLM-powered tools? We're working with a high-growth GenAI consultancy building real-world, production-grade AI applications. With a team of 50+, the company partners with global clients … deployment, collaborating with cross-functional teams and mentoring other engineers. Key Responsibilities Build and deploy scalable AI tools using Python and LLM APIs Design end-to-end agentic and RAG-based solutions for enterprise use Own delivery from proof-of-concept through to production Work with cloud-native architectures (e.g. AWS Lambda, Step Functions) Lead and guide junior engineers within More ❯
Posted:

Knowledge Graph & GenAI Lead

City of London, London, United Kingdom
Hybrid / WFH Options
Intelix.AI
/CDC) and entity resolution for graph population Author complex queries (Cypher, GSQL, AQL, SPARQL etc. depending on stack) Integrate knowledge graph retrieval & reasoning into LLM/RAG/GraphRAG systems Develop and evaluate graph ML/embedding models (link prediction, anomaly detection) Optimize graph performance, scaling, and query efficiency Liaise with client stakeholders: translate business problems into … TigerGraph, ArangoDB, OrientDB, or Stardog Proficiency in query languages (Cypher, GSQL, AQL, SPARQL, etc.) Strong background in pipelines, ETL, and entity resolution Exposure to integrating KG + LLM or RAG architectures Experience with graph algorithms, embeddings, or GNNs Cloud & production engineering literacy (AWS/Azure/GCP, containerization, CI/CD) Excellent communication skills — able to explain complex graph/ More ❯
Posted:

Platform Engineer

City of London, London, United Kingdom
Burns Sheehan
Lead | £100,000 + 20% Bonus | Hybrid - 2 Days per Week in London 💰 £100,000 + 20% Bonus 📍 London - 2 days a week in office 🛠️ AWS, Python, Terraform, Kubernetes, RAG/GenAI interest Here at Burns Sheehan, we're exclusively partnered with a leading UK fintech that's transforming small business lending - processing billions in decisions annually. They've built … CloudFormation) and cloud-native development. Python scripting proficiency at platform engineer level but an understanding of software development lifecycle and production system ownership. Interest or experience in GenAI infrastructure - RAG, vector databases, LLM platforms, or agent frameworks. Strong communication skills and ability to translate technical decisions into business outcomes. This role is NOT an ML Engineering position - you won't More ❯
Posted:

Backend AI Engineer

City of London, London, United Kingdom
Harnham
Do you want to build the infrastructure powering the next generation of AI agents? Have you scaled backend systems that drive automation at speed? Ready to join a profitable AI startup transforming how industries deploy technology? A profitable, fast-growing AI company is building the “roads” for AI agents — providing the infrastructure that lets businesses deploy and integrate …/LLM components into production systems. Role Breakdown: 50% Backend Engineering: FastAPI, Flask, Node.js, CI/CD 30% Data Engineering: ETL, DBT, Airflow 20% AI/LLM Integration: LangChain, RAG pipelines, orchestration Key Responsibilities: Design and build backend services to support AI agent deployment Develop scalable data pipelines and integration layers Implement AI/LLM-powered features with LangChain and … of Front-End, and adoption engineers Reporting Line: CTO Visa: Cannot sponsor Ideal Profile: 5–8 years’ experience in backend or data engineering (Python) Exposure to AI/LLMs, RAG, or personal AI side projects Experience in startups or scale-ups where you’ve helped build or grow systems Solid education and strong coding fundamentals Comfortable working in a small More ❯
Posted:
Retrieval-Augmented Generation
the City of London
10th Percentile
£58,125
25th Percentile
£60,938
Median
£66,250
75th Percentile
£72,500
90th Percentile
£79,250