Remote Reinforcement Learning Jobs in London

13 of 13 Remote Reinforcement Learning Jobs in London

AI Research Internship - Students Pursing PhD

London, England, United Kingdom
Hybrid / WFH Options
MediaTek
the other dedicated to fundamental research that supports both our applications and the broader scientific community. Current areas of interest include large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models. Responsibilities:- Contribute to ongoing research in machine learning and artificial intelligence Help develop and implement algorithms Collaborate with researchers and … preparing publications and technical reports Stay up to date with the latest advancements in AI and related fields Requirement Qualifications Required: Currently enrolled in a PhD program in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field Strong interest in research and a background in machine learning or a related area Experience with programming languages … or similar Strong problem-solving skills and ability to work independently and collaboratively Good communication skills and ability to present complex ideas clearly Nice-to-have Experience in optimization, reinforcement learning, and/or large language models (LLMs) Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch, JAX) Previous research publications or submitted papers Why Join Us Gain More ❯
Posted:

AI Research Internship - Students Pursing PhD

london, south east england, united kingdom
Hybrid / WFH Options
MediaTek
the other dedicated to fundamental research that supports both our applications and the broader scientific community. Current areas of interest include large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models. Responsibilities:- Contribute to ongoing research in machine learning and artificial intelligence Help develop and implement algorithms Collaborate with researchers and … preparing publications and technical reports Stay up to date with the latest advancements in AI and related fields Requirement Qualifications Required: Currently enrolled in a PhD program in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field Strong interest in research and a background in machine learning or a related area Experience with programming languages … or similar Strong problem-solving skills and ability to work independently and collaboratively Good communication skills and ability to present complex ideas clearly Nice-to-have Experience in optimization, reinforcement learning, and/or large language models (LLMs) Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch, JAX) Previous research publications or submitted papers Why Join Us Gain More ❯
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

West London, London, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Employment Type: Permanent
Posted:

GenAI Lead- £110,000-Hybrid

London, South East, England, United Kingdom
Hybrid / WFH Options
Tenth Revolution Group
also ensuring delivery excellence across all AI engagements. You will lead the integration of state-of-the-art technologies such as GPT-4, Transformers, Diffusion Models, DALL·E, Deep Reinforcement Learning (DRL), and other Large Language Models (LLMs), applying them to deliver measurable business value. In collaboration with data scientists, AI engineers, and business analysts, you will ensure More ❯
Employment Type: Full-Time
Salary: £110,000 per annum
Posted:

Graduate AI Analyst / Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Kingsgate Recruitment Ltd
Shape the Future with AI — Start Your Career Here Are you fascinated by artificial intelligence, machine learning, and data-driven innovation? Are you excited by the potential of AI to solve real-world problems and build smarter systems? We’re looking for an ambitious and curious Graduate AI Analyst/Engineer to join our AI & Data Science team. This … is an ideal role for a recent graduate who wants to kickstart their career by working on real AI projects, learning from experts, and gaining hands-on experience with modern AI tools, models, and data pipelines. Whether you studied Computer Science, Engineering, Maths, Data Science, or a related field, if you have a passion for AI and a hunger … hear from you. What You’ll Be Doing You’ll work with experienced AI engineers and data scientists to: Build & Train Models : Support the development and training of machine learning and deep learning models using tools like Python, TensorFlow, PyTorch, or Scikit-learn. Explore & Prepare Data : Help clean, transform, and analyse large datasets for AI applications. Experiment & Research More ❯
Employment Type: Full-Time
Salary: £36,000 - £38,000 per annum
Posted:

AI Architect

City of London, London, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

AI Architect

London Area, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

AI Architect

london, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

AI Architect

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

Prompt Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

Prompt Engineer

London Area, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

Prompt Engineer

london, south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

Prompt Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:
Reinforcement Learning
London
10th Percentile
£66,650
25th Percentile
£88,750
Median
£95,000
75th Percentile
£110,000
90th Percentile
£175,000