Remote Reinforcement Learning Jobs in London

6 of 6 Remote Reinforcement Learning Jobs in London

Staff Machine Learning Scientist

London, United Kingdom
Hybrid / WFH Options
Intercom
service. Driven by our core values, we push boundaries, build with speed and intensity, and consistently deliver incredible value to our customers. What's the opportunity? Intercom's Machine Learning team is responsible for defining new ML features, researching appropriate algorithms and technologies, and rapidly getting first prototypes in our customers' hands. We are an extremely product focussed team. … dedicated ML product engineers enable us to move to production fast, often shipping to beta in weeks after a successful offline test. We are very passionate about applying machine learning technology, and have productized everything from classic supervised models, to cutting-edge unsupervised clustering algorithms, to novel applications of transformer neural networks. We test and measure the real customer … field (e.g. MSc) Scientific thinking skills Track record shipping ML products PhD or other experience in a research environment Deep experience in an applicable ML area - E.g. NLP, Deep learning, Bayesian methods, Reinforcement learning, clustering Strong stats or math background Benefits We are a well treated bunch, with awesome benefits! If there's something important to you More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

West London, London, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Employment Type: Permanent
Posted:

Senior Data Scientist

London, United Kingdom
Hybrid / WFH Options
ECM Selection (Holdings) Limited
experimental, and it is understood that not all projects succeed, even failed projects contain valuable insights. You will be building upon cutting-edge ML techniques such as transformers and reinforcement learning to create novel multi-modal solutions. Examples include sensor fusion systems, physics-informed neural networks for simulations, and multi-purpose autonomous robots. Projects will be defence focused … surrounding area. Initially this is an 18-month contract with the expectation of extending this as more funding is released. Keywords: AI, ML, RF, EM, GNN, Transformer, Autoencoder, Reinforced Learning, Multi-Modal AI, Sensor Fusion, Python, PyTorch, Radio Frequency, RF Another top job from ECM, the high-tech recruitment experts. Even if this job's not quite right, do More ❯
Employment Type: Permanent
Salary: £50000 - £60000/annum DoE + Benefits
Posted:

AI Architect

City of London, London, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain, AutoGen, CrewAI, or similar Experience supporting More ❯
Posted:

AI Architect

London Area, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain, AutoGen, CrewAI, or similar Experience supporting More ❯
Posted:

LLM Researcher

London, South East, England, United Kingdom
Hybrid / WFH Options
MicroTECH Global Ltd
and regulatory requirements in fintech (SOC2, PCI-DSS, GDPR). Ability to thrive in a fast-moving startup environment. Desirables: Background in fintech, payments, or treasury systems. Experience with reinforcement learning with human feedback (RLHF). More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:
Reinforcement Learning
London
10th Percentile
£86,675
25th Percentile
£92,500
Median
£122,500
75th Percentile
£175,000