Reinforcement Learning Jobs in the UK

76 to 94 of 94 Reinforcement Learning Jobs in the UK

Senior Robotics Control Engineer Locomotion

London Area, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipelines, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors.

Senior Robotics Control Engineer Locomotion

London, South East England, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipelines, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors.

Robotics Control Engineer - Locomotion

London Area, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience: Familiarity with real-time control systems and integration with hardware …

Robotics Control Engineer - Locomotion

London, South East England, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience: Familiarity with real-time control systems and integration with hardware …

Robotics Control Engineer - Locomotion

London, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience: Familiarity with real-time control systems and integration with hardware …
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual

Robotics Control Engineer - Locomotion

London, Paddington, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics. Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience: Familiarity with real-time control systems and integration with hardware …
Employment Type: Permanent
Salary: £80,000 - £110,000/annum

Sr. Data Associate with English and German, OPTIMA - Alexa Shopping

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. By joining us, you will play a pivotal role in shaping the …
Employment Type: Permanent
Salary: GBP Annual

Program Manager, OPTIMA

London, United Kingdom
Amazon
a global team and enables Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the … feature teams to deliver superior CX quality by providing them reliable and comprehensive insights and ground truth data to measure and train ML (Machine Learning) models and handle annotation and Root Cause Analysis (RCA) across 10 different languages. We are seeking a highly organized, detail-oriented Program Manager to …
Employment Type: Permanent
Salary: GBP Annual

Lead Technologist - AI & ML

Chelmsford, Essex
Hybrid / WFH Options
BAE Systems
strong background in the mathematics, statistics and computational techniques which underpin AI/ML. The Data and Decision Support Capability has teams working in: reinforcement learning; NLP/LLMs; knowledge graphs and graph-based neural networks; AI for RF and EW, radar, sonar, acoustics; AI for image and … interest are candidates with experience in one or more of the following domains: AI/ML for imagery, including remote sensing applications. Reinforcement learning. Natural Language Processing. Large Language Models. Knowledge graphs and graph-based neural nets. AI/ML for RF and EW, radar, sonar, acoustics.
Employment Type: Permanent

Manipulation Lead

London Area, United Kingdom
Hybrid / WFH Options
Lawrence Harvey
proposals to secure external sponsorships. What We’re Looking For: Ph.D. (or equivalent experience) in AI, ML, robotics, or related fields. Expertise in manipulation, reinforcement learning, motion planning, and perception. A track record of conducting research and publishing in top-tier AI/ML conferences and journals. Proficiency …

Manipulation Lead

London, South East England, United Kingdom
Hybrid / WFH Options
Lawrence Harvey
proposals to secure external sponsorships. What We’re Looking For: Ph.D. (or equivalent experience) in AI, ML, robotics, or related fields. Expertise in manipulation, reinforcement learning, motion planning, and perception. A track record of conducting research and publishing in top-tier AI/ML conferences and journals. Proficiency …

Manipulation Lead

London, England, United Kingdom
Lawrence Harvey
proposals to secure external sponsorships. What We’re Looking For: Ph.D. (or equivalent experience) in AI, ML, robotics, or related fields. Expertise in manipulation, reinforcement learning, motion planning, and perception. A track record of conducting research and publishing in top-tier AI/ML conferences and journals. Proficiency …

Senior ML Researcher

London, United Kingdom
Ecm Selection
not all projects succeed; even failed projects contain valuable insights. You will be building upon cutting-edge ML techniques such as GNNs, transformers, and reinforcement learning to create novel multi-modal solutions to challenges in processing RF data such as signal detection, recognition, identification, and sensor fusion. The …
Employment Type: Permanent
Salary: GBP Annual

Chief Technology Officer

United Kingdom
MBN Solutions
and stakeholders. Action bias - you ship, you solve, you build. A solid background in AI (Engineering rather than Research). Possess a deep understanding of reinforcement learning, distributed training and Agentic AI. Not for you if: You’re more strategist than builder. Your AI experience is just post-ChatGPT.

Talent Lead, GTM

London, United Kingdom
Anima
Anima saves lives every day. Hey! Shun here, I'm the CEO and co-founder of Anima. We're building an active learning OS for all of healthcare and life sciences towards maximising human wellbeing globally. My entire life, I've been pulling on a thread that's affected … the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Top 1% growth. We grew 450% in 2024, are cash flow positive and … on coaching and teaching. It started with me. I self-taught and wrote a lot of the Anima 1.0 code, and Anima's active learning patent. I run most of the hiring tech chats to this day. I first and foremost see myself as an IC and builder, and …
Employment Type: Permanent
Salary: GBP Annual

Lead Recruiter, GTM

London, United Kingdom
Anima
Anima saves lives every day. Hey! Shun here, I'm the CEO and co-founder of Anima. We're building an active learning OS for all of healthcare and life sciences towards maximising human wellbeing globally. My entire life, I've been pulling on a thread that's affected … the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Top 1% growth. We grew 450% in 2024, are cash flow positive and … on coaching and teaching. It started with me. I self-taught and wrote a lot of the Anima 1.0 code, and Anima's active learning patent. I run most of the hiring tech chats to this day. I first and foremost see myself as an IC and builder, and …
Employment Type: Permanent
Salary: GBP Annual

New Trading Team's 1st C++ Quant Developer | HFT

London Area, United Kingdom
Augmentti
frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies …

New Trading Team's 1st C++ Quant Developer | HFT

London, South East England, United Kingdom
Augmentti
frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies …

Applied Scientist, Amzn Shipping-Prd & Tech, Amzn Shipping-Prd & Tech

London, United Kingdom
Amazon
help improve our ability to plan and execute package movements. As an Applied Scientist in Amazon Shipping, you will work on multiple challenging machine learning problems spread across a wide spectrum of business problems. You will build ML models to help our transportation cost auditing platforms effectively audit off … require you to demonstrate Think Big and Invent and Simplify, by refining and translating Transportation domain-related business problems into one or more Machine Learning problems. You will use techniques from a wide array of machine learning paradigms, such as supervised, unsupervised, semi-supervised, and reinforcement learning. … Your model choices will include, but not be limited to, linear/logistic models, tree-based models, deep learning models, ensemble models, and Q-learning models. You will use techniques such as LIME and SHAP to make your models interpretable for your customers. You will employ a family …
Employment Type: Permanent
Salary: GBP Annual

Reinforcement Learning
10th Percentile: £76,727
25th Percentile: £90,801
Median: £150,000
75th Percentile: £175,000