Reinforcement Learning Jobs in London

51 to 73 of 73 Reinforcement Learning Jobs in London

Applied Scientist III, Amzn Shipping-Prd & Tech, Amzn Shipping-Prd & Tech

London, United Kingdom
Amazon
help improve our ability to plan and execute package movements. As an Applied Scientist in Amazon Shipping, you will work on multiple challenging machine learning problems spread across a wide spectrum of business problems. You will build ML models to help our transportation cost auditing platforms effectively audit off … require you to demonstrate Think Big and Invent and Simplify, by refining and translating Transportation domain-related business problems into one or more Machine Learning problems. You will use techniques from a wide array of machine learning paradigms, such as supervised, unsupervised, semi-supervised and reinforcement learning. … Your model choices will include, but not be limited to, linear/logistic models, tree based models, deep learning models, ensemble models, and Q-learning models. You will use techniques such as LIME and SHAP to make your models interpretable for your customers. You will employ a family More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Robotics Manipulation Engineer

London, England, United Kingdom
Humanoid
of grant proposals and funding applications to secure external research grants and sponsorships. Expertise Ph.D. or equivalent experience in computer science, artificial intelligence, machine learning, or related field, with a focus on manipulation, robotics, or AI/ML. Strong theoretical and practical knowledge in areas such as reinforcement learning, motion planning, and perception, with a specific emphasis on manipulation tasks. Proficiency in programming languages such as Python, C++, or MATLAB, as well as experience with relevant libraries and frameworks for AI/ML development. Experience in conducting independent research and publishing in top-tier conferences and More ❯
Posted:

Senior Robotics Manipulation Engineer

london, south east england, United Kingdom
Humanoid
of grant proposals and funding applications to secure external research grants and sponsorships. Expertise Ph.D. or equivalent experience in computer science, artificial intelligence, machine learning, or related field, with a focus on manipulation, robotics, or AI/ML. Strong theoretical and practical knowledge in areas such as reinforcement learning, motion planning, and perception, with a specific emphasis on manipulation tasks. Proficiency in programming languages such as Python, C++, or MATLAB, as well as experience with relevant libraries and frameworks for AI/ML development. Experience in conducting independent research and publishing in top-tier conferences and More ❯
Posted:

Senior Robotics control Engineer Locomotion

London Area, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipeline, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors. More ❯
Posted:

Senior Robotics control Engineer Locomotion

london, south east england, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipeline, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors. More ❯
Posted:

Program Manager, ACES, OPTIMA

London, United Kingdom
Amazon
a global team and enables Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the … shopping feature teams deliver superior CX quality by providing them reliable and comprehensive insights and ground truth data to measure and train ML (Machine Learning) models and handle annotation and Root Cause Analysis (RCA) across 10 different languages. Minimum Qualifications Minimum 2 years Project/Program management experience in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Robotics Control Engineer - Locomotion

London Area, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Posted:

Robotics Control Engineer - Locomotion

london, south east england, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Posted:

Sr. Data Associate with Italian and English

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the … training and analyzing the data sets for improving customer experience. These data sets generated are consumed by engineering teams like Natural Language Understanding, Machine Learning Science to develop, test and train the Alexa models with the highest quality data for improving the shopping experience. Come, partner with us to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Robotic Manipulation Lead

City, London, United Kingdom
Hybrid / WFH Options
Lawrence Harvey
proposals to secure external sponsorships. What We're Looking For: Ph.D. (or equivalent experience) in AI, ML, robotics, or related fields. Expertise in manipulation, reinforcement learning, motion planning, and perception. A track record of conducting research and publishing in top-tier AI/ML conferences and journals. Proficiency More ❯
Employment Type: Permanent
Salary: GBP 100,000 - 200,000 Annual
Posted:

Process Lead, Alexa Shopping Operations

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior ML Researcher

London, United Kingdom
Ecm Selection
not all projects succeed; even failed projects contain valuable insights. You will be building upon cutting-edge ML techniques such as GNNs, transformers, and reinforcement learning to create novel multi-modal solutions to challenges in processing RF data such as signal detection, recognition, identification, and sensor fusion. The More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Process Lead with Portuguese, Alexa Shopping OPTIMA

London, United Kingdom
Amazon
AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience globally. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We strive to provide an end-to-end data solution for the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Computational Chemist

North London, London, United Kingdom
Exalto Consulting ltd
of the following areas: Artificial Intelligence: Experience with state-of-the-art methods such as graph neural networks, transformers, Gaussian processes, generative modeling, or reinforcement learning. Cheminformatics: Knowledge of chemistry data storage, formats, and synthesis prediction; proficiency with toolkits such as RDKit or OpenEye. Quantum Mechanics: Experience applying QM … processing from heterogeneous sources; familiarity with tools like Apache Spark or Hadoop. Proficiency with cloud platforms (AWS, GCP, Azure). Familiarity with major machine learning frameworks (e.g., scikit-learn, TensorFlow, PyTorch). Open-source contributions or publications demonstrating expertise in machine learning for scientific applications. Hands-on experience More ❯
Employment Type: Permanent
Posted:

Talent Lead, GTM

London, United Kingdom
Anima
Anima saves lives every day Hey! Shun here, I'm the CEO and co-founder of Anima. We're building an active learning OS for all of healthcare and life sciences towards maximising human wellbeing globally. My entire life, I've been pulling on a thread that's affected … the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Top 1% growth We grew 450% in 2024, are cash flow positive and … on coaching and teaching. It started with me. I self taught and wrote a lot of the Anima 1.0 code, and Anima's active learning patent. I run most of the hiring tech chats to this day. I first and foremost see myself as an IC and builder, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead Recruiter, GTM

London, United Kingdom
Anima
Anima saves lives every day Hey! Shun here, I'm the CEO and co-founder of Anima. We're building an active learning OS for all of healthcare and life sciences towards maximising human wellbeing globally. My entire life, I've been pulling on a thread that's affected … the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Top 1% growth. We grew 450% in 2024, are cash flow positive and … on coaching and teaching. It started with me. I self-taught and wrote a lot of the Anima 1.0 code, and Anima's active learning patent. I run most of the hiring tech chats to this day. I first and foremost see myself as an IC and builder, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Business Analyst, OPTIMA Operations

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

London Area, United Kingdom
Augmentti
frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies More ❯
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

london, south east england, United Kingdom
Augmentti
frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies More ❯
Posted:

Tech Application Architect(GCP)

London, UK
Cognizant
expertise on Google Cloud services, including Compute Engine, App Engine, Kubernetes Engine, and Cloud Functions. Strong understanding of AI/ML concepts, including deep learning, natural language processing, and reinforcement learning. Extensive experience required in Cloud Run, Pub/Sub, and Gemini/Vertex AI Models. Expertise in More ❯
Posted:

Cloud Application Architect(GCP)

London Area, United Kingdom
Cognizant
expertise on Google Cloud services, including Compute Engine, App Engine, Kubernetes Engine, and Cloud Functions. Strong understanding of AI/ML concepts, including deep learning, natural language processing, and reinforcement learning. Extensive experience required in Cloud Run, Pub/Sub, and Gemini/Vertex AI Models. Expertise in More ❯
Posted:

Cloud Application Architect(GCP)

london, south east england, United Kingdom
Cognizant
expertise on Google Cloud services, including Compute Engine, App Engine, Kubernetes Engine, and Cloud Functions. Strong understanding of AI/ML concepts, including deep learning, natural language processing, and reinforcement learning. Extensive experience required in Cloud Run, Pub/Sub, and Gemini/Vertex AI Models. Expertise in More ❯
Posted:

Tech Application Architect(GCP)

london, south east england, united kingdom
Cognizant
expertise on Google Cloud services, including Compute Engine, App Engine, Kubernetes Engine, and Cloud Functions. Strong understanding of AI/ML concepts, including deep learning, natural language processing, and reinforcement learning. Extensive experience required in Cloud Run, Pub/Sub, and Gemini/Vertex AI Models. Expertise in More ❯
Posted:
Reinforcement Learning
London
25th Percentile
£125,000
Median
£150,000
75th Percentile
£175,000