Employment Type

Remote Jobs

Hybrid/WFH 66

Sort By

Relevance
Date

Locations

Job Titles

176 to 190 of 190 Reinforcement Learning Jobs in London

Process Lead with French, Alexa Shopping OPTIMA

London, United Kingdom

Amazon

improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the LLM lifecycle, leveraging cutting-edge More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: Yesterday

Senior AI Engineer

London Area, United Kingdom

Nume

tool-using agents (MCP, API integration, data processing) Methods for agent monitoring, logging, and performance analysis Experience with model fine-tuning and evaluation for domain-specific applications Background in reinforcement learning or agent training methodologies Why you? You dream big You want to be part of a highly skilled and passionate team striving to create an amazing product More ❯

Posted: Today

Senior AI Engineer

City of London, London, United Kingdom

Nume

Posted: 2 days ago

Graphic Designer

City of London, London, United Kingdom

algo1

by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're looking for an exceptional Graphic Designer to join our growing team and … highly desirable. A passion for creating visually stunning and intuitive experiences. Bonus points if you have: Experience working in a fast-paced startup environment. Familiarity with AI or machine learning concepts. Experience in the retail or e-commerce sector. Why join us? This is a unique opportunity to join a stealth-mode company at the ground floor, working on More ❯

Posted: 3 days ago

Graphic Designer

London Area, United Kingdom

algo1

Posted: 3 days ago

Sales Director

London, England, United Kingdom

Federato

cyber-attacks, social inflation, etc. Our vision is understood and well funded by those behind Salesforce, Veeva, Zoom, Box, etc. Federato’s AI/ML-driven platform leverages deep reinforcement learning to help insurance companies optimise the portfolio of risks they insure, allowing them to continue to provide fair and equitable pricing in difficult-to-price areas. Our … Google Workspace, and sales enablement tools such as HubSpot, LI Sales Nav, Gong, etc. Role Requirements: P&C or Specialty Insurance industry experience a plus Base knowledge of Machine Learning and the ability to position these concepts with the clients Prior experience within entrepreneurial, high growth stage software companies Here at Federato, your capabilities are important, but culture fit … is quintessential. We move fast, are eager to listen to our users, take a first principles approach to solving problems, and value learning and the ability to change our minds. Most importantly, we're here to have fun. Our ability to make a difference starts with our people. We would love to work with you! We are an equal More ❯

Posted: Today

Talent Acquisition Lead, GTM

London, England, United Kingdom

Anima

Anima saves lives every day Hey! Shun here, I’m the CEO and co-founder of Anima. We’re building an active learning OS for all of healthcare and life sciences towards maximising human wellbeing globally. My entire life, I’ve been pulling on a thread that’s affected all of us in some way. Millions die every year … you’ll help us extend the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Top 1% growth We grew 450% in 2024, are cash flow positive and at Series B/C … EMs, but an intensive focus on coaching and teaching. It started with me. I self taught and wrote a lot of the Anima 1.0 code, and Anima’s active learning patent. I run most of the hiring tech chats to this day. I first and foremost see myself as an IC and builder, and still build stuff in my More ❯

Posted: 2 days ago

New Trading Team's 1st C++ Quant Developer | HFT

London Area, United Kingdom

Augmentti

optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯

Posted: Yesterday

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom

Augmentti

Posted: 2 days ago

Senior Government Affairs Manager

London, England, United Kingdom

Rocket Lab

and AI challenges that make a meaningful impact in the world Our work frequently takes us right up to the state of the art in technical innovation, be it reinforcement learning, distributed systems, generative AI, or deployment infrastructure. The defence industry is entering the most exciting phase of the technological development curve. Advances in our field of world More ❯

Posted: Today

EA Team Lead

London, England, United Kingdom

Helsing

Posted: 3 days ago

Field Engineer - Maritime

London, England, United Kingdom

Rocket Lab

and AI challenges that make a meaningful impact in the world. Our work frequently takes us right up to the state of the art in technical innovation, be it reinforcement learning, distributed systems, generative AI, or deployment infrastructure. The defence industry is entering the most exciting phase of the technological development curve. Advances in our field of world More ❯

Posted: Today

Drone Field Engineer

London, United Kingdom

helsing.ai

Employment Type: Permanent

Salary: GBP Annual

Posted: 2 days ago

Senior Software Engineer

London Area, United Kingdom

Humanoid

directly with robots in both simulation and real hardware environments. You will be part of a focused team responsible for the application level software that connects control, navigation, perception, learning, and platform systems. Your work will ensure that these components operate as a coherent and reliable system that users can interact with seamlessly. This role involves close collaboration with … practical, cross disciplinary challenges. What You Will Do You will develop and maintain application level software for humanoid robots You will integrate software components from controls, navigation, computer vision, reinforcement learning, and platform teams You will contribute to the structure and evolution of the application architecture and its interfaces You will work closely with the product and engineering … closely related field You are highly proficient in C++ and have experience delivering production grade software You have a solid understanding of robotic subsystems including control, perception, navigation, and learning You are familiar with ROS or ROS2 or equivalent middleware platforms You are comfortable reading, understanding, and integrating code from a range of other engineering teams You have experience More ❯

Posted: 3 days ago

Senior Software Engineer

City of London, London, United Kingdom

Humanoid

Posted: 3 days ago

5 6 78

Salary Guide

Reinforcement Learning
London

10th Percentile: £92,500
25th Percentile: £125,000
Median: £150,000
75th Percentile: £175,000

More Reinforcement Learning insights »