Reinforcement Learning Jobs in London

1 to 25 of 72 Reinforcement Learning Jobs in London

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Locus Robotics
to sorting and pack-out-Locus Robotics empowers businesses to meet peak demands and adapt to ever-changing operational needs. Are you a Machine Learning Engineer with a passion for reinforcement learning, multi-agent systems, and simulation at scale? We want to hear from you! At Locus … tools and ML systems to optimize the behavior of large autonomous fleets in dynamic environments. In this role, you will work on cutting-edge reinforcement learning (RL) models, multi-agent systems, and faster-than-real-time simulations to drive innovation in logistics, robotics, and beyond. You'll collaborate … real-time modelling of 1,000+ autonomous agents for various use cases such as fleet optimization, logistics, or robotics. Develop, deploy, and maintain machine learning models, with a strong focus on reinforcement learning (RL) and multi-agent systems to optimize fleet behavior in dynamic environments. Implement and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer: Graph Machine Learning

London, United Kingdom
Atmanlabs
Research Engineer: Graph Machine Learning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. As an applied research and commercialization … proactive shopping assistance, to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Machine Reasoning: Web-scale Knowledge Graph … Exploration using Reinforcement Learning Human experts can form and explore structured mental models in their heads to solve open-ended problems across different domains. Our research seeks to emulate this process through a novel combination of using reinforcement learning agents to perform exploration through a knowledge More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

NLP / LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence

London, United Kingdom
NLP PEOPLE
NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, speech analytics, time series, reinforcement learning … environment together with the business, technologists and control partners to deploy solutions into production. The candidate must also have a strong passion for machine learning and invest independent time towards learning, researching and experimenting with new innovations in the field. The candidate must have solid expertise in Deep … Learning with hands-on implementation experience and possess strong analytical thinking, a deep desire to learn and be highly motivated. Job Responsibilities Research and explore new machine learning methods through independent study, attending industry-leading conferences, experimentation and participating in our knowledge sharing community. Develop state-of-the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Member of Technical Staff, AI - Reinforcement Systems

London, United Kingdom
Microsoft
Help build the world's most advanced reinforcement learning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you'll help advance state-of-the-art model … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics … competitive programming, and related domains are a plus. Responsibilities Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments. Embody our culture of collaboration, innovation, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Robotics Control Engineer (Locomotion)

London Area, United Kingdom
Hybrid / WFH Options
Seer
the frontier of what legged machines can do. As part of this growth, they’re hiring Robotics Control Engineers with deep expertise in locomotion , reinforcement learning , and dynamic control systems to join their R&D headquarters. The Role: You’ll design and implement locomotion control policies — from walking … and stair climbing to fall recovery and manipulation-balanced motion. You’ll work at the intersection of classical control theory and reinforcement learning, deploying your work on humanoid platforms in the wild. Key Details: 🏢 Location : Hybrid or Onsite – US or EU HQs 💰 Salary : Highly competitive + equity + … Mechatronics , or similar 2+ years experience in control systems for biped or humanoid robots Strong understanding of: Model Predictive Control (MPC) , optimal & feedback control Reinforcement learning in physical systems Humanoid dynamics , balance control, and full-body coordination Proficiency in Python and C++ for real-time algorithm development Experience More ❯
Posted:

Robotics Control Engineer (Locomotion)

london, south east england, United Kingdom
Hybrid / WFH Options
Seer
the frontier of what legged machines can do. As part of this growth, they’re hiring Robotics Control Engineers with deep expertise in locomotion , reinforcement learning , and dynamic control systems to join their R&D headquarters. The Role: You’ll design and implement locomotion control policies — from walking … and stair climbing to fall recovery and manipulation-balanced motion. You’ll work at the intersection of classical control theory and reinforcement learning, deploying your work on humanoid platforms in the wild. Key Details: 🏢 Location : Hybrid or Onsite – US or EU HQs 💰 Salary : Highly competitive + equity + … Mechatronics , or similar 2+ years experience in control systems for biped or humanoid robots Strong understanding of: Model Predictive Control (MPC) , optimal & feedback control Reinforcement learning in physical systems Humanoid dynamics , balance control, and full-body coordination Proficiency in Python and C++ for real-time algorithm development Experience More ❯
Posted:

Staff Data Scientist Data and Insights London

London, United Kingdom
loveholidays
way to get away. About the team Our Data Science team comprises eight members, including four Senior Data Scientists, two Data Scientists, a Machine Learning Engineer and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning techniques to create solutions to challenging business problems. We prioritise clean, well-tested code with a culture of documentation and knowledge sharing. Our tech stack includes GCP, Python … seeking an exceptional technical leader who can drive innovation while maintaining production excellence. The following qualities are essential: Technical Excellence: Deep expertise in machine learning approaches with the ability to assess and implement cutting-edge algorithms Strategic Thinking: Ability to break down high-level optimisation goals into lower-level More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist Data and Insights London Hybrid Remote

London, United Kingdom
Hybrid / WFH Options
loveholidays
Data Scientists, four Data Scientists, and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning … planning/prioritisation to delivery including monitoring and alerting Designing experiments and modelling to generate actionable insights and enhance business performance Proficient in machine learning and statistical methods for predictive modelling and forecasting Experience deploying ML models to production at scale Solid understanding of SQL Proficiency in unit testing … CI/CD, model management and experiment tracking Desirable Experience with Deep Learning, Generative AI and Reinforcement Learning Experience with Time Series Forecasting and Recommender Systems Previous experience working in e-commerce, retail, or the travel industry. Conducted and analysed large scale A/B experiments Experience More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Researcher Scientist

London, United Kingdom
Merantix
imagine, design, and make a better world. We are a team of scientists, researchers, engineers, and designers working together on projects that range from learning-based design systems, computer vision, graphics, robotics, human-computer interaction, sustainability, simulation, manufacturing, architectural design and construction. As a member of the AI Lab … in Autodesk Research you will be an expert in research areas such as artificial intelligence, deep learning, generative AI, machine learning , computer vision, reinforcement learning, information retrieval, and natural language processing. Autodesk's AI Lab is active in the wider research community, targeting publications at CVPR … related to AI/ML such as: Computer Science, Mathematics, Statistics, Physics, Linguistics, Mechanical Engineering, Architecture or related disciplines Publication track record in machine learning conferences and/or journals Significant post-graduate research experience, or 5 or greater years of work experience (actual job title/position will More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Applied AI Scientist (Recommender Systems)

London Area, United Kingdom
algo1
are a VC-backed startup focused on behavioural AI, currently in stealth mode. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and GenAI. We are looking for an Applied AI Scientist with deep experience in advanced Recommender Systems to work with our team … Recommender Systems. Solution development: Translate latest Recommender Systems advances into impactful solutions and products, from MVPs to fully deployed systems. Performance optimisation: Optimise machine learning models for performance in modern environments (e.g., distributed clusters, GPUs). Requirements: Hands on experience: Implemented Recommender System solutions in commercial context. Education: MSc … in Computer Science, Machine Learning, or a closely related field. Machine learning expertise : Strong foundation in machine learning and deep learning algorithms (e.g., deep neural networks, supervised/unsupervised learning, predictive analysis, forecast modelling). Programming proficiency: Excellent Python programming skills with experience in developing More ❯
Posted:

Applied AI Scientist (Recommender Systems)

london, south east england, United Kingdom
algo1
are a VC-backed startup focused on behavioural AI, currently in stealth mode. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and GenAI. We are looking for an Applied AI Scientist with deep experience in advanced Recommender Systems to work with our team … Recommender Systems. Solution development: Translate latest Recommender Systems advances into impactful solutions and products, from MVPs to fully deployed systems. Performance optimisation: Optimise machine learning models for performance in modern environments (e.g., distributed clusters, GPUs). Requirements: Hands on experience: Implemented Recommender System solutions in commercial context. Education: MSc … in Computer Science, Machine Learning, or a closely related field. Machine learning expertise : Strong foundation in machine learning and deep learning algorithms (e.g., deep neural networks, supervised/unsupervised learning, predictive analysis, forecast modelling). Programming proficiency: Excellent Python programming skills with experience in developing More ❯
Posted:

Senior Machine Learning Engineer (UAE Based)

London Area, United Kingdom
AI71
Role: Senior Machine Learning Engineer Location: Abu Dhabi, UAE (Full Relocation provided) Company: AI71 About Us AI71 is an applied research team dedicated to building responsible and impactful AI agents that empower knowledge workers. We work closely with our industry partners and leverage cutting-edge research from the Technology … Innovation Institute (TII) to develop AI products that drive transformative change. Senior Machine Learning Engineer Are you an experienced Senior Machine Learning Engineer with a passion for AI? Do you have a proven track record of harnessing the power of data, developing advanced algorithms, and building predictive models … opportunity for you to play a pivotal role in shaping AI-driven solutions that have the potential to redefine industries. As a Senior Machine Learning Engineer at AI71, you’ll be responsible for developing, deploying, and optimizing machine learning models and systems that power our AI solutions. You More ❯
Posted:

Senior Machine Learning Engineer (UAE Based)

london, south east england, United Kingdom
AI71
Role: Senior Machine Learning Engineer Location: Abu Dhabi, UAE (Full Relocation provided) Company: AI71 About Us AI71 is an applied research team dedicated to building responsible and impactful AI agents that empower knowledge workers. We work closely with our industry partners and leverage cutting-edge research from the Technology … Innovation Institute (TII) to develop AI products that drive transformative change. Senior Machine Learning Engineer Are you an experienced Senior Machine Learning Engineer with a passion for AI? Do you have a proven track record of harnessing the power of data, developing advanced algorithms, and building predictive models … opportunity for you to play a pivotal role in shaping AI-driven solutions that have the potential to redefine industries. As a Senior Machine Learning Engineer at AI71, you’ll be responsible for developing, deploying, and optimizing machine learning models and systems that power our AI solutions. You More ❯
Posted:

Gen AI Architect

London Area, United Kingdom
HCLTech
Ensure the system's scalability, flexibility, and efficiency to handle large algorithms. Algorithm Development: Develop and implement advanced AI algorithms and models, including machine learning, deep learning, and neural networks. Continuously evaluate and improve these algorithms to enhance system performance and accuracy. Data Integration: Identify relevant data sources … engineers to ensure the quality and relevance of data for AI model training. Model Training and Evaluation: Train AI models using supervised, unsupervised, or reinforcement learning techniques. Implement evaluation methodologies to measure the performance and effectiveness of trained models. Fine-tune models based on feedback and data insights. … Neural Network Design: Design and optimize deep learning neural networks for various AI tasks, such as natural language processing, computer vision, recommendation systems, and predictive analytics. Implement innovative architectures and techniques to improve model accuracy and efficiency. System Integration: Collaborate with software developers and engineers to integrate AI systems More ❯
Posted:

Gen AI Architect

london, south east england, United Kingdom
HCLTech
Ensure the system's scalability, flexibility, and efficiency to handle large algorithms. Algorithm Development: Develop and implement advanced AI algorithms and models, including machine learning, deep learning, and neural networks. Continuously evaluate and improve these algorithms to enhance system performance and accuracy. Data Integration: Identify relevant data sources … engineers to ensure the quality and relevance of data for AI model training. Model Training and Evaluation: Train AI models using supervised, unsupervised, or reinforcement learning techniques. Implement evaluation methodologies to measure the performance and effectiveness of trained models. Fine-tune models based on feedback and data insights. … Neural Network Design: Design and optimize deep learning neural networks for various AI tasks, such as natural language processing, computer vision, recommendation systems, and predictive analytics. Implement innovative architectures and techniques to improve model accuracy and efficiency. System Integration: Collaborate with software developers and engineers to integrate AI systems More ❯
Posted:

Founding AI Engineer - Reinforcement Learning & LLMs

London Area, United Kingdom
algo1
Founding AI Engineer - Reinforcement Learning & LLMs (London, In-person) VC-backed generative AI startup building platform to take LLM-powered tasks to 99%+ accuracy. Early-stage, engineering-led, everyone codes. Role: Build, train and optimise LLM systems capable of beating industry benchmarks. Deploy robust generative AI systems … ability with startup experience. Curious and loves to solve hard problems with first-principles thinking. Requirements: Experience deploying LLMs into production. Experience with deep reinforcement learning. Strong ML/data science and data engineering fundamentals. Excellent Python skills. More ❯
Posted:

Founding AI Engineer - Reinforcement Learning & LLMs

london, south east england, United Kingdom
algo1
Founding AI Engineer - Reinforcement Learning & LLMs (London, In-person) VC-backed generative AI startup building platform to take LLM-powered tasks to 99%+ accuracy. Early-stage, engineering-led, everyone codes. Role: Build, train and optimise LLM systems capable of beating industry benchmarks. Deploy robust generative AI systems … ability with startup experience. Curious and loves to solve hard problems with first-principles thinking. Requirements: Experience deploying LLMs into production. Experience with deep reinforcement learning. Strong ML/data science and data engineering fundamentals. Excellent Python skills. More ❯
Posted:

Head of Quant Trading

London Area, United Kingdom
Deeter Investments LLP
Research Algorithm Design: Lead the creation and refinement of proprietary trading algorithms rooted in the firm’s market framework, leveraging advanced statistical and machine‑learning techniques. Modeling & Simulation: Build forecasting, signal‑generation, and risk models; run rigorous back‑tests and simulations to validate performance. Data Analysis: Mine large, heterogeneous … datasets (market microstructure, alternative data, etc.) for actionable insights. Innovation: Continuously evaluate emerging research (deep learning, reinforcement learning, agent‑based modeling) to sharpen our edge. Technical Infrastructure & Implementation System Architecture: Partner with engineering to design high‑throughput trading systems that scale globally. Software Development: Oversee codebases in … language (Python, C++, or Java) and familiarity with Linux, Git, and CI workflows. Data Science: Deep knowledge of statistical modeling, feature engineering, and machine‑learning frameworks (PyTorch, TensorFlow, scikit‑learn). Systems: Proven skill in real‑time data pipelines, distributed/cloud computing, and performance optimization. Markets: Strong grasp More ❯
Posted:

Head of Quant Trading

london, south east england, United Kingdom
Deeter Investments LLP
Research Algorithm Design: Lead the creation and refinement of proprietary trading algorithms rooted in the firm’s market framework, leveraging advanced statistical and machine‑learning techniques. Modeling & Simulation: Build forecasting, signal‑generation, and risk models; run rigorous back‑tests and simulations to validate performance. Data Analysis: Mine large, heterogeneous … datasets (market microstructure, alternative data, etc.) for actionable insights. Innovation: Continuously evaluate emerging research (deep learning, reinforcement learning, agent‑based modeling) to sharpen our edge. Technical Infrastructure & Implementation System Architecture: Partner with engineering to design high‑throughput trading systems that scale globally. Software Development: Oversee codebases in … language (Python, C++, or Java) and familiarity with Linux, Git, and CI workflows. Data Science: Deep knowledge of statistical modeling, feature engineering, and machine‑learning frameworks (PyTorch, TensorFlow, scikit‑learn). Systems: Proven skill in real‑time data pipelines, distributed/cloud computing, and performance optimization. Markets: Strong grasp More ❯
Posted:

Senior Applied Scientist, Alexa

London, United Kingdom
Amazon
intuitive, and adaptive content recommendations across all Amazon surfaces. We aim to facilitate seamless reasoning and customer experiences, surpassing the capabilities of previous machine learning models. We are looking for a passionate, talented, and resourceful Senior Applied Scientist in the field of Natural Language Processing (NLP), Large Language Model … Retrieval, to invent and build scalable solutions for a state-of-the-art context-aware personal assistant. A successful candidate will have strong machine learning background and a desire to push the envelope in one or more of the above areas. The ideal candidate would also enjoy operating in … delightful recommendations and conversations. Your work will directly impact our customers in the form of products and services that make use of various machine learning, deep learning and language model technologies. You will leverage Amazon's heterogeneous data sources and large-scale computing resources to accelerate advances in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer: Superhuman Visual Generation

London, United Kingdom
Atmanlabs
proactive shopping assistance to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Visual Foundation Models: Unlocking Human Engagement … in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning. You can easily distinguish authentic and high integrity thinkers from 'posers', while also critically evaluating truth from fiction in your own work. Your … dampen the mood. Your intensity goes from 0 to 1000 when you become authentically interested in a topic. You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

CTO - Entourage

London, United Kingdom
Moonsong Labs, Inc
insight that code serves as a universal, learnable language for agents to interact with their environment, we are developing a protocol that enables collective learning through a shared memory layer where mutually distrusting agents can exchange and validate their experiences. This protocol transforms individual code actions, trajectories and experiences … while reducing costs. This approach, combined with carefully designed token incentives and memory curation mechanisms, creates a self-reinforcing ecosystem where each agent's learning contributes to and benefits from the network's growing capability. Our approach goes beyond existing frameworks, creating the connective tissue for the emerging agent … integrations, scalability, and security. Recruiting, hiring, mentoring, and managing a world-class AI development team. Cultivating a culture of ownership, innovation, collaboration, and continuous learning essential for the fast-moving AI agent ecosystem. Engaging with developers across frameworks and tech stacks to understand their needs and incorporate feedback into More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Research Engineer, Applied ML

London, United Kingdom
Google Inc
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred Qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Research Engineer, Applied ML

London, United Kingdom
WeAreTechWomen
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior AI Research Engineer

London, United Kingdom
Lawhive
complex multi-agent systems. Collaborate with engineers to transition experimental models into production-ready systems . Qualifications: Strong background in AI research, applied machine learning, and NLP . Experience with LLM model adaptation, fine-tuning, and inference optimization . Proficiency in Python, Pydantic, FastAPI, and working with LLM APIs … databases, embeddings, and structured AI retrieval . Hands-on experience with LLM-based planning, reasoning, and autonomous task execution . Familiarity with self-supervised learning, reinforcement learning, or adaptive AI techniques . Ability to translate academic AI research into practical experiments and working prototypes . Experience deploying More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Reinforcement Learning
London
25th Percentile
£125,000
Median
£150,000
75th Percentile
£175,000