Remote Reinforcement Learning Jobs in the South East

7 of 7 Remote Reinforcement Learning Jobs in the South East

Robotics Control Engineer (Locomotion)

London, South East England, United Kingdom
Hybrid / WFH Options
Seer
… the frontier of what legged machines can do. As part of this growth, they’re hiring Robotics Control Engineers with deep expertise in locomotion, reinforcement learning, and dynamic control systems to join their R&D headquarters. The Role: You’ll design and implement locomotion control policies, from walking … and stair climbing to fall recovery and manipulation-balanced motion. You’ll work at the intersection of classical control theory and reinforcement learning, deploying your work on humanoid platforms in the wild. Key Details: 🏢 Location: Hybrid or Onsite – US or EU HQs 💰 Salary: Highly competitive + equity + … Mechatronics, or similar; 2+ years experience in control systems for biped or humanoid robots. Strong understanding of: Model Predictive Control (MPC), optimal & feedback control; reinforcement learning in physical systems; humanoid dynamics, balance control, and full-body coordination. Proficiency in Python and C++ for real-time algorithm development. Experience …

Senior AI Software Engineer (Research and Development)

Culham, Oxfordshire, UK
Hybrid / WFH Options
Luffy AI
… with other developers on innovative code bases. Experience of genetic algorithms, low-level neural network execution and concepts such as neuroplasticity, recurrent neural networks and reinforcement learning frameworks like OpenAI Gym would be a huge advantage. This role requires specific experience with Python and familiarity with C/C++ … Experience Essentials: At least BSc in Computer Science or relevant discipline; 3-5 years professional software development experience; strong Python skills; some experience with Reinforcement Learning; solid grounding in API design, algorithms, design principles; well versed with industry standard development practices, testing frameworks, source control (git), CI, etc … C, C++, or Rust; experience with genetic algorithms or neuroevolution; experience with neural network concepts such as neuroplasticity and recurrent neural networks; experience of reinforcement learning frameworks like OpenAI Gym; experience with software optimisation or high performance computing. Fluent in English with excellent written and verbal communication skills.

Senior AI Software Engineer (Research and Development)

Culham, South East England, United Kingdom
Hybrid / WFH Options
Luffy AI
… with other developers on innovative code bases. Experience of genetic algorithms, low-level neural network execution and concepts such as neuroplasticity, recurrent neural networks and reinforcement learning frameworks like OpenAI Gym would be a huge advantage. This role requires specific experience with Python and familiarity with C/C++ … Experience Essentials: At least BSc in Computer Science or relevant discipline; 3-5 years professional software development experience; strong Python skills; some experience with Reinforcement Learning; solid grounding in API design, algorithms, design principles; well versed with industry standard development practices, testing frameworks, source control (git), CI, etc … C, C++, or Rust; experience with genetic algorithms or neuroevolution; experience with neural network concepts such as neuroplasticity and recurrent neural networks; experience of reinforcement learning frameworks like OpenAI Gym; experience with software optimisation or high performance computing. Fluent in English with excellent written and verbal communication skills.

Research Software Engineer

Guildford, Surrey, United Kingdom
Hybrid / WFH Options
Ecm Selection
… experience will also be considered). Full rights to work in the UK without limitation. While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits.
Employment Type: Permanent
Salary: GBP Annual

Data Science Manager (Valuations & Pricing)

Hook, Hampshire, United Kingdom
Hybrid / WFH Options
360 Resourcing Solutions
… hands-on, working with our Senior Staff Engineer to provide technical leadership to the team. You'll have great technical understanding of various machine learning approaches, which would include regression methods, probabilistic pricing models, tree-based gradient boosting approaches, information theory, neural networks, transfer learning, etc. While your … our experienced MLOps and Data Engineering teams, allowing you more time to focus on model development, research and the implementation of novel machine learning approaches. Key Responsibilities: Lead the Valuations & Pricing Team, ensuring ongoing improvements in KPIs associated with the team's products. Manage, coach and … agreeing priorities with stakeholders and the Head of Data Science and enable the team to deliver against agreed targets. Work closely with the Machine Learning Operations team to deliver products to end users, leveraging and contributing to established engineering standards and best practices. Work collaboratively with the Insight & Analytics …
Employment Type: Permanent
Salary: GBP Annual

Head of Applied AI

London, South East England, United Kingdom
Hybrid / WFH Options
Acquired Talent Ltd
… far richer, more capable intelligent behaviours. This is a unique opportunity to help build a dedicated AI function focused on embedding cutting-edge multimodal learning models into next-gen robotic platforms. The role will involve defining the long-term roadmap for how intelligence is deployed across the stack, from … successful in this role, you’ll need to bring: Strong technical leadership across applied AI/ML, with deep hands-on experience in robotic learning or embodied intelligence; a solid background in multimodal model development, especially in areas that combine computer vision, language understanding, and interactive learning (LLM … VLM or VLA); real-world deployment experience of learning-based systems, ideally within robotic or physical environments (embodied systems (AI) & reinforcement learning); comfort collaborating across functions, especially with engineering, hardware, and system design teams; solid programming and prototyping skills using modern deep learning frameworks (e.g. PyTorch …

Manipulation Lead

London, South East England, United Kingdom
Hybrid / WFH Options
Lawrence Harvey
… proposals to secure external sponsorships. What We’re Looking For: Ph.D. (or equivalent experience) in AI, ML, robotics, or related fields. Expertise in manipulation, reinforcement learning, motion planning, and perception. A track record of conducting research and publishing in top-tier AI/ML conferences and journals. Proficiency …

Reinforcement Learning salaries in the South East
10th Percentile: £95,500
25th Percentile: £96,250
Median: £105,000
75th Percentile: £116,250
90th Percentile: £118,500