5 of 5 Reinforcement Learning Jobs in the South East

Reinforcement Learning RL control Engineer

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Machine Learning Engineer

Hiring Organisation
Kingsgate Recruitment Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £40,000 per annum
Machine Learning Engineer (Graduate/Early Career) Location: London Type: Full-time About the Role Kickstart your AI career! This role is designed specifically for recent graduates or early-career professionals eager to launch their careers in machine learning and AI. You’ll work on exciting real-world … required—what matters is curiosity, problem-solving skills, and passion for AI. What You’ll Do Implement, train, and optimize ML models (supervised, unsupervised, reinforcement learning) Preprocess data, engineer features, and evaluate model performance Deploy models to production and monitor performance Collaborate with cross-functional teams to integrate ...

Research Software Scientist / Engineer

Hiring Organisation
ECM Selection (Holdings) Limited
Location
Guildford, Surrey, United Kingdom
Employment Type
Permanent
Salary
£50000 - £80000/annum DoE + Benefits
Sciences which has included demonstrable project experience in scientific programming using Python, C++ or C# (ideally all three). Novel research experience in machine learning, optimisation or probabilistic modelling which should have included the development of new algorithms, use of numerical methods and computational modelling. Any exposure to digital … twins, agent-based systems, reinforcement learning or advanced optimisations methods would be desirable. For senior candidates, additional experience in mentoring, project leadership on innovation-based projects, or strategic direction would be advantageous. The role is fully onsite at their Guildford offices with opportunities for international travel. Compensation includes ...

Software Engineer (Numerical Modelling, AI/ML, C++/Python)

Hiring Organisation
Hays
Location
Guildford, Surrey, South East, United Kingdom
Employment Type
Permanent
level scientific language (e.g., Python, Julia). We are particularly looking at experience with scientific computing, numerical methods, or computational modelling. Desirables are Machine learning, optimization, control, probabilistic modelling, or related fields. Familiarity digital twins, agentic systems, reinforcement learning or advanced optimisation. What ...

AI Engineers

Hiring Organisation
Stackstudio Digital Ltd
Location
Redhill, Surrey, South East, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £500 to £550 per day
vector search technology. Collaborate with cross-functional teams to integrate AI models into products. Stay updated with the latest advancements in AI and machine learning technologies. Conduct research to improve existing AI systems and develop new approaches. Knowledge of NLP, computer vision, or reinforcement learning. Experience deploying … models in production environments. Your Profile Essential Skills/Knowledge/Experience Proven experience in AI, machine learning, or deep learning. Proficiency in programming languages such as Python, R, or Java. Experience with AI frameworks like TensorFlow, PyTorch, or Keras. Experience with large language models (GPT, BERT, etc.). ...