14 of 14 Reinforcement Learning Jobs in London

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
City of London, London, United Kingdom
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Senior Research Engineer

Hiring Organisation
algo1
Location
London Area, United Kingdom
backed startup focused on hyper-personalisation, currently in stealth. Inspired by the latest in recommender systems, we leverage transformers and graph learning alongside decision-making models to build the most engaging customer experiences for in-store retail. Our mission is to change retail forever through hyper-personalised experiences that … both simple and beautiful. About the Job - Foundation Models for Retail We are looking for a Senior Research Engineer with experience in advanced machine learning systems to work with our team of industry leading domain experts and engineers to build foundation models for retail shopping. Key Responsibilities: Translate latest ...

Reinforcement Learning (RL) control Engineer

Hiring Organisation
Randstad Digital
Location
City of London, London, United Kingdom
Employment Type
Permanent
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Reinforcement Learning RL control Engineer

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

AI / ML Architect

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £450 to £500 per day
Hybrid): 2 days from office Number of Positions: 4 The Role An AI/ML Developer is responsible for designing, building, and deploying machine learning models and AI solutions that solve business problems. This role focuses on coding, data preparation, and integrating models into production systems. Your Responsibilities … Model Development Design, build, and train machine learning models for predictive analytics, classification, NLP, computer vision, or other AI applications. Experiment with algorithms and optimize hyperparameters for performance. Data Preparation Collect, clean, and preprocess large datasets for training and validation. Implement feature engineering and data augmentation techniques. Integration & Deployment ...

Reinforcement Learning (RL) control Engineer

Location
London, United Kingdom
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world ...

Machine Learning Engineer

Hiring Organisation
Kingsgate Recruitment Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £40,000 per annum
Machine Learning Engineer (Graduate/Early Career) Location: London Type: Full-time About the Role Kickstart your AI career! This role is designed specifically for recent graduates or early-career professionals eager to launch their careers in machine learning and AI. You’ll work on exciting real-world … required—what matters is curiosity, problem-solving skills, and passion for AI. What You’ll Do Implement, train, and optimize ML models (supervised, unsupervised, reinforcement learning) Preprocess data, engineer features, and evaluate model performance Deploy models to production and monitor performance Collaborate with cross-functional teams to integrate ...

Senior Data Scientist

Hiring Organisation
Anson Mccade
Location
London, United Kingdom
Employment Type
Permanent
Responsibilities End-to-End Delivery: Lead the technical execution of AI projects, from initial problem discovery and hypothesis testing to deploying production-grade machine learning models. Strategic Advisory: Act as a "translator" between technical complexity and business value. You will work closely with C-suite stakeholders to identify … solve their most pressing strategic challenges. Technical Leadership: Architect robust, scalable data pipelines and state-of-the-art models (including LLMs, Reinforcement Learning, or Bayesian Inference) tailored to specific client needs. Mentorship: Guide and upskill junior Data Scientists, fostering a culture of rigorous peer review, clean coding standards ...

Data Scientist – Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
City of London, London, United Kingdom
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
London Area, United Kingdom
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents : LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve : You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

AI Developer/Engineer

Hiring Organisation
Damia Group Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £650 per day
Models (LLMs) and other generative architectures. Optimise pre-trained models (OpenAI, Anthropic, or open-source LLMs) for business use cases using prompt engineering and reinforcement learning. Experiment with model configurations to balance performance, cost, and scalability. Build robust data pipelines for continuous model improvement and retraining. Ensure compliance with ...

AI Architect (Wealth)

Hiring Organisation
Teksystems
Location
Central London, London, United Kingdom
Employment Type
Contract, Work From Home
Title: AI Architect (Wealth) Job Description This position is pivotal in designing AI and Machine Learning solutions on cloud-based platforms, exploring emerging AI trends, developing proof-of-concepts, and collaborating with internal and external ecosystems to advance these concepts to production. The role demands expertise in designing … least 6-10 years of hands-on development and architectural experience. Proficiency in Python, PyTorch, TensorFlow, or similar frameworks. experience with supervised, unsupervised, and reinforcement learning. Solid grounding in Natural Language Processing (NLP) concepts such as tokenisation, embeddings, semantic search, text classification, and summarisation. Strong understanding of Large Language ...