5 of 5 Reinforcement Learning Jobs in Central London

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
City of London, London, United Kingdom
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client is looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models. Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Reinforcement Learning (RL) control Engineer

Hiring Organisation
Randstad Digital
Location
City of London, London, United Kingdom
Employment Type
Permanent
Reinforcement Learning (RL) Engineer, Manipulation. London based (5 days in office). Competitive salary. A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed: Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’. Work with a vast amount of unique data - we have data from over 1M language ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’. Build fully-automated pipelines for dictionary building, including span identification, word sense distribution, and sense ...

AI Architect (Wealth)

Hiring Organisation
Teksystems
Location
Central London, London, United Kingdom
Employment Type
Contract, Work From Home
Title: AI Architect (Wealth). Job Description: This position is pivotal in designing AI and Machine Learning solutions on cloud-based platforms, exploring emerging AI trends, developing proof-of-concepts, and collaborating with internal and external ecosystems to advance these concepts to production. The role demands expertise in designing … least 6-10 years of hands-on development and architectural experience. Proficiency in Python, PyTorch, TensorFlow, or similar frameworks. Experience with supervised, unsupervised, and reinforcement learning. Solid grounding in Natural Language Processing (NLP) concepts such as tokenisation, embeddings, semantic search, text classification, and summarisation. Strong understanding of Large Language ...