1 to 25 of 45 Permanent Reinforcement Learning Jobs in London

NLP / LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence Location: London, United Kingdom The Machine Learning Center of Excellence invites the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, large language models … environment together with the business, technologists and control partners to deploy solutions into production. The candidate must also have a strong passion for machine learning and invest independent time towards learning, researching and experimenting with new innovations in the field. The candidate must have solid expertise in Deep ...

Principal Machine Learning Engineer, AI & Data Platforms (AiDP)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Principal Machine Learning Engineer, AI & Data Platforms (AiDP) London, England, United Kingdom Corporate Functions At Apple, we build AI systems that define experiences for billions of people and we do it with an unwavering commitment to privacy, performance, and craft. The AI & Data Platforms (AiDP) team is seeking … Principal Machine Learning Engineer to lead the design, fine‐tuning, evaluation, and productionisation of large language models and generative internal AI systems at global scale. This is a deeply hands‐on, high‐impact role: you will work across the full model lifecycle, from reinforcement learning and upstream ...

Applied AI ML Director - NLP / LLM and Graphs

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Responsibilities Create state‐of‐the‐art machine learning models to solve real‐world problems and apply them to NLP, speech recognition and analytics, time‐series predictions, or recommendation systems. Collaborate with multiple partner teams such as Business, Technology, Product Management, Legal, Compliance, Strategy and Business Management to deploy solutions … into production. Drive firm‐wide initiatives by developing large‐scale frameworks to accelerate the application of machine learning models across different areas of the business. Research and explore new machine learning methods through independent study, attending industry‐leading conferences, experimentation and participation in knowledge‐sharing community. Required Qualifications ...

Research Scientist, Reinforcement Learning, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Location DeepMind – London, UK Qualifications PhD in Machine Learning, or equivalent practical experience. 2 years of experience implementing algorithms within research codebases. Experience conducting research in reinforcement learning, including contributions to peer‐reviewed publications. Experience designing and executing end‐to‐end experiments, including setup, analysis, and interpretation. … Preferred qualifications Experience with advanced reinforcement learning topics, such as RL for sequence models, post‐training, preference‐based learning, or agentic systems. Familiarity with modern research stacks (e.g., JAX/Flax or PyTorch) and experience scaling experiments. Strong experimental judgment, including selecting appropriate baselines and designing insightful ...

Research Scientist, Agent Post-Training

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

backbone of upcoming releases. You will help bridge research and engineering by designing scalable experiments and building reliable infrastructure for tool use and reinforcement learning. You will value various experience and backgrounds to create extraordinary impact. Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind … scientific discovery, ensuring safety and ethics are always our highest priority. We are pushing the boundaries across multiple domains. Our global teams offer various learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort. Responsibilities Lead the full research process, from forming hypotheses to delivering ...

Research Engineer - Machine Learning (Contractor)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward. Job Summary The Reinforcement Learning Team at the Huawei London Research Centre is seeking a highly skilled and research-driven Machine Learning Engineer to join our team. … Awards, and industry recognition, including Huawei’s Gold Medals and Best Technology Breakthroughs. This role focuses on advancing the state-of-the-art in reinforcement learning, Bayesian optimisation, AI agents, large language models (LLMs), and/or vision-language models (VLMs). You will work at the intersection ...

Data Scientist - Principal

Hiring Organisation: Aristocrat
Location: Central London, London, United Kingdom
Employment Type: Permanent

high-impact data science initiatives end-to-end, including problem framing, methodology selection, experiment development, implementation partnership, and impact measurement. Build and deliver machine learning and reinforcement learning solutions to improve player engagement, retention, monetization, and operational outcomes. Lead the modeling framework for complex systems, guaranteeing comprehensive … evaluation and monitoring of causal inference, uplift modeling, sequential decisioning, bandits/reinforcement learning, and forecasting. Partner with game teams to define success metrics, guardrails, and decision frameworks, translating analytical results into actionable product and operational actions. Define and uphold engineering standards and guidelines for model development, including ...

Research ML Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Founding ML Engineer focused on Agentic Systems, you will contribute to building the foundational technologies for enterprise superintelligence: you help architect memory, reasoning, and learning capabilities of our autonomous agents. You will tackle hard problems in long-horizon planning, knowledge representation, and agentic learning, combining state … research with world-class engineering to build a new computing paradigm. If you are a world expert in search, retrieval, or reinforcement learning who is driven to define the future of agentic AI, this is your ideal role. What You’ll Do You will define the capabilities ...

Research Scientist - Machine Learning (Contractor)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward. Job Summary The Reinforcement Learning Team at the Huawei London Research Centre is seeking a highly skilled and research‐driven Machine Learning Scientist to join our team. … Awards, and industry recognition, including Huawei’s Gold Medals and Best Technology Breakthroughs. This role focuses on advancing the state‐of‐the‐art in reinforcement learning, Bayesian optimisation, AI agents, large language models (LLMs), and/or vision‐language models (VLMs). You will work at the intersection ...

Research Scientist, Robotics, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Qualifications PhD degree in a technical field or equivalent practical experience. 2 years of experience with reinforcement and imitation learning, multimodal generative modeling, training and inference, and vision/vision-language/video multimodal models. Preferred Qualifications Experience working with simulators and real-world robots, especially dexterous manipulation … human data with or without capture devices into our robotics foundation models. Leverage your broader expertise to participate in a wide variety of research: learning from simulation, reinforcement learning, learning from demonstrations, vision-language-action models, transformers, video generation, robot control, humanoid robots and more. About ...

Forward-Deployed Data Scientist II

Hiring Organisation: Braze
Location: Greater London, United Kingdom
Employment Type: Full Time

Forward-Deployed Data Scientist team is a group of creative technical experts who design and build end-to-end machine learning solutions that power 1-to-1 personalization for some of the world's leading brands. In this role, you will: Design ML use cases from the ground … broader AI deployment team and scale what's possible across engagements Partner with the Braze Product team to refine and advance Braze's reinforcement learning algorithms, pushing the self-learning capabilities of the platform forward Shape BrazeAI product strategy and roadmap by bringing customer-facing insights ...

Senior Lead AI Solutions Consultant

Hiring Organisation: Braze
Location: Greater London, United Kingdom
Employment Type: Full Time

interactive workshop to teach artificial intelligence concepts through guided exercises and discussion Serve as a thought leader, evangelizing the benefits of AI testing & reinforcement learning Maintain deep understanding of competitive and complementary technologies and vendors and how to position Braze in relation to them Collaborate cross-functionally (e.g. … presenter: You are an enthusiastic and engaging presenter, with the ability to communicate effectively with a wide range of stakeholders An expert in machine learning: You have a solid grasp of machine learning, including a familiarity with reinforcement learning Mar-tech savvy: You understand the marketing ...

Applied Scientist - Machine Learning

Hiring Organisation: Spencer Rose Ltd
Location: London, United Kingdom
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual

Applied Machine Learning Scientist London (Hybrid - 2 days per week) £80,000 - £110,000 + Benefits An innovative technology company is building next-generation AI solutions that optimise complex physical systems. Backed by significant recent investment, they're growing their research team to develop machine learning models that … solve real-world engineering challenges at scale. This is an opportunity for an Applied Machine Learning Scientist to work on cutting-edge research where deep learning meets real-world infrastructure. You'll develop advanced machine learning models using large-scale sensor data, combining AI with physical modelling ...

Research Scientist, Agent Post-Training, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Research Scientist, Agent Post-Training, DeepMind DeepMind London, UK Apply Qualifications Bachelor's or Master’s degree in Computer Science, Machine Learning, a related quantitative field, or equivalent practical experience. 2 years of experience with reinforcement learning (RL), supervised fine-tuning (SFT), or agent-based systems, including … modeling or applied infrastructure. 2 years of experience with machine learning frameworks such as JAX, Flax, TensorFlow, or PyTorch, and scaling experiments. Preferred qualifications PhD in Computer Science, Machine Learning, or a related quantitative field. Publications in reinforcement learning (RL), tool-use, or agentic systems ...

Research Engineer, ML — Advancing RL & AI for Science

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

leading tech research organization in Greater London is seeking a skilled Machine Learning Engineer to join their Reinforcement Learning Team. This role focuses on advancing AI through original research and developing cutting-edge algorithms that combine scientific and applied innovation. Candidates should have a strong academic background … Computer Science, expertise in reinforcement learning or Bayesian optimisation, and proficiency in Python. This is an exciting opportunity to contribute to high-impact AI projects within a dynamic multinational team. ...

Reinforcement Learning (RL) Engineer, Manipulation - Full UK Visa Sponsorship Available

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Title: Reinforcement Learning (RL) Engineer, Manipulation Company: Randstad Technologies Recruitment Location: City of London, London Salary: £80,000 – £120,000/annum We are currently supporting an exciting, top‐notch high‐tech robotics company in London as they assemble an elite team to define their 2026 technical roadmap. … opportunity for world‐class talent to solve the most complex challenges in high‐DOF autonomous systems and embodied AI. Expertise areas AI/Machine Learning MLOps Software Engineering Data Science The environment The mission is driven by high‐bandwidth, in‐person collaboration. This is a 5‐day‐a‐week ...

Software Engineer, RL Data

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

down to reading transcripts, supporting users, and wrangling vendors. The company's RL Data team builds the systems that produce high‐quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data … Effective use of AI tools in your own day‐to‐day work. Care about the societal impacts of your work. Preferred qualifications Experience with reinforcement learning on LLMs, particularly on the data side: creating evals, environments, rewards, graders, or training data. Experience helping organizations use AI more effectively ...

Research Engineer, RL Scaling Science

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

policy experts, and business leaders working together to build beneficial AI systems. About the role The company's RL Scaling Science team studies how reinforcement learning behaves as we scale it (across model size, compute, and task horizon) and turns that understanding into the training recipes behind … scale Partner closely with adjacent RL teams across research and engineering and advance our overall RL stack Minimum qualifications Strong empirical research skills in Reinforcement Learning, large-scale ML training, or a closely adjacent area Demonstrated ability to own large experiments end-to-end, from design through interpretation ...

Research Scientist/Engineer - General Decision & Control Agent

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

with continuous self-iterative optimization is an essential path toward artificial general intelligence. We are seeking passionate, creative talents with expertise in Agents and reinforcement learning to join our team and conduct cutting-edge research together. Key Responsibilities Agent Memory: Participate in the design and development of Agent … Agent Harness to realise continuous evolution of the Agent Harness framework. Agentic RL: Investigate agent optimisation techniques based on parametric and non-parametric reinforcement learning, and build collaborative update pipelines linking Agent policy models and Agent Harness. This job description is only an outline of the tasks, responsibilities ...

Manager, Lead Research Scientist, Training Data (Foundational Research)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research is the dedicated core … Machine Learning research division of Thomson Reuters. We are focused on research and development, with a particular focus on advanced algorithms and training techniques for Large Language Models (LLMs). We are building a strong foundation of research capabilities across different areas and are looking for managers ...

Machine Learning Engineer

Hiring Organisation: Platform Recruitment
Location: London, United Kingdom
Employment Type: Permanent
Salary: £60000 - £70000/annum

Title: Machine Learning Engineer Location: London Salary: Up to £70,000 DOE This is a unique opportunity for a Machine Learning Engineer to join a well-funded early-stage company building frontier AI systems focused on optimising complex operational environments. Working at the intersection of scientific machine learning … hands-on engineering role where you will work closely with a high-calibre founding team, industrial data, and customer environments to take machine learning systems from early validation through to scalable deployment. The Role You will design, build, and deploy advanced machine learning models capable of solving complex ...

Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … dedicated core Machine Learning research division of Thomson Reuters. We are focused on research and development, with a particular focus on advanced algorithms and training techniques for Large Language Models (LLMs). We are building a strong foundation of research capabilities across different areas and are looking for scientists ...

Applied Scientist II, Alexa for Shopping Science UK

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Amazon Development Centre (London) Limited We are looking for a passionate, talented, and inventive Applied Scientist with a strong machine learning background to help build industry‐leading language technology powering Alexa for Shopping, our AI‐driven search and shopping assistant, helping customers with their shopping tasks at every step … enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data‐driven insights to continuously improve our systems. Key job responsibilities Develop and maintain LLM agents ...

Manager, Lead Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … sleeves and participate in designing, coding, conducting experiments, and translating findings into concrete deliverables. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning, reasoning & complex workflows (e.g., Reasoning Models, LLMs + Knowledge Graphs, Test ...

Staff Machine Learning Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

public roads and tens of billions in simulation across 15+ U.S. states. The DUE ML Core London team builds and operates scalable machine learning systems, simulation workflows, and insight tools designed to improve the evaluation and developer onboarding journeys. By combining expert human judgement with advanced machine learning … novel RL algorithms, reward functions, and training paradigms tailored for generating high-fidelity and insightful driving behaviors. Lead the development of cutting-edge deep learning models and generative AI (LLM/VLM) solutions to enhance human-led triaging, introduce automation for high-volume workflows, and perform nuanced analysis ...