1 to 25 of 54 Permanent Reinforcement Learning Jobs in England

Applied AI Lead - NLP & LLM Scientist (Real-World ML)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence Location: London, United Kingdom The Machine Learning Center of Excellence invites the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, large language models … environment together with the business, technologists and control partners to deploy solutions into production. The candidate must also have a strong passion for machine learning and invest independent time towards learning, researching and experimenting with new innovations in the field. The candidate must have solid expertise in Deep ...

Principal Machine Learning Engineer, AI & Data Platforms (AiDP)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Principal Machine Learning Engineer, AI & Data Platforms (AiDP) London, England, United Kingdom Corporate Functions At Apple, we build AI systems that define experiences for billions of people and we do it with an unwavering commitment to privacy, performance, and craft. The AI & Data Platforms (AiDP) team is seeking … Principlal Machine Learning Engineer to lead the design, fine‐tuning, evaluation, and productionisation of large language models and generative internal AI systems at global scale. This is a deeply hands‐on, high‐impact role: you will work across the full model lifecycle, from reinforcement learning and upstream ...

Applied AI ML Director - NLP / LLM and Graphs

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Responsibilities Create state‐of‐the‐art machine learning models to solve real‐world problems and apply them to NLP, speech recognition and analytics, time‐series predictions, or recommendation systems. Collaborate with multiple partner teams such as Business, Technology, Product Management, Legal, Compliance, Strategy and Business Management to deploy solutions … into production. Drive firm‐wide initiatives by developing large‐scale frameworks to accelerate the application of machine learning models across different areas of the business. Research and explore new machine learning methods through independent study, attending industry‐leading conferences, experimentation and participation in knowledge‐sharing community. Required Qualifications ...

Research Scientist, Reinforcement Learning, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Location DeepMind – London, UK Qualifications PhD in Machine Learning, or equivalent practical experience. 2 years of experience implementing algorithms within research codebases. Experience conducting research in reinforcement learning, including contributions to peer‐reviewed publications. Experience designing and executing end‐to‐end experiments, including setup, analysis, and interpretation. … Preferred qualifications Experience with advanced reinforcement learning topics, such as RL for sequence models, post‐training, preference‐based learning, or agentic systems. Familiarity with modern research stacks (e.g., JAX/Flax or PyTorch) and experience scaling experiments. Strong experimental judgment, including selecting appropriate baselines and designing insightful ...

Research Scientist, Agent Post-Training

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

backbone of upcoming releases. You will help bridge research and engineering by designing scalable experiments and building reliable infrastructure for tool use and reinforcement learning. You will value various experience and backgrounds to create extraordinary impact.Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind … scientific discovery, ensuring safety and ethics are always our highest priority. We are pushing the boundaries across multiple domains. Our global teams offer various learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort.ResponsibilitiesLead the full research process, from forming hypotheses to delivering ...

Research Engineer - Machine Learning (Contractor)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward. Job Summary The Reinforcement Learning Team at the Huawei London Research Centre is seeking a highly skilled and research-driven Machine Learning Engineer to join our team. … Awards, and industry recognition, including Huawei’s Gold Medals and Best Technology Breakthroughs. This role focuses on advancing the state-of-the-art in reinforcement learning, Bayesian optimisation, AI agents, large language models (LLMs), and/or vision-language models (VLMs). You will work at the intersection ...

Applied Scientist - Machine Learning

Hiring Organisation: Spencer Rose Ltd
Location: London, United Kingdom
Employment Type: Permanent
Salary: GBP 110,000 Annual

Applied Scientist - Machine Learning London - Hybrid Up to £110,000 + Bonus My client, an innovative technology business that's applying cutting-edge AI to solve real-world engineering challenges at scale. Backed by significant investment and experiencing rapid growth, they're building advanced machine learning solutions that … deliver measurable sustainability benefits. This is an opportunity to join a highly collaborative R&D team where you'll work on complex machine learning problems from initial research through to deployment. If you enjoy solving difficult technical challenges, experimenting with new approaches and seeing your work have a genuine ...

Data Scientist - Principal

Hiring Organisation: Aristocrat
Location: Central London, London, United Kingdom
Employment Type: Permanent

high-impact data science initiatives end-to-end, including problem framing, methodology selection, experiment development, implementation partnership, and impact measurement. Build and deliver machine learning and reinforcement learning solutions to improve player engagement, retention, monetization, and operational outcomes. Lead the modeling framework for complex systems, guaranteeing comprehensive … evaluation and monitoring of causal inference, uplift modeling, sequential decisioning, bandits/reinforcement learning, and forecasting. Partner with game teams to define success metrics, guardrails, and decision frameworks, translating analytical results into actionable product and operational actions. Define and uphold engineering standards and guidelines for model development, including ...

Research ML Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Founding ML Engineer focused on Agentic Systems, you will contribute to building the foundational technologies for enterprise superintelligence: you help architect memory, reasoning, and learning capabilities of our autonomous agents. You will tackle hard problems in long-horizon planning, knowledge representation, and agentic learning, combining state … research with world-class engineering to build a new computing paradigm. If you are a world expert in search, retrieval, or reinforcement learning who is driven to define the future of agentic AI, this is your ideal role. What You’ll Do You will define the capabilities ...

Research Scientist - Machine Learning (Contractor)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward. Job Summary The Reinforcement Learning Team at the Huawei London Research Centre is seeking a highly skilled and research‐driven Machine Learning Scientist to join our team. … Awards, and industry recognition, including Huawei’s Gold Medals and Best Technology Breakthroughs. This role focuses on advancing the state‐of‐the‐art in reinforcement learning, Bayesian optimisation, AI agents, large language models (LLMs), and/or vision‐language models (VLMs). You will work at the intersection ...

Applied Research Scientist

Hiring Organisation: OpenSourced Ltd
Location: Bristol, Avon, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £120,000 per annum

+ Benefits What if your research could shape the future of robotics? Are you a Senior Applied AI Research Scientist working in Robotics, Machine Learning, Embodied AI or Foundation Models? Do you want access to real humanoid robots, large-scale AI training infrastructure, and the freedom to explore breakthrough … help identify the technologies that will power future generations of intelligent robotic systems. Key Responsibilities Research robotics foundation models, embodied AI and advanced robot learning systems Design and execute large-scale AI and robotics experiments Build benchmark frameworks to evaluate robot intelligence and manipulation capabilities Develop research prototypes that ...

Senior Robotics Engineer

Hiring Organisation: OpenSourced Ltd
Location: Bristol, Avon, England, United Kingdom
Employment Type: Full-Time
Salary: £60,000 - £120,000 per annum

Senior Robot Learning Engineer (Vision-Language-Action Models) Location: Bristol (On-Site) Job Type: Full-Time, Permanent Salary: Dependent on Experience (DOE) The Opportunity We're recruiting on behalf of a rapidly growing robotics and AI company seeking a Senior Robot Learning Engineer to develop and deploy advanced … improving robot intelligence and deploying those improvements into real-world environments. The Role You'll be responsible for developing, training and deploying advanced robot learning systems that enable increasingly capable manipulation and autonomous behaviours. Working alongside robotics, perception and infrastructure teams, you'll help shape the future of intelligent ...

Research Scientist, Robotics, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Qualifications PhD degree in a technical field or equivalent practical experience. 2 years of experience with reinforcement and imitation learning, multimodal generative modeling, training and inference, and vision/vision-language/video multimodal models. Preferred Qualifications Experience working with simulators and real-world robots, especially dexterous manipulation … human data with or without capture devices into our robotics foundation models. Leverage your broader expertise to participate in a wide variety of research: learning from simulation, reinforcement learning, learning from demonstrations, vision-language-action models, transformers, video generation, robot control, humanoid robots and more. About ...

Forward-Deployed Data Scientist II

Hiring Organisation: Braze
Location: Greater London, United Kingdom
Employment Type: Full Time

Forward-Deployed Data Scientist team is a group of creative technical experts who design and build end-to-end machine learning solutions that power 1-to-1 personalization for some of the world's leading brands. In this role, you will: Design ML use cases from the ground … broader AI deployment team and scale what's possible across engagements Partner with the Braze Product team to refine and advance Braze's reinforcement learning algorithms, pushing the self-learning capabilities of the platform forward Shape BrazeAI product strategy and roadmap by bringing customer-facing insights ...

Senior Lead AI Solutions Consultant

Hiring Organisation: Braze
Location: Greater London, United Kingdom
Employment Type: Full Time

interactive workshop to teach artificial intelligence concepts through guided exercises and discussion Serve as a thought leader, evangelizing the benefits of AI testing & reinforcement learning Maintain deep understanding of competitive and complementary technologies and vendors and how to position Braze in relation to them Collaborate cross-functionally (e.g. … presenter: You are an enthusiastic and engaging presenter, with the ability to communicate effectively with a wide range of stakeholders An expert in machine learning: You have a solid grasp of machine learning, including a familiarity with reinforcement learning Mar-tech savvy: You understand the marketing ...

Research Scientist, Agent Post-Training, DeepMind

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Research Scientist, Agent Post-Training, DeepMind DeepMind London, UK Apply Qualifications Bachelor's or Master’s degree in Computer Science, Machine Learning, a related quantitative field, or equivalent practical experience. 2 years of experience with reinforcement learning (RL), supervised fine-tuning (SFT), or agent-based systems, including … modeling or applied infrastructure. 2 years of experience with machine learning frameworks such as JAX, Flax, TensorFlow, or PyTorch, and scaling experiments. Preferred qualifications PhD in Computer Science, Machine Learning, or a related quantitative field. Publications in reinforcement learning (RL), tool-use, or agentic systems ...

Research Engineer, ML — Advancing RL & AI for Science

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

leading tech research organization in Greater London is seeking a skilled Machine Learning Engineer to join their Reinforcement Learning Team. This role focuses on advancing AI through original research and developing cutting-edge algorithms that combine scientific and applied innovation. Candidates should have a strong academic background … Computer Science, expertise in reinforcement learning or Bayesian optimisation, and proficiency in Python. This is an exciting opportunity to contribute to high-impact AI projects within a dynamic multinational team. #J-18808-Ljbffr ...

Reinforcement Learning (RL) Engineer, Manipulation - Full UK Visa Sponsorship Available

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Title: Reinforcement Learning (RL) Engineer, Manipulation Company: Randstad Technologies Recruitment Location: City of London, London Salary: £80,000 – £120,000/annum We are currently supporting an exciting, top‐notch high‐tech robotics company in London as they assemble an elite team to define their 2026 technical roadmap. … opportunity for world‐class talent to solve the most complex challenges in high‐DOF autonomous systems and embodied AI. Expertise areas AI/Machine Learning MLOps Software Engineering Data Science The environment The mission is driven by high‐bandwidth, in‐person collaboration. This is a 5‐day‐a‐week ...

Senior AI Technologist

Hiring Organisation: Anson Mccade
Location: Chelmsford, Essex, South East, United Kingdom
Employment Type: Permanent
Salary: £85,000

join a rapidly expanding Data and Decision Support Capability team. This role focuses on cutting-edge research and development across various AI domains, including reinforcement learning, NLP/LLMs, knowledge graphs, computer vision, and advanced sensor processing (radar, sonar, acoustics, and RF). Typical Responsibilities Technical Leadership: Lead … Candidates should ideally possess expertise or interest in one or more of the following domains: AI/ML for imagery and remote sensing applications Reinforcement learning Natural Language Processing (NLP) & Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML applied to RF, Electronic ...

Software Engineer, RL Data

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

down to reading transcripts, supporting users, and wrangling vendors. The company's RL Data team builds the systems that produce high‐quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data … Effective use of AI tools in your own day‐to‐day work. Care about the societal impacts of your work. Preferred qualifications Experience with reinforcement learning on LLMs, particularly on the data side: creating evals, environments, rewards, graders, or training data. Experience helping organizations use AI more effectively ...

Research Engineer, RL Scaling Science

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

policy experts, and business leaders working together to build beneficial AI systems. About the role The company's RL Scaling Science team studies how reinforcement learning behaves as we scale it (across model size, compute, and task horizon) and turns that understanding into the training recipes behind … scale Partner closely with adjacent RL teams across research and engineering and advance our overall RL stack Minimum qualifications Strong empirical research skills in Reinforcement Learning, large-scale ML training, or a closely adjacent area Demonstrated ability to own large experiments end-to-end, from design through interpretation ...

Research Scientist/Engineer - General Decision & Control Agent

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

with continuous self-iterative optimization is an essential path toward artificial general intelligence. We are seeking passionate, creative talents with expertise in Agents and reinforcement learning to join our team and conduct cutting-edge research together. Key Responsibilities Agent Memory: Participate in the design and development of Agent … Agent Harness to realise continuous evolution of the Agent Harness framework. Agentic RL: Investigate agent optimisation techniques based on parametric and non-parametric reinforcement learning, and build collaborative update pipelines linking Agent policy models and Agent Harness. This job description is only an outline of the tasks, responsibilities ...

Machine Learning Engineer

Hiring Organisation: Platform Recruitment
Location: London, United Kingdom
Employment Type: Permanent
Salary: £60000 - £70000/annum

Title: Machine Learning Engineer Location: London Salary: Up to £70,000 DOE This is a unique opportunity for a Machine Learning Engineer to join a well-funded early-stage company building frontier AI systems focused on optimising complex operational environments. Working at the intersection of scientific machine learning … hands-on engineering role where you will work closely with a high-calibre founding team, industrial data, and customer environments to take machine learning systems from early validation through to scalable deployment. The Role You will design, build, and deploy advanced machine learning models capable of solving complex ...

Applied Scientist II, Alexa for Shopping Science UK

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Amazon Development Centre (London) Limited We are looking for a passionate, talented, and inventive Applied Scientist with a strong machine learning background to help build industry‐leading language technology powering Alexa for Shopping, our AI‐driven search and shopping assistant, helping customers with their shopping tasks at every step … enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data‐driven insights to continuously improve our systems. Key job responsibilities Develop and maintain LLM agents ...

Manager, Lead Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … sleeves and participate in designing, coding, conducting experiments, and translating findings into concrete deliverables. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning, reasoning & complex workflows (e.g., Reasoning Models, LLMs + Knowledge Graphs, Test ...