Reinforcement Learning Jobs in the UK excluding London

51 to 69 of 69 Reinforcement Learning Jobs in the UK excluding London

Senior NLP Engineer (London)

london (city of london), south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the student’s needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in …

Senior NLP Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the student’s needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in …

Programming Language Engineer

Glasgow, Scotland, United Kingdom
Experis UK
Excellent communication and remote collaboration skills Nice to Have Experience with Julia Knowledge of applied category theory (e.g. algebraic theories, presheaves, coalgebra, polynomial functors) Background in model-based deep reinforcement learning, program synthesis, or theorem proving Willingness to deepen theoretical knowledge as needed Why This Role? This is a unique opportunity to work on a high-impact project …

Member of Technical Staff

london, south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems. Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering …

Member of Technical Staff

slough, south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems. Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering …

Member of Technical Staff

london (city of london), south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems. Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering …

AI Engineer

london, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar …

AI Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar …

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar …

Deep Learning Engineer

slough, south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with …

Deep Learning Engineer

london, south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with …

Deep Learning Engineer

london (city of london), south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with …

AI SME

london, south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services.

AI SME

slough, south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services.

AI SME

london (city of london), south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services.

AI SME- DB Pensions

London, South East, England, United Kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. Carbon60, Lorien & SRG - The Impellam Group STEM …
Employment Type: Contractor
Rate: Salary negotiable

AI Engineer

london, south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

AI Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

Reinforcement Learning, the UK excluding London
10th Percentile: £68,675
25th Percentile: £70,000
Median: £80,000
75th Percentile: £95,625
90th Percentile: £121,250