Reinforcement Learning Jobs in the UK

51 to 69 of 69 Reinforcement Learning Jobs in the UK

Senior Research Scientist (Data Science & Machine Learning)

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Anson Mccade
Data Science and Machine Learning Research Scientist Location: Chelmsford | Hybrid (2 days onsite) | Salary ~£70,000 + 10% bonus | SC Clearable A specialist technology consultancy is seeking a Data Science and Machine Learning Research Scientist to develop novel AI/ML solutions for complex, real-world challenges in national security and defence. Working within a mission-led, multidisciplinary … you'll help shape next-generation tools across areas such as multi-modal fusion, geospatial intelligence, and intelligent decision support. About the Opportunity You'll join a 35+ person AI & Machine Learning capability, collaborating across disciplines including computer vision, NLP, knowledge representation, and reinforcement learning. This hands-on role focuses on applied research and technical delivery, spanning early-stage concept exploration … to proof-of-concept demonstrators for government R&D sponsors. The team operates at the intersection of data science, autonomy, cyber, and complex systems, blending statistical, symbolic, and machine learning approaches to address high-impact use cases. Key Responsibilities Conduct applied research in data science and machine learning, with a focus on real-world decision-making applications Design, implement …
Employment Type: Permanent, Work From Home
Salary: £70,000

Research Engineer, Search

London, United Kingdom
Hybrid / WFH Options
Menlo Ventures
new architectures for how information is organized, and train language models to optimally use those architectures. Responsibilities: Designing and implementing from scratch new information architecture strategies Performing finetuning and reinforcement learning to teach language models how to interact with new information architectures Building "hard" knowledge base eval sets to help identify failure modes of how language models work … may be a good fit if you: Are a very experienced Python programmer who can quickly produce reliable, high quality code that your teammates love using Have good machine learning research experience Have experience developing software that utilizes Large Language Models such as Claude Are results-oriented, with a bias towards flexibility and impact Pick up slack, even if … many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation …
Employment Type: Permanent
Salary: GBP Annual

Senior AI Engineer

London, United Kingdom
Hybrid / WFH Options
GSMA LLC
and contribute to algorithm development. Dr G.A.McHale, Technical Director, AI & Data Science About the Team The team is led by someone with significant AI experience in bio-inspired architectures, reinforcement learning, expert systems, scheduling, meta-heuristics, robotics, and natural language processing (including LLMs). We have recruited an experienced scientific computing developer with a strong mathematics background in … mobile's future. You will work in a fast-paced, evolving environment on global solutions and industry-changing projects. Benefits include state-of-the-art offices, competitive pay, excellent learning opportunities, generous holidays, and professional development days. We value diversity, equity, and inclusion, offering flexible working arrangements to support work/life balance. We encourage applications from all backgrounds …
Employment Type: Permanent
Salary: GBP Annual

Senior Data Scientist

London, United Kingdom
Hybrid / WFH Options
Ecm Selection
experimental, and it is understood that not all projects succeed; even failed projects contain valuable insights. You will be building upon cutting-edge ML techniques such as transformers and reinforcement learning to create novel multi-modal solutions. Examples include sensor fusion systems, physics-informed neural networks for simulations, and multi-purpose autonomous robots. Projects will be defence-focused …
Employment Type: Permanent
Salary: GBP Annual

Lead Technologist

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Anson Mccade
who can provide technical leadership across a portfolio of advanced R&D programmes. You will guide both internal delivery and external engagement in domains including NLP, LLMs, computer vision, reinforcement learning, and AI for sensor applications. The role blends hands-on innovation with assurance, mentoring, and proposal direction. Key Requirements: PhD in a relevant field with 10+ years …
Employment Type: Permanent, Work From Home

VP of Engineering - Cloud, AI, and Microservices. (London)

Wandsworth, Greater London, UK
Bounty Hunter World
particularly with cloud-based systems, Microservices, and AI integrations. Oversee LLM (Large Language Model) integration, leveraging models like GPT, and implement advanced AI architectures such as Retrieval-Augmented Generation (RAG) and Reinforcement Learning with Human Feedback (RLHF). Lead the development of applications using React, .NET, Python, or Node.js, ensuring scalable, maintainable, and performant solutions. Team Management: Manage and mentor engineering managers …
Employment Type: Full-time

VP of Engineering - Cloud, AI, and Microservices. London, United (London)

Wandsworth, Greater London, UK
esrhealthcare
cloud-based systems, microservices, and AI integrations. Oversee LLM (Large Language Model) integration, leveraging models like GPT, and implement advanced AI architectures such as Retrieval-Augmented Generation (RAG) and Reinforcement Learning with Human Feedback (RLHF). Lead the development of applications using React, .NET, Python, or Node.js, ensuring scalable, maintainable, and performant solutions. Team Management: Manage and mentor …
Employment Type: Full-time

VP of Engineering - Cloud, AI, and Microservices. London, United Kingdom (London)

London, UK
ESR Healthcare
with cloud-based systems, Microservices, and AI integrations. Oversee LLM (Large Language Model) integration, leveraging models like GPT, and implement advanced AI architectures such as Retrieval-Augmented Generation (RAG) and Reinforcement Learning with Human Feedback (RLHF). Lead the development of applications using React, .NET, Python, or Node.js, ensuring scalable, maintainable, and performant solutions. Team Management: Manage and mentor engineering managers …
Employment Type: Full-time

Head of Product

London Area, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're seeking a visionary Head of Product to lead our next phase of …

Head of Product

City of London, London, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're seeking a visionary Head of Product to lead our next phase of …

AI Tech Lead

South East, United Kingdom
Anson Mccade
Experience in proposal writing (e.g. DSTL, NS, EPSRC). A background in Aerospace, Defence or Public Sector consulting. Experience in one or more of: AI/ML for imagery, Reinforcement learning, NLP, LLM, Knowledge graphs, RF, Radar, Sonar or acoustics. AI Tech Lead key details: Up to £80k salary Up to 10% performance bonus. 2 Days a week …
Employment Type: Permanent

Senior Research Scientist (Data Science & ML)

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Anson Mccade
Senior Research Scientist (Data Science & ML) £60,000 - 70,000 GBP Hybrid WORKING Location: Chelmsford, East of England - United Kingdom Type: Permanent Data Science and Machine Learning Research Scientist Location: Chelmsford | Hybrid (2 days onsite) | Salary ~£70,000 + 10% bonus | SC Clearable A specialist technology consultancy is seeking a Data Science and Machine Learning Research Scientist to … help shape next-generation tools across areas such as multi-modal fusion, geospatial intelligence, and intelligent decision support. About the Opportunity You'll join a 35+ person AI & Machine Learning capability, collaborating across disciplines including computer vision, NLP, knowledge representation, and reinforcement learning. This hands-on role focuses on applied research and technical delivery, spanning early-stage concept … to proof-of-concept demonstrators for government R&D sponsors. The team operates at the intersection of data science, autonomy, cyber, and complex systems, blending statistical, symbolic, and machine learning approaches to address high-impact use cases. Key Responsibilities Conduct applied research in data science and machine learning, with a focus on real-world decision-making applications Design …
Employment Type: Permanent, Work From Home

Graphic Designer

London Area, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're looking for an exceptional Graphic Designer to join our growing team and … highly desirable. A passion for creating visually stunning and intuitive experiences. Bonus points if you have: Experience working in a fast-paced startup environment. Familiarity with AI or machine learning concepts. Experience in the retail or e-commerce sector. Why join us? This is a unique opportunity to join a stealth-mode company at the ground floor, working on …

Graphic Designer

City of London, London, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're looking for an exceptional Graphic Designer to join our growing team and … highly desirable. A passion for creating visually stunning and intuitive experiences. Bonus points if you have: Experience working in a fast-paced startup environment. Familiarity with AI or machine learning concepts. Experience in the retail or e-commerce sector. Why join us? This is a unique opportunity to join a stealth-mode company at the ground floor, working on …

Junior UI Designer (Material 3 Expressive)

City of London, London, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're looking for a talented Junior UI Designer to join our growing team … skills. A passion for creating visually stunning and intuitive digital experiences. Bonus points if you have: Experience working in a fast-paced startup environment. Familiarity with AI or machine learning concepts. Experience in the retail or e-commerce sector. Why join us? This is a unique opportunity to join a stealth-mode company at the ground floor, working on …

Junior UI Designer (Material 3 Expressive)

London Area, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We're looking for a talented Junior UI Designer to join our growing team … skills. A passion for creating visually stunning and intuitive digital experiences. Bonus points if you have: Experience working in a fast-paced startup environment. Familiarity with AI or machine learning concepts. Experience in the retail or e-commerce sector. Why join us? This is a unique opportunity to join a stealth-mode company at the ground floor, working on …

Senior Account Executive

London, United Kingdom
Anima
you'll help us extend our 3 existing product lines, which millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Do you want to save lives with every clinic that you onboard? Please note that as this role … to grow towards a senior executive role. Seeks to maximise combined team productivity, communicating the right things at the right time through the right channels. Intellectually curious and loves learning - able to tackle entirely novel challenges that lack prior precedent through first principles thinking, creatively using the right pragmatic approach, with an understanding of alternatives and trade-offs. We …
Employment Type: Permanent
Salary: GBP Annual

AI Research Scientist

London, United Kingdom
Antler
the freedom to propose and implement state-of-the-art infrastructure solutions. Exceptional Team: Collaborate with talented colleagues from diverse backgrounds across ML, bioinformatics, and engineering. Growth Opportunities: Continuous learning and growth opportunities in a rapidly advancing technical field. Culture Insight What we are doing is extremely hard. Prima Mente is for great people. We are team players who … PyTorch, JAX, TensorFlow) and familiarity with scalable training frameworks. Experience managing large-scale distributed training across GPUs/TPUs. Background in generative AI, large language models, mechanistic interpretability, or reinforcement learning. Interview Process Our interview process is hard from the beginning, so please do come prepared to show us your strongest self. Marie is based in SF and Hannah …
Employment Type: Permanent
Salary: GBP Annual

Head of CortAIx Lab UK

Reading, Berkshire, United Kingdom
Hybrid / WFH Options
Thales Group
AI is used appropriately for their problems and collaborate with the UK, European and International research ecosystems. You will provide specific leadership for AI R&D and innovation for reinforcement learning. With our ambition to develop and expand the team, we will expect you to identify new talent and partnering opportunities with industry and academia. The role is also … to mental wellbeing app Employee discount shopping schemes on major brands and retailers Development focused targets and relocation support can be provided if required along with a variety of learning, development and career enhancement opportunities. Thales is an equal opportunities employer and diversity and inclusion are integral to the success of Thales. In line with Thales' Baseline Security requirements …
Employment Type: Permanent
Salary: GBP Annual
Reinforcement Learning
10th Percentile: £75,000
25th Percentile: £86,250
Median: £100,000
75th Percentile: £175,000