24 of 24 Reinforcement Learning Jobs in the UK

Artificial Intelligence Researcher

Hiring Organisation
Cubiq Recruitment
Location
City of London, London, United Kingdom
Robot Learning/Embodied AI We’re partnering with a venture-backed robotics startup building systems that allow humans to extend their physical capabilities through intelligent robotic platforms. The company recently secured new funding and is expanding its AI research team to develop learning systems for dexterous manipulation … real-world robotic autonomy. This role sits at the intersection of robot learning, multimodal models, and real-world deployment. The focus is not simulation research alone. The work is about taking cutting-edge robot learning approaches and making them function reliably on physical systems. Why this opportunity ...

Artificial Intelligence Engineer

Hiring Organisation
WorkGenius Group
Location
City of London, London, United Kingdom
Role: Full-time (Permanent Role) We are building a world-class AI research team focused on advancing next-generation agentic systems and intent-aware learning architectures. Our mission is to bridge cutting-edge research in large language models, reinforcement learning, and alignment with scalable, real-world production … systems. You will operate at the intersection of research and product, shaping foundational capabilities in intent understanding, agent learning, and model alignment across distributed AI environments. This is an opportunity to influence AI systems deployed at global scale across diverse compute environments including edge and cloud. Responsibilities Define Research ...

Applied AI Research Engineer - £300k + bens - London

Hiring Organisation
Transparent Technology
Location
City of London, London, United Kingdom
Employment Type
Permanent
success. What You'll Do * Design and implement state-of-the-art instruction tuning methods * Fine-tune and deploy LLMs in production environments * Apply reinforcement learning techniques (SFT, PPO, DPO, GRPO) * Run hands-on experimentation to outperform closed-source models * Break down ambiguous research ideas into structured roadmaps … speech systems. Ideal Background * 5-7+ years in applied AI/ML (exceptional 3+ years considered) * Deep experience in fine-tuning + reinforcement learning * Experience shipping ML systems from research into production * Open-source LLM experience essential * Product-driven engineering mindset (Apple, LinkedIn, Amazon style environments ideal ...

Senior Machine Learning Engineer

Hiring Organisation
OJ Digital
Location
Greater London, England, United Kingdom
Senior Machine Learning Engineer The Role We’re hiring a Senior or Staff ML Research Engineer to join a high growth AI company building advanced proprietary language models that power real world products at scale. This business has strong product market fit and significant enterprise adoption. A large proportion … Design and implement state of the art instruction tuning and information retrieval methods Fine tune and deploy large open source LLMs in production Apply reinforcement learning approaches including SFT, DPO, PPO and GRPO Develop models that outperform closed source alternatives Break down ambiguous research ideas into structured technical ...

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
United Kingdom
overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Group Relative Policy Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

Data Scientist

Hiring Organisation
Synergetic
Location
City of London, London, United Kingdom
with software engineering capabilities to build end-to-end AI solutions. The ideal candidate will have a strong foundation in both developing sophisticated machine learning models and implementing them within production systems. You will work closely with cross-functional teams to transform concepts into scalable AI-powered products. … looking for candidates that can combine technical expertise with a true consulting approach. Responsibilities Design, develop, and implement advanced machine learning models and AI capabilities Build and maintain knowledge graphs and causal inference systems Create probabilistic models to address complex business problems Scale AI solutions from proof-of-concept ...

AI Engineer

Hiring Organisation
Akixi
Location
United Kingdom
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
United Kingdom
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Algorithms Engineer

Hiring Organisation
SeeByte
Location
Edinburgh, Scotland, United Kingdom
mathematics, physics, engineering, or a related field (e.g., BSc/MSc/PhD). Solid understanding of statistics, linear algebra, probability, optimisation, and machine learning techniques. Experience with algorithm development, numerical methods, or physical simulation. Proficiency in at least one programming language commonly used for scientific computing and prototyping … software development. Ability to clearly explain complex technical concepts, both verbally and in writing, to technical and non-technical audiences. Desirable Experience Developing machine learning models, especially in the context of reinforcement learning or computer vision. Robotics or autonomous systems (e.g., path-planning problems ...

Software Engineer (Applied AI)

Hiring Organisation
Euphoric
Location
United Kingdom
iteration of our next-generation benefits platform features that leverage personalization, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript … against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

Senior Data Scientist

Hiring Organisation
Vitality
Location
City of London, London, United Kingdom
Full time, 37.5 hours per week. We are happy to discuss flexible working! Top 3 skills needed for this role: Deep Expertise in Machine Learning, Data Science & Technical Tooling Strategic Project Leadership & Business Impact Delivery High Level Stakeholder Engagement & Communication What this role is all about: Vitality is entering … members live healthier, happier, longer lives. As a Senior Data Scientist, you will play a pivotal role in designing, building, and executing advanced machine learning and AI solutions that sit at the heart of Vitality’s transformation. Your work will help shape the next generation of personalised health insurance ...

Senior Data Scientist

Hiring Organisation
Vitality Corporate Services Limited - Tech
Location
Central London, London, United Kingdom
Employment Type
Permanent
Salary
£95,000
Office. Full time, 37.5 hours per week. We are happy to discuss flexible working! Top 3 skills needed for this role: Deep Expertise in Machine Learning, Data Science & Technical Tooling Strategic Project Leadership & Business Impact Delivery High Level Stakeholder Engagement & Communication What this role is all about: Vitality is entering … members live healthier, happier, longer lives. As a Senior Data Scientist, you will play a pivotal role in designing, building, and executing advanced machine learning and AI solutions that sit at the heart of Vitality’s transformation. Your work will help shape the next generation of personalised health insurance ...

Senior Data Scientist

Hiring Organisation
Vitality
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
Full time, 37.5 hours per week. We are happy to discuss flexible working! Top 3 skills needed for this role: Deep Expertise in Machine Learning, Data Science & Technical Tooling Strategic Project Leadership & Business Impact Delivery High Level Stakeholder Engagement & Communication What this role is all about: Vitality is entering … members live healthier, happier, longer lives. As a Senior Data Scientist, you will play a pivotal role in designing, building, and executing advanced machine learning and AI solutions that sit at the heart of Vitality’s transformation. Your work will help shape the next generation of personalised health insurance ...

AI Engineer

Hiring Organisation
DXC
Location
City of London, London, United Kingdom
Employment Type
Permanent
data pipelines and infrastructure. Partnering with cross-functional teams to understand data needs and shape solutions. Contributing to data quality, governance, and security initiatives. Learning directly from specialists in AI and data engineering. Helping to continuously improve and optimise data processes. Staying current with emerging tools, trends, and technologies. … Mistral, Claude). Skills in fine-tuning, prompt engineering, and building RAG pipelines. Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen). Knowledge of reinforcement learning methods or tools (Q-learning, policy gradients, RLlib). Why Join Us? Work on AI solutions that make a meaningful impact ...

Software Engineer (Numerical Modelling, AI/ML, C++/Python)

Hiring Organisation
Hays
Location
Guildford, Surrey, South East, United Kingdom
Employment Type
Permanent
level scientific language (e.g., Python, Julia). We are particularly looking at experience with scientific computing, numerical methods, or computational modelling. Desirable: machine learning, optimization, control, probabilistic modelling, or related fields. Familiarity with digital twins, agentic systems, reinforcement learning or advanced optimisation. What ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

Leadership Development Programs Architect

Hiring Organisation
SoftServe
Location
United Kingdom
across both internal program design and external consulting environments AND YOU WANT TO Design end-to-end leadership development journeys, including modules, coaching, action learning, and reinforcement Integrate multiple learning modalities – workshops, simulations, peer learning, coaching, and experiential learning Build scalable program foundations that … cascaded to managers and directors Define clear learning objectives, success criteria, and differentiation across leadership levels Partner with SMEs and external providers for content development and validation Collaborate closely with the CEO, CPO, Leadership Team, business leaders, and Strategic HRBPs to align program goals Advise senior leaders on learning ...

AI Simulation and Control Engineer (up to £125k + equity)

Hiring Organisation
Optimal Agriculture
Location
Greater London, England, United Kingdom
performance of Optimal's AI to maximise crop yields and minimise resource consumption, working closely with our agronomy experts. Technical skills Experience training Machine Learning models. Strong background in at least one of Machine Learning, Optimisation, Control (Model Predictive Control, Optimal Control, and classical feedback techniques), Reinforcement Learning, Physics Modelling and Numerical Simulation. Software engineering in Python (Julia is a bonus) Software engineering processes and tools (containers, version control, deployments etc) AI coding Compensation Salary: £70k – £125k Equity: 0.5% – 5.0 ...

Research Engineer (Agents)

Hiring Organisation
Native
Location
City of London, London, United Kingdom
structured data representations (tables, graphs, schemas) Build training, simulation, and evaluation environments for long-horizon, multi-step agent behavior Develop self-supervised and reinforcement learning objectives for improving agent reliability and correctness Integrate foundation model embeddings and symbolic components into agent workflows Own agent systems ...

Senior Data Engineer

Hiring Organisation
develop
Location
City of London, London, United Kingdom
industry standards AI Enablement & Data Serving Build high-quality datasets for retrieval pipelines (RAG), embeddings and conversational agents Create data foundations supporting decision engines, reinforcement learning and value measurement Partner with AI engineers to operationalise pipelines for LLM workflows and agentic systems Standards, Documentation & Reusability Produce clear documentation ...

Head of Decision Science Consulting, UK based

Hiring Organisation
Staffworx Limited
Location
Central London, London, United Kingdom
Employment Type
Permanent
models in live business decision environments. Strong fluency in Python, modern ML tools, and decision optimisation frameworks. Deep understanding of statistical modelling and machine learning with experience deploying models into production-scale systems. Beneficial: pricing and revenue optimisation, forecasting and supply chain, risk and fraud modelling, reinforcement learning ...

AI Systems Architect - Studio Co-Founder

Hiring Organisation
Skillsearch Ltd
Location
London Area, United Kingdom
company as part of the founding team. Technical Requirements Strong background in software engineering and system architecture. Experience designing and deploying AI or machine learning systems in production environments. Experience building scalable backend systems and integrating AI services into applications. Experience working with or integrating systems into Unity … with personalisation systems, recommendation engines or player behaviour modelling. Experience working in early-stage startups or building products from zero. Experience with generative AI, reinforcement learning or adaptive gameplay systems. Experience designing developer tools or AI frameworks used by product teams. What You Will Help Build ...

Data Scientist - Inside IR35 - Hybrid

Hiring Organisation
Halian Technology Limited
Location
Croydon, Surrey, South East, United Kingdom
Employment Type
Contract
Role We are recruiting on behalf of a mobility technology business building intelligent fleet orchestration systems. This role suits an experienced Applied Machine Learning Engineer or Data Scientist comfortable working with messy real-world data, operational constraints, and production systems. You'll join a small, high-calibre team solving complex … years Geospatial data experience (H3, GeoPandas, PostGIS or similar) Optimisation/operations research exposure Logistics/mobility/marketplace domain experience Nice to Have Reinforcement learning Simulation modelling Experience deploying models into cloud environments Experimentation frameworks (A/B testing, model validation at scale) How to Apply ...