Remote Reinforcement Learning Jobs in the UK

1 to 25 of 29 Remote Reinforcement Learning Jobs in the UK

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Locus Robotics
to sorting and pack-out, Locus Robotics empowers businesses to meet peak demands and adapt to ever-changing operational needs. Are you a Machine Learning Engineer with a passion for reinforcement learning, multi-agent systems, and simulation at scale? We want to hear from you! At Locus … tools and ML systems to optimize the behavior of large autonomous fleets in dynamic environments. In this role, you will work on cutting-edge reinforcement learning (RL) models, multi-agent systems, and faster-than-real-time simulations to drive innovation in logistics, robotics, and beyond. You'll collaborate … real-time modelling of 1,000+ autonomous agents for various use cases such as fleet optimization, logistics, or robotics. Develop, deploy, and maintain machine learning models, with a strong focus on reinforcement learning (RL) and multi-agent systems to optimize fleet behavior in dynamic environments. Implement and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Software Engineer

Surrey, United Kingdom
Hybrid / WFH Options
ECM Selection
experience will also be considered) - Full rights to work in the UK without limitation While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits … hybrid or remote work. Keywords: RSE, Scientific Modelling, R&D, Python, Julia, R, C, C++, C#, Fortran, Digital Twins, Agentic Systems, Probabilistic Programming, Machine Learning, Reinforcement Learning Another top job from ECM, the high-tech recruitment experts. Even if this job's not quite right, do contact More ❯
Employment Type: Permanent
Salary: £60,000 - £70,000
Posted:

Senior Data Scientist Data and Insights London Hybrid Remote

London, United Kingdom
Hybrid / WFH Options
loveholidays
Data Scientists, four Data Scientists, and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning … planning/prioritisation to delivery including monitoring and alerting Designing experiments and modelling to generate actionable insights and enhance business performance Proficient in machine learning and statistical methods for predictive modelling and forecasting Experience deploying ML models to production at scale Solid understanding of SQL Proficiency in unit testing … CI/CD, model management and experiment tracking Desirable Experience with Deep Learning, Generative AI and Reinforcement Learning Experience with Time Series Forecasting and Recommender Systems Previous experience working in e-commerce, retail, or the travel industry. Conducted and analysed large scale A/B experiments Experience More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI/ML Tech Lead

Manchester, UK
Hybrid / WFH Options
Airtime
engineers, product teams, and software engineers to integrate ML solutions into proprietary leading-edge fintech applications. Research and implement state-of-the-art deep learning, reinforcement learning, and transformer-based NLP models (e.g., BERT, GPT). Monitor model performance and security, retrain models as necessary, and optimise … adoption. Hire, mentor and support junior ML engineers, fostering engineering excellence, knowledge-sharing and best practices within the team. Requirements Strong background in Machine Learning, Deep Learning, and NLP Experience with transformer models (BERT, GPT, LLaMA, etc.) Proficiency in SQL, Python and ML libraries such as TensorFlow, PyTorch … accessible across the business Colleague Benefits Share options. 23 days annual leave, plus one for each year served (capped at 28). Birthday leave. Learning & development budget/time allocation Flexible start & finish hours 06:30 - 10:30 am Life assurance at 5x salary Health cash plan Virtual GP More ❯
Posted:

AI / ML Recruitment Consultant

United Kingdom
Hybrid / WFH Options
AT Talent - Rec2Rec
plug directly into the world's most impressive DeepTech businesses to place the TOP 1% across Robotics, Computer Vision, GenAI, Machine Learning, Deep Learning, Reinforcement Learning + the works. I could keep going, so let me skip to the logistics of the role More ❯
Posted:

AI Engineer

London, United Kingdom
Hybrid / WFH Options
Birdie
upskill our broader engineering teams. Build and maintain AI infrastructure leveraging the latest technology approaches: Investigate cutting-edge approaches like generative AI or reinforcement learning to drive adoption and impact. Ensure seamless integration of AI into existing platform and customer-facing processes. We'd love to hear … from you if you have: At least 2 years of experience working in an AI engineering or machine learning role supporting customer-facing processes and infrastructure. Experience working in a cross functional product engineering squad with deep interest in understanding and solving end user problems. Proven expertise in building … scalable machine learning models and deploying them into production environments. Passionate about enabling others with tools, frameworks, and documentation, ensuring teams can independently leverage AI without creating bottlenecks. A technical leader who stays curious about emerging technologies (e.g., generative AI, reinforcement learning) and rapidly evaluates their pros More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Specialist, Applied AI Engineer

London, United Kingdom
Hybrid / WFH Options
Industrial and Financial Systems
and iterate AI solutions in collaboration with customers and use case leads is a must. You will be responsible for applying generative AI techniques, reinforcement learning, and multi-agent systems to solve real-world industrial challenges. This role demands a high level of creativity in developing new AI … and optimization of AI models to ensure reliability, safety, and regulatory compliance in industrial settings. Stay ahead of the latest advancements in generative AI, reinforcement learning, and multi-agent systems to drive innovation within the company. Qualifications Master's degree in Data Science. Thought leader and expert in applied More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Software Engineer

Guildford, Surrey, United Kingdom
Hybrid / WFH Options
ECM Selection
experience will also be considered) Full rights to work in the UK without limitation While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer, Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify AB
environment. What You'll Do: Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are: 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer - Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify
environment. What You'll Do Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning (ML) Engineer II

Uxbridge, Middlesex, United Kingdom
Hybrid / WFH Options
NLP PEOPLE
office is open plan, and you will be working in an innovative and collaborative environment with your international peers and colleagues. As the Machine Learning (ML) Engineer II, you will be responsible for developing AI Labs' machine learning platform and creating production-ready AI-based solutions for key … guidance to other parts of the business. Key Accountabilities: ML and Natural Language Processing (NLP) Technology: • Leverage proprietary technology stack to build custom machine learning models • Design, implement, and document new ML/NLP modeling techniques and strategies • Develop Back-end/server-side software to support AI solution … development and serving • Build internal frameworks, libraries, and infrastructure to improve machine learning and NLP capabilities to allow for rapid prototyping and new product delivery • Review and adapt recent research in ML and NLP into prototypes and production solutions • Review and improve the code of other engineers to enhance More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Manager, Research Science (Foundational Research, Machine Learning)

London, United Kingdom
Hybrid / WFH Options
Thomson Reuters
Are you a curious and open-minded individual with an interest in leading teams conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientist Managers with a passion for leading teams and solving challenging machine learning problems in a data-rich, complex … academic environment driven by real-world problems. Foundational Research is the dedicated core Machine Learning research division of Thomson Reuters. We are focused on research and development, with a particular focus on advanced algorithms and training techniques for Large Language Models (LLMs). We are building a strong foundation … sleeves and participate in designing, coding, conducting experiments, and translating findings into concrete deliverables. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning & reasoning (e.g. LLMs & Knowledge Graphs, Advanced Prompting, CoT pipelines, RAG, Tool More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist

Birmingham, West Midlands, United Kingdom
Hybrid / WFH Options
Investigo
Scientist Salary: £50,000 - £57,000 Location: Birmingham - Hybrid working 1-2 days a month on site About You: Technical Skills: Expertise in machine learning (supervised, unsupervised, reinforcement learning). Strong proficiency in Databricks, Azure ML, Azure Data Factory, and SQL. Hands-on experience in Python, R … and experience with deep learning frameworks such as TensorFlow, PyTorch, or Scikit-learn. Expert-level experience in Power BI for advanced visualisations, ML model interpretation, and KPI tracking. Deep knowledge of feature engineering, model deployment, and MLOps best practices. Experience with big data processing (Spark, Hadoop) and cloud-based More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist

Birmingham, West Midlands, West Midlands (County), United Kingdom
Hybrid / WFH Options
Investigo
Scientist Salary: £50,000 - £57,000 Location: Birmingham - Hybrid working 1-2 days a month on site About You: Technical Skills: Expertise in machine learning (supervised, unsupervised, reinforcement learning). Strong proficiency in Databricks, Azure ML, Azure Data Factory, and SQL. Hands-on experience in Python, R … and experience with deep learning frameworks such as TensorFlow, PyTorch, or Scikit-learn. Expert-level experience in Power BI for advanced visualisations, ML model interpretation, and KPI tracking. Deep knowledge of feature engineering, model deployment, and MLOps best practices. Experience with big data processing (Spark, Hadoop) and cloud-based More ❯
Employment Type: Permanent
Salary: £50,000 - £57,000 per annum
Posted:

Applied Scientist, EU STEP Science and Tech

London, United Kingdom
Hybrid / WFH Options
Amazon
options in Barcelona, Berlin, or London, designed to maximize team interaction. Remote work options are also considered. Basic Qualifications PhD in Operations Research, Machine Learning, Statistics, Applied Mathematics, Computer Science, or related fields, or equivalent experience. Excellent written and verbal communication skills. Experience programming in Java, Python, C++, or … in a fast-paced applied research environment. Ability to handle ambiguity. Top-tier publications in relevant fields. Key Responsibilities Solve complex optimization and machine learning problems using scalable algorithms. Design and develop prototypes addressing real-world logistics challenges. Lead analyses to support decision-making and communicate results to leadership. … optimizing Amazon's logistics through advanced mathematical algorithms and data-driven techniques. We work closely with academic experts, employ cutting-edge methods like deep learning and reinforcement learning, and prioritize correct modeling and production-ready prototypes. Our goal is to support strategic investments and solve real-world More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Machine Learning Developer

United Kingdom
Hybrid / WFH Options
Capgemini
LLMs and GenAI. • Perform feature engineering and selection to optimize model performance. • Select and implement appropriate AI/ML algorithms, including supervised, unsupervised, and reinforcement learning models. • Train, evaluate, and optimize models using machine learning techniques. • Deploy models to production environments, ensuring robustness and scalability. • Monitor model … in AI/ML development. • Strong understanding of AI/ML algorithms and techniques, including LLMs, GenAI, and automated AI systems. • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch). • Proficiency in Python, R, or other relevant programming languages. • Experience with data analysis and visualization tools (e.g., Matplotlib, Seaborn … invested in wellbeing apps such as Thrive and Peppy. You will be empowered to explore, innovate, and progress. You will benefit from Capgemini's 'learning for life' mindset, meaning you will have countless training and development opportunities from think tanks to hackathons, and access to 250,000 courses with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist - AI & ML

Chelmsford, Essex, United Kingdom
Hybrid / WFH Options
BAE Systems Applied Intelligence
recruit a team lead for our Knowledge and Semantics team. The Data and Decision Support Capability has a diverse range of teams working in reinforcement learning, NLP, knowledge graphs, applications of LLMs, computer vision, AI for RF and EW, sonar and acoustics. You will have the opportunity to … You will have experience of leading a team of researchers working in NLP, LLM applications, text-mining, knowledge graphs and/or graph machine learning and with a vision on how to develop solutions for practical applications of ML in these domains. You will have a proven record of … successful proposal writing and delivery to funding bodies relevant to the Defence sector. You should have existing skills in Machine Learning (ML), will need to be a proficient programmer in Python, with extensive experience in the use of libraries and toolboxes to support efficient development. Typical Responsibilities: Propose and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Algorithm Developer

Stevenage, Hertfordshire, United Kingdom
Hybrid / WFH Options
MBDA Missile Systems
skills Relevant experience (Post-Doctoral or industrial) in robotics, data fusion, tracking/estimation, pattern discovery & recognition, statistical inference, optimisation and machine/deep learning algorithms along with real-time implementation, and/or validation & verification is a strong advantage Experience in Matlab, Simulink and/or Stateflow, Python … filters, grid-based estimation methods, Multi-Object-Multi-Sensor Fusion, data-association, random finite sets, Bayesian belief networks, Dempster-Shafer theory of evidence Machine Learning for regression and pattern recognition/discovery problems e.g. Gaussian processes, latent variable methods, support vector machines, probabilistic/statistical models, neural networks, Bayesian … inference, random forests, novelty detection, clustering Deep Learning e.g. deep reinforcement learning, Monte-Carlo tree search, deep regression/classification, deep embeddings, recurrent networks, natural language processing Computer Vision algorithms e.g. structure from motion, image-based navigation, SLAM, pose estimation/recovery To be considered for the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist: Knowledge and Semantics

United Kingdom
Hybrid / WFH Options
BAE Systems (New)
recruit a team lead for our Knowledge and Semantics team. The Data and Decision Support Capability has a diverse range of teams working in: reinforcement learning, NLP, knowledge graphs, applications of LLMs, computer vision, AI for RF and EW, sonar and acoustics. You will have the opportunity to … You will have experience of leading a team of researchers working in NLP, LLM applications, text-mining, knowledge graphs and/or graph machine learning and with a vision on how to develop solutions for practical applications of ML in these domains. You will have a proven record of … successful proposal writing and delivery to funding bodies relevant to the Defence sector. You should have existing skills in Machine Learning (ML), will need to be a proficient programmer in Python, with extensive experience in the use of libraries and toolboxes to support efficient development. Candidates will also have More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist - AI & ML

Chelmsford, Essex, United Kingdom
Hybrid / WFH Options
BAE Systems (New)
recruit a team lead for our Knowledge and Semantics team. The Data and Decision Support Capability has a diverse range of teams working in: reinforcement learning, NLP, knowledge graphs, applications of LLMs, computer vision, AI for RF and EW, sonar and acoustics. You will have the opportunity to … You will have experience of leading a team of researchers working in NLP, LLM applications, text-mining, knowledge graphs and/or graph machine learning and with a vision on how to develop solutions for practical applications of ML in these domains. You will have a proven record of … successful proposal writing and delivery to funding bodies relevant to the Defence sector. You should have existing skills in Machine Learning (ML), will need to be a proficient programmer in Python, with extensive experience in the use of libraries and toolboxes to support efficient development. Candidates will also have More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist - Knowledge & Semantics

Chelmsford, Essex
Hybrid / WFH Options
BAE Systems
recruit a team lead for our Knowledge and Semantics team. The Data and Decision Support Capability has a diverse range of teams working in: reinforcement learning, NLP, knowledge graphs, applications of LLMs, computer vision, AI for RF and EW, sonar and acoustics. You will have the opportunity to … You will have experience of leading a team of researchers working in NLP, LLM applications, text-mining, knowledge graphs and/or graph machine learning and with a vision on how to develop solutions for practical applications of ML in these domains. You will have a proven record of … successful proposal writing and delivery to funding bodies relevant to the Defence sector. You should have existing skills in Machine Learning (ML), will need to be a proficient programmer in Python, with extensive experience in the use of libraries and toolboxes to support efficient development. Candidates will also have More ❯
Employment Type: Permanent
Posted:

Data Science Manager (Valuations & Pricing)

Hook, Hampshire, United Kingdom
Hybrid / WFH Options
360 Resourcing Solutions
hands-on, working with our Senior Staff Engineer to provide technical leadership to the team. You'll have great technical understanding of various machine learning approaches, which would include regression methods, probabilistic pricing models, tree based gradient boosting approaches, information theory, neural networks, transfer learning, etc. While your … our experienced MLOps and Data Engineering teams, allowing you more time to focus on model development, research and the implementation of new novel machine learning approaches. Key Responsibilities: Lead the Valuations & Pricing Team ensuring the on-going improvements in KPIs associated with the team's products. Manage, coach and … agreeing priorities with stakeholders and the Head of Data Science and enable the team to deliver against agreed targets. Work closely with the Machine Learning Operations team to deliver products to end users, leveraging and contributing to established engineering standards and best practices. Work collaboratively with the Insight & Analytics More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Head of Applied AI

London Area, United Kingdom
Hybrid / WFH Options
Acquired Talent Ltd
far richer, more capable intelligent behaviours. This is a unique opportunity to help build a dedicated AI function focused on embedding cutting-edge multimodal learning models into next-gen robotic platforms. The role will involve defining the long-term roadmap for how intelligence is deployed across the stack, from … successful in this role, you’ll need to bring: Strong technical leadership across applied AI/ML, with deep hands-on experience in robotic learning or embodied intelligence A solid background in multimodal model development—especially in areas that combine computer vision, language understanding, and interactive learning (LLM … VLM or VLA) Real-world deployment experience of learning-based systems, ideally within robotic or physical environments (embodied systems (AI) & reinforcement learning) Comfort collaborating across functions, especially with engineering, hardware, and system design teams Solid programming and prototyping skills using modern deep learning frameworks (e.g. PyTorch More ❯
Posted:

Head of Applied AI

London, South East England, United Kingdom
Hybrid / WFH Options
Acquired Talent Ltd
far richer, more capable intelligent behaviours. This is a unique opportunity to help build a dedicated AI function focused on embedding cutting-edge multimodal learning models into next-gen robotic platforms. The role will involve defining the long-term roadmap for how intelligence is deployed across the stack, from … successful in this role, you’ll need to bring: Strong technical leadership across applied AI/ML, with deep hands-on experience in robotic learning or embodied intelligence A solid background in multimodal model development—especially in areas that combine computer vision, language understanding, and interactive learning (LLM … VLM or VLA) Real-world deployment experience of learning-based systems, ideally within robotic or physical environments (embodied systems (AI) & reinforcement learning) Comfort collaborating across functions, especially with engineering, hardware, and system design teams Solid programming and prototyping skills using modern deep learning frameworks (e.g. PyTorch More ❯
Posted:

Data Scientist

Bristol, Avon, South West, United Kingdom
Hybrid / WFH Options
Hargreaves Lansdown Asset Management Limited
improvement, leveraging your existing skills while also expanding your expertise across diverse projects and technologies. What you'll be doing Develop and implement machine learning products in the cloud that improve client experiences and boost colleague productivity in real-time. Continuously monitor, retrain, and optimise existing models to ensure … have strong commercial experience in data, with a proven track record of solving complex problems and delivering tangible results. Experience building and deploying machine learning solutions in the cloud (AWS experience preferred). Strong proficiency in coding (Python) and SQL (Snowflake). Excellent stakeholder management skills, capable of building … focus on scalability and reliability. Experience with large-scale data analysis, manipulation, and distributed computing platforms (e.g., Hive, Hadoop). Familiarity with advanced machine learning methods, including neural networks, reinforcement learning, and other cutting-edge Gen AI approaches. Skilled in API development and deployment, with a focus More ❯
Employment Type: Permanent, Part Time, Work From Home
Posted:

Reinforcement Learning
10th Percentile: £76,727
25th Percentile: £90,801
Median: £150,000
75th Percentile: £175,000