1 to 25 of 79 Permanent Reinforcement Learning Jobs in the UK excluding London

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Deep Learning Researcher

Hiring Organisation
Microtech Global Ltd
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent, Work From Home
Deep Learning Researcher Cambridge/London About Us: MicroTECH-Global are working with a global leader in advanced computing, with a dedicated research team focused on applying artificial intelligence to next-generation semiconductor design and optimization. Role Overview: We're seeking a motivated Deep Learning Researcher with … strong background in machine learning, AI, or related fields. You'll contribute to innovative projects in areas such as large language models (LLMs), reinforcement learning, and optimization for chip design and AI system integration. Responsibilities: Conduct and publish cutting-edge AI/ML research Design algorithms for chip ...

Reader in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, South West, United Kingdom
Employment Type
Permanent
Salary
£55,000
Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning. … particularly keen to recruit in Natural Language Processing, Reinforcement Learning and/or AI Security. Appointments would be at the Reader level. We are a highly collaborative team, working not only with other researchers in our department, but across the university and beyond. We will offer you support ...

Lecturer / Senior Lecturer in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, South West, United Kingdom
Employment Type
Permanent
Salary
£70,000
Lecturer/Senior Lecturer in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine … learning. We are particularly keen to recruit in Natural Language Processing, Reinforcement Learning and/or AI Security. Appointments may be at Lecturer or Senior Lecturer. We are a highly collaborative team, working not only with other researchers in our department but across the university and beyond. ...

Reinforcement Learning RL control Engineer

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Machine Learning Researcher Statistics Python AI

Hiring Organisation
Client Server
Location
Cambridge, England, United Kingdom
Machine Learning Researcher (PhD Statistics Python AI R&D) Cambridge/WFH to £85k Are you a tech savvy, PhD educated, Machine Learning Researcher looking for an opportunity to work on complex and interesting systems at the cutting edge of AI technology? You could be progressing your career … that provides AI and ML products for automotive innovators to design better cars faster and achieve greater sustainability through Machine Learning. As a Machine Learning Researcher you will work fairly independently, developing your own research programme, with a view to developing new tools and techniques for probabilistic models, Bayesian ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Reader in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, United Kingdom
Employment Type
Permanent
Salary
GBP 55,000 Annual
Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning ...

Lecturer / Senior Lecturer in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, United Kingdom
Employment Type
Permanent
Salary
GBP 70,000 Annual
Lecturer/Senior Lecturer in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine … learning ...

Machine Learning Engineer

Hiring Organisation
Higher - AI recruitment
Location
Slough, Berkshire, UK
Employment Type
Full-time
partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers. This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded … sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Bradford, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Aberdeen, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Swindon, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Belfast, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Cardiff, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Stevenage, Hertfordshire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Lincoln, Lincolnshire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Maidstone, Kent, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Colchester, Essex, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Bournemouth, Dorset, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Cheltenham, Gloucestershire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...

Data Scientist - Peer‐to‐Peer Renewable Energy Trading Platform

Hiring Organisation
The Green Recruitment Company
Location
Slough, Berkshire, UK
Employment Type
Full-time
Responsibilities Modelling and Forecasting Develop time‐series models for generation, consumption, and market price forecasting. Build probabilistic and scenario‐based forecasting capabilities. Apply machine learning to optimise matching, pairing, and routing algorithms within the P2P marketplace. Trading and Optimisation Intelligence Create algorithms that optimise buyer–seller matching, pricing … learn, PyTorch/TensorFlow). Strong experience with time‐series modelling (ARIMA, Prophet, LSTMs or similar). Understanding of optimisation methods (linear, mixed‐integer, reinforcement learning desirable). Strong SQL and practical experience with production‐ready data pipelines. Experience working with cloud environments (AWS, GCP, or Azure). ...