1 to 25 of 55 Permanent Reinforcement Learning Jobs in the UK excluding London

AI Engineer

Hiring Organisation
DXC Technology
Location
Bishopton, Renfrewshire, Scotland, United Kingdom
Employment Type
Permanent
advanced prompt engineering strategies Leverage Retrieval-Augmented Generation (RAG) for enhanced contextual performance Build intelligent agents using frameworks like LangChain, LlamaIndex, CrewAI, AutoGen Apply reinforcement learning techniques including Q-learning , policy gradients , and RLlib Collaborate with cross-functional teams to integrate AI solutions into scalable products Ensure … background in fine-tuning and prompt engineering Hands-on experience with RAG pipelines Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen) Solid understanding of reinforcement learning concepts and tools (Q-learning, policy gradients, RLlib) Azure AI Engineer Associate certification (or willingness to obtain) Bachelor's degree ...

AI Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £85,000 per annum
currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning ...

DevOps Engineer

Hiring Organisation
Matchtech
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 per annum, Negotiable
DevOps Engineer - Reinforcement Learning Platforms We are seeking an experienced DevOps Engineer to help build and scale a web-based platform for reinforcement learning (RL) training and RLOps. You will design, implement, and maintain the cloud infrastructure, CI/CD pipelines, and deployment systems that support … solving and communication skills Compensation & Benefits * Stock options* 30 days' holiday plus bank holidays* Flexible and remote working options* Enhanced parental leave* £500 annual learning and development budget* Pension scheme* Regular socials and quarterly gatherings* Bike-to-Work scheme ...

Principal Data Scientist

Hiring Organisation
Sky
Location
Middlesex, south east england, united kingdom
Experience building and deploying advanced analytics solutions in a large scale (preferably B2C) cloud environmen t Experience in deploying commercially viable applications using deep learning techniques combining structured and unstructured data is highly desirable Ability to quickly understand a business objective, problem solving to create an analytical solution …/or data analysis e.g. Python, Tensorflow (essential) Database experience, preferably SQL (essential) Expertise in cutting-edge AI methodologies, including Generative AI and Reinforcement Learning Machine learning - Supervised/unsupervised learning, regression, decision trees, random forests, boosting, clustering (essential) The rewards There's one thing people ...

Senior Machine Learning Researcher - MSR AI for Science

Hiring Organisation
Microsoft
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
Overview At Microsoft Research AI for Science , we believe machine learning and artificial intelligence has the potential to transform scientific modelling and discovery crucial for solving the most pressing problems facing society including sustainable materials and discovery of new drugs. We seek a highly motivated ML Researchers to join … large ML and LLM algorithms to accelerate the discovery of small molecule drugs and materials. Our team encompasses people from multiple disciplines across machine learning, engineering, and the natural sciences, who work together closely on well-defined and challenging goals. If you have strong machine learning expertise ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Senior Researcher in Machine Learning: People-Centric AI

Hiring Organisation
Microsoft
Location
Cambridge, England, United Kingdom
Overview We are seeking Senior Machine Learning Researcher candidates for our research in the area of People-Centric AI at Microsoft Research Cambridge (UK). The successful candidate will be responsible for pushing the state of the art in machine learning to enable human agency and skill, support … creativity and collaboration, and ensure equitable representation and participation. Key machine learning challenges that we aim to address include, but are not limited to, human-in-the-loop learning, uncertainty quantification, value alignment, interpretability, fairness and bias mitigation, as well as related areas. People-Centric ...

Senior AI/ML Performance Engineer

Hiring Organisation
Google
Location
Slough, Berkshire, UK
Employment Type
Full-time
languages. 3 years of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field. 3 years of experience with ML infrastructure (e.g., model deployment … Python. Experience with an emphasis on algorithms, systems and tools for ML performance projections and evaluation. Experience designing or implementing components of a Deep Learning Compiler Stack (e.g., XLA, MLIR, TVM, ONNX Runtime). Experience in low-latency systems programming (e.g., C/C++) and optimizing data movement across ...

Senior Data Scientist

Hiring Organisation
La Fosse
Location
Slough, Berkshire, UK
Employment Type
Full-time
preparing for an international launch, we're now looking for a Senior Data Scientist. What you'll be doing: Build, deploy, and scale machine learning systems that forecast demand, optimise staffing, and improve operational performance across thousands of venues. Lead projects end-to-end, from data design and modelling … Confident working in AWS or similar cloud environments (SageMaker, Lambda, Docker, etc.). Experienced in (or eager to explore) areas such as forecasting, optimisation, reinforcement learning, generative AI, or computer vision. Solid engineering mindset, you know how to take models from research to production and keep them running ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Aberdeen, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Swindon, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Coventry, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Belfast, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Southampton, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Leicester, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Sheffield, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Bradford, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Edinburgh, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Glasgow, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Leeds, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Bristol, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Slough, Berkshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Gloucester, Gloucestershire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...