11 of 11 Permanent Reinforcement Learning Jobs in the UK excluding London

AI Engineer Lead

Hiring Organisation: Millennium
Location: Slough, Berkshire, UK
Employment Type: Full-time

execution of trades and modeling and building simulations of risk, to name a few. The AI/ML work spans usage of LLMs, deep learning, reinforcement learning and other ML techniques. The work also requires building AI/ML infrastructure for data wrangling, building feature stores, building … frameworks for deployments, measurements, re-training, etc. We are seeking an AI Engineering Lead who has experience with LLMs, deep learning, reinforcement learning, as well as an interest and ability to keep up with advancements in AI/ML. The candidate should also be strong in software ...

AI Engineer

Hiring Organisation: DXC Technology
Location: Bishopton, Renfrewshire, Scotland, United Kingdom
Employment Type: Permanent

advanced prompt engineering strategies Leverage Retrieval-Augmented Generation (RAG) for enhanced contextual performance Build intelligent agents using frameworks like LangChain, LlamaIndex, CrewAI, AutoGen Apply reinforcement learning techniques including Q-learning , policy gradients , and RLlib Collaborate with cross-functional teams to integrate AI solutions into scalable products Ensure … background in fine-tuning and prompt engineering Hands-on experience with RAG pipelines Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen) Solid understanding of reinforcement learning concepts and tools (Q-learning, policy gradients, RLlib) Azure AI Engineer Associate certification (or willingness to obtain) Bachelor's degree ...

AI Engineer

Hiring Organisation: Harnham - Data & Analytics Recruitment
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £85,000 per annum

currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning ...

DevOps Engineer

Hiring Organisation: Matchtech
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £85,000 per annum, Negotiable

DevOps Engineer - Reinforcement Learning Platforms We are seeking an experienced DevOps Engineer to help build and scale a web-based platform for reinforcement learning (RL) training and RLOps. You will design, implement, and maintain the cloud infrastructure, CI/CD pipelines, and deployment systems that support … solving and communication skills Compensation & Benefits * Stock options* 30 days' holiday plus bank holidays* Flexible and remote working options* Enhanced parental leave* £500 annual learning and development budget* Pension scheme* Regular socials and quarterly gatherings* Bike-to-Work scheme ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation: IT Graduate Recruitment
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £45,000 - £75,000 per annum, OTE

Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Data Science Manager

Hiring Organisation: Zilch
Location: Slough, Berkshire, UK
Employment Type: Full-time

looking for an exceptional Data Science Manager to lead and grow our data science capability, driving the development, deployment, and optimisation of machine-learning solutions that support Zilch's mission to deliver responsible, real-time and seamless credit experiences. This is a leadership role that involves line management … setting a high bar for technical and delivery excellence. Bonus Skills. Experience with modern data stack tools (DBT, Snowflake, Looker). Exposure to deep learning, LLMs, or advanced ML techniques relevant to fintech. Experience developing near-real-time scoring pipelines or streaming-based ML systems. Interest in emerging technologies ...

Artificial Intelligence Engineer

Hiring Organisation: Lola Cars
Location: silverstone, midlands, united kingdom

strategies, and advance driver safety in the high-speed world of motorsports. The ideal candidate will have a strong background in AI and machine learning, with a keen interest in applying these technologies to revolutionize racing. Role Responsibilities: Develop and implement AI models to analyse real-time and historical … improving vehicle performance and strategic decision-making. Collaborate with engineering teams to integrate AI-driven solutions into vehicle design and race operations. Utilize machine learning algorithms to predict race outcomes, optimize pit stop strategies, and enhance driver performance. Stay abreast of the latest advancements in AI and motorsport technologies ...

Senior Data Scientist

Hiring Organisation: La Fosse
Location: Slough, Berkshire, UK
Employment Type: Full-time

preparing for an international launch, we're now looking for a Senior Data Scientist. What you'll be doing: Build, deploy, and scale machine learning systems that forecast demand, optimise staffing, and improve operational performance across thousands of venues. Lead projects end-to-end, from data design and modelling … Confident working in AWS or similar cloud environments (SageMaker, Lambda, Docker, etc.). Experienced in (or eager to explore) areas such as forecasting, optimisation, reinforcement learning, generative AI, or computer vision. Solid engineering mindset, you know how to take models from research to production and keep them running ...

Graduate AI Analyst / Engineer

Hiring Organisation: Kingsgate Recruitment Ltd
Location: Manchester, Lancashire, England, United Kingdom
Employment Type: Full-Time
Salary: £24,000 - £28,000 per annum

Shape the Future with AI — Start Your Career Here Are you fascinated by artificial intelligence, machine learning, and data-driven innovation? Are you excited by the potential of AI to solve real-world problems and build smarter systems? We’re looking for an ambitious and curious Graduate AI Analyst … Data Science team. This is an ideal role for a recent graduate who wants to kickstart their career by working on real AI projects, learning from experts, and gaining hands-on experience with modern AI tools, models, and data pipelines. Whether you studied Computer Science, Engineering, Maths, Data Science ...

Head of Data Science

Hiring Organisation: Enstar Group
Location: Slough, Berkshire, UK
Employment Type: Full-time

analytics to drive measurable value. What you will be doing: Data Science and Analytics Lead the design and delivery of advanced analytical and Machine Learning solutions across different business functions; for example; predictive reserving, claims forecasting, capital optimisation, portfolio risk, and operational efficiency, leveraging end-to-end ML architectures … spanning supervised, unsupervised, and reinforcement learning. Combine statistical methods with modern ML techniques such as gradient boosting (XGBoost, LightGBM), ensemble methods, deep learning (CNN, RNN, LSTM), and generative models (GAN, VAE) to enhance predictive accuracy, interpretability, and automation. Engineer scalable analytical frameworks and reusable ML assets, integrating Python ...

Head of Vision - Language - Action (VLA) Development - Manipulation

Hiring Organisation: Shaw Daniels Solutions
Location: Slough, Berkshire, UK
Employment Type: Full-time

automation system designed to deliver efficient, adaptable performance across real-world environments, beginning with industrial applications. Responsibilities Lead the strategy and execution of representation learning, behaviour cloning, and reinforcement learning programs. Drive large-scale post-training for multi-modal LLM/VLM/VLA systems, integrating vision … optimise models for real-time edge inference. Grow, mentor, and unblock a high-performing team of researchers and engineers. Requirements 6+ years developing deep learning systems, with 2+ years in technical leadership. Hands-on experience with LLM/VLM architecture design and large-scale training. Proven expertise in robotics ...