17 of 17 Reinforcement Learning Jobs in the UK

Data Scientist - Principal

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
high-impact data science initiatives end-to-end, including problem framing, methodology selection, experiment development, implementation partnership, and impact measurement.* Build and deliver machine learning and reinforcement learning solutions to improve player engagement, retention, monetization, and operational outcomes.* Lead the modeling framework for complex systems, guaranteeing comprehensive … evaluation and monitoring of causal inference, uplift modeling, sequential decisioning, bandits/reinforcement learning, and forecasting.* Partner with game teams to define success metrics, guardrails, and decision frameworks, translating analytical results into actionable product and operational actions.* Define and uphold engineering standards and guidelines for model development, including ...

Robotics Learning Engineer

Hiring Organisation
OpenSourced Ltd
Location
Bristol, Avon, England, United Kingdom
Employment Type
Full-Time
Salary
£60,000 - £100,000 per annum
Senior Robot Learning Engineer – Large Behaviour Models Bristol (On-site) Competitive Salary Full-time, Permanent We are working with a cutting-edge robotics company developing advanced humanoid systems for real-world manipulation tasks. They are seeking a Senior Robot Learning Engineer to lead the development of large behaviour … models for complex, bi-manual robotic manipulation. This role sits at the intersection of robot learning, foundation models, and real-world deployment , offering the opportunity to bring state-of-the-art research into production systems. The Role You will take ownership of scaling and deploying advanced policy architectures across ...

Principal Data Scientist

Hiring Organisation
Jobleads-UK
Location
Birmingham, England, United Kingdom
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government’s AI Cyber Security Code of Practice … Kainos, you will be accountable for the successful delivery of large-scale, high-impact AI solutions that leverage state-of-the- art machine learning, generative, and agentic AI technologies. You will help set the direction for AI and data science across the business, driving the adoption of modern ...

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
United Kingdom
overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

Software Application Engineer

Hiring Organisation
European Tech Recruit
Location
Farnborough, England, United Kingdom
Linux, Android) Strong programming skills in C/C++ and scripting languages such as Python Hands-on experience with computer vision and deep learning frameworks (e.g., OpenCV, OpenCL, PyTorch, TensorFlow, Caffe/Caffe2) Experience with both supervised and unsupervised machine learning, including computer vision, NLP, and LLM-based … Experience with dedicated ML accelerators or embedded AI hardware Proficiency with debugging tools (e.g., JTAG, ftrace, GDB) Experience with generative AI model tuning or reinforcement learning By applying to this role you understand that we may collect your personal data and store and process it on our systems. ...

Senior AI Engineer

Hiring Organisation
Platform Recruitment
Location
London Area, United Kingdom
team building AI-driven trading and quantitative systems operating at significant scale across global financial markets. You will work at the intersection of machine learning, time-series modelling, and algorithmic strategy developing models that directly influence trading decisions and market performance. If you have a strong background … latency ML solutions, and play a key role in defining the technical architecture of the platform. Requirements Master's or PhD in Machine Learning, AI, CompSci, Mathematics, or a quantitative discipline 5+ years of ML engineering experience, ideally within finance or a quantitative environment Expert in Python and deep ...

AI Engineer

Hiring Organisation
Akixi
Location
United Kingdom
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Principal Data Scientist

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
work in small, collaborative teams to explore ideas, test solutions, and scale products that deliver meaningful value to customers. Our environment supports autonomy, learning, and inclusive collaboration.**About the Role**As a **Senior Principal Data Scientist**, you will contribute deep technical expertise while helping shape best practices across … apply advanced analytics to real‐world problems.### **Responsibilities:*** **Technical Leadership** Act as the go-to expert for **state-of-the-art** advanced analytics, machine learning, and generative AI. Define and evolve best practices for complex modeling and design problems, ensuring scalability, robustness, and performance. Provide thought leadership on emerging ...

Software Engineer (Applied AI)

Hiring Organisation
Euphoric
Location
United Kingdom
iteration of our next-generation benefits platform features that leverage personalization, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript … against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural ...

Data Scientist

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £480/day
preprocessing to ensure high-quality inputs for ML models. Model Development: Select and train appropriate architectures (BERT, GPT, etc.) using supervised, unsupervised, or reinforcement learning strategies. Prompt Engineering: Design, test, and iterate on complex prompts to elicit high-quality responses from LLMs while mitigating unintended behaviors. Evaluation & Optimization … establish automated monitoring systems to track drift and performance. Technical Requirements Core AI/ML: Strong experience in ML algorithms, LLM architectures, and deep learning frameworks. Generative AI: Proven expertise in Prompt Engineering and fine-tuning pre-trained models. Engineering: Proficiency in Python and experience designing data pipelines ...

Founding AI Simulation and Control Engineer (up to £125k + equity)

Hiring Organisation
Optimal Agriculture
Location
Greater London, England, United Kingdom
maximise crop yields and minimise resource consumption, working closely with our agronomy experts. Technical skills Strong background in at least one of Machine Learning, Optimisation, Control (Model Predictive Control, Optimal Control, and classical feedback techniques), Reinforcement Learning, Physics Modelling and Numerical Simulation. Experience training Machine Learning ...

Head of Data Science

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
technology to let us do this. About the team: Our data science team (currently six data scientists) work across the business to deploy machine learning approaches that are helping us realise our ambition of becoming the global leader in packaged holidays. The team focuses on building value … customers, suppliers and loveholidays through deployment of advanced machine learning techniques. The impact you’ll have: Reporting directly to the VP of Data, you will lead the function that embeds intelligence at the heart of how we grow. You will define and drive Data Science’s contribution to core ...

AI Consultant

Hiring Organisation
Reed
Location
London, UK
Employment Type
Full-time
role: Design, build, and maintain AI models and agentic systems that improve regulatory services and internal operations. Develop and tune machine learning models to ensure accuracy, fairness, and reliability in production use. Orchestrate intelligent automation workflows using low code/no code platforms and agentic AI frameworks. Monitor … secure handling of sensitive data, and regulatory considerations. Knowledge of MLOps/DevOps practices, including CI/CD pipelines for ML models. Understanding of reinforcement learning, multi-agent systems, or advanced optimization methods. Previous experience working in regulated industries is advantageous. In the first instance, please submit your ...

AI Consultant

Hiring Organisation
Reed
Location
London, South East, England, United Kingdom
Employment Type
Temporary
Salary
£550 - £650 per day, Inc benefits
role: Design, build, and maintain AI models and agentic systems that improve regulatory services and internal operations. Develop and tune machine learning models to ensure accuracy, fairness, and reliability in production use. Orchestrate intelligent automation workflows using low code/no code platforms and agentic AI frameworks. Monitor … secure handling of sensitive data, and regulatory considerations. Knowledge of MLOps/DevOps practices, including CI/CD pipelines for ML models. Understanding of reinforcement learning, multi-agent systems, or advanced optimization methods. Previous experience working in regulated industries is advantageous. In the first instance, please submit your ...

Founding Forward Deployed Engineer (up to £125k + equity)

Hiring Organisation
Optimal Agriculture
Location
Greater London, England, United Kingdom
Clear communication with technical and non-technical users. Strong intuition for physical and biological systems. Strong foundations in at least one of Machine Learning, Optimisation, Control (Model Predictive Control, Optimal Control, and classical feedback techniques), Reinforcement Learning, Physics Modelling and Numerical Simulation. Experience building useful tools quickly ...

Software Engineer

Hiring Organisation
it stars
Location
Malvern, Worcestershire, England, United Kingdom
Employment Type
Full-Time
Salary
£38,000 - £48,000 per annum
goals and outcomes. Document findings thoroughly and hand over validated work to the wider development team. Demonstrable knowledge and experience in data mining, machine learning (including deep learning), generative AI (e.g., with Retrieval-Augmented Generation), and reinforcement learning. Software Development & Delivery Design, develop, test and maintain software ...

Head of Data Science

Hiring Organisation
Jobleads-UK
Location
United Kingdom
technology to let us do this. About The Team Our data science team (currently six data scientists) work across the business to deploy machine learning approaches that are helping us realise our ambition of becoming the global leader in packaged holidays. The team focuses on building value … customers, suppliers and loveholidays through deployment of advanced machine learning techniques. The Impact You’ll Have Reporting directly to the VP of Data, you will lead the function that embeds intelligence at the heart of how we grow. You will define and drive Data Science’s contribution to core ...