Reinforcement Learning Jobs in England

51 to 75 of 86 Reinforcement Learning Jobs in England

Head of AI Architecture

London, United Kingdom
Vista Equity Partners Management, LLC
ten years experience designing and architecting large-scale, distributed systems. At least three years designing and deploying AI/ML/NLP/Deep Learning solutions. Demonstrable experience as an expert in AI, with a consistent record of leading the architecture of multiple LLM and Generative AI solutions. Expertise … embeddings, chunking and other optimization techniques within VectorDBs. Expert programming skills in languages such as Python, Java, or C++, and working knowledge of deep learning frameworks such as TensorFlow, PyTorch, or Hugging Face. Understanding of generative models, such as Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Transformer models … application frameworks, such as Langchain. Proven expertise in CI/CD pipeline management within the Generative AI space, including LLMOps. Comprehensive understanding of machine learning and NLP algorithms, with hands-on experience in frameworks like Scikit-Learn, TensorFlow, and PyTorch. Strong knowledge of generative AI techniques, including deep generative More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist

London, United Kingdom
Hybrid / WFH Options
Faculty
large-scale open-source projects. You will also have the opportunity to teach on the fellowship about topics that range from basic statistics to reinforcement learning, and to mentor the fellows through their 6-week project. Thanks to Faculty platform, you will have access to powerful computational resources … in a short timeframe An excellent command of the basic libraries for data science (e.g. NumPy, Pandas, Scikit-Learn) and familiarity with a deep-learning framework (e.g. TensorFlow, PyTorch, Caffe) A high level of mathematical competence and proficiency in statistics A solid grasp of essentially all of the standard … data science techniques, for example, supervised/unsupervised machine learning, model cross validation, Bayesian inference, time-series analysis, simple NLP, effective SQL database querying, or using/writing simple APIs for models. We regard the ability to develop new algorithms when an innovative solution is needed as a fundamental More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer

Burbage, Leicestershire, United Kingdom
The Portfolio Group
An exceptional opportunity for a Machine Learning Engineer (with Full-Stack experience) to join an innovative market leader at the forefront of developing next-generation solutions that transform digital interactions. The role will focus on projects to leverage state-of-the-art generative AI, retrieval-augmented generation (RAG), and … reasoning frameworks to build intelligent and context-aware systems. We are seeking talented Machine Learning Engineers with full-stack software development experience to join our client's team and help shape the future of AI-powered automation. Within this dynamic role varied duties will include: Search relevancy engineering. Conversational … pipelines for effective information retrieval from structured and unstructured sources. Model Fine-Tuning & Training : Train domain-specific models using techniques like LoRA, QLoRA, PEFT, reinforcement learning, and supervised fine-tuning (SFT). Model Deployment & Inferencing : Optimise model serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer

Hinckley, Leicestershire, United Kingdom
The Portfolio Group
An exceptional opportunity for a Machine Learning Engineer (with Full-Stack experience) to join an innovative market leader at the forefront of developing next-generation solutions that transform digital interactions. The role will focus on projects to leverage state-of-the-art generative AI, retrieval-augmented generation (RAG), and … reasoning frameworks to build intelligent and context-aware systems. We are seeking talented Machine Learning Engineers with full-stack software development experience to join our client's team and help shape the future of AI-powered automation. Within this dynamic role varied duties will include: Search relevancy engineering. Conversational … pipelines for effective information retrieval from structured and unstructured sources. Model Fine-Tuning & Training : Train domain-specific models using techniques like LoRA, QLoRA, PEFT, reinforcement learning, and supervised fine-tuning (SFT). Model Deployment & Inferencing : Optimise model serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration More ❯
Employment Type: Permanent
Posted:

Staff Applied Scientist - Search

London, United Kingdom
Robin AI
experimenting with cutting edge research on how to best solve pressing issues surfacing in the legal domain. You will apply your expertise in machine learning, data science, and behavioural modelling to drive meaningful insights and innovations. As AI agents become the norm in this industry, so does the need … knowledge bases and systems (e.g., hybrid search approaches combining symbolic and neural techniques, query understanding and rewriting for legal search) Perform fine-tuning and reinforcement learning to teach language models how to interact with new information architectures. Build "hard" eval sets to help identify failure modes of how … infrastructure for running experiments and visualising results. Work with colleagues to communicate results internally and publicly. Stay updated with the latest research in machine learning, AI, knowledge representation and retrieval to bring innovative solutions to the table. Mentor junior researchers and contribute to building a collaborative, knowledge-sharing culture. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist

London Area, United Kingdom
Synergetic
with software engineering capabilities to build end-to-end AI solutions. The ideal candidate will have a strong foundation in both developing sophisticated machine learning models and implementing them within production systems. You will work closely with cross-functional teams to transform concepts into scalable AI-powered products. Candidates … able to thrive in fast paced environments. Being ok with ambiguity and strong communications skills are must. Responsibilities Design, develop, and implement advanced machine learning models and AI capabilities Build and maintain knowledge graphs and causal inference systems Create probabilistic models to address complex business problems Scale AI solutions … technical decision-making Translate business requirements into technical specifications Required Skills & Experience Extensive experience combining data science with software engineering Strong expertise in machine learning, with focus on causal ML and probabilistic modelling Experience developing and implementing knowledge graphs Proficiency in scaling AI solutions from concept to production Working More ❯
Posted:

Data Scientist

london, south east england, united kingdom
Synergetic
with software engineering capabilities to build end-to-end AI solutions. The ideal candidate will have a strong foundation in both developing sophisticated machine learning models and implementing them within production systems. You will work closely with cross-functional teams to transform concepts into scalable AI-powered products. Candidates … able to thrive in fast paced environments. Being ok with ambiguity and strong communications skills are must. Responsibilities Design, develop, and implement advanced machine learning models and AI capabilities Build and maintain knowledge graphs and causal inference systems Create probabilistic models to address complex business problems Scale AI solutions … technical decision-making Translate business requirements into technical specifications Required Skills & Experience Extensive experience combining data science with software engineering Strong expertise in machine learning, with focus on causal ML and probabilistic modelling Experience developing and implementing knowledge graphs Proficiency in scaling AI solutions from concept to production Working More ❯
Posted:

Lead Data Scientist

London Area, United Kingdom
Harnham
Science, Artificial Intelligence, Mathematics, Statistics or related fields. Expert in Python, R, SQL and a range of ML techniques (e.g., random forests, neural nets, reinforcement learning) A good understanding of the regulatory environment, especially responsible lending (creditworthiness/affordability) Experience in using the latest data science techniques to More ❯
Posted:

Principal Data Scientist

London Area, United Kingdom
Harnham
Science, Artificial Intelligence, Mathematics, Statistics or related fields. Expert in Python, R, SQL and a range of ML techniques (e.g., random forests, neural nets, reinforcement learning) Track record of delivering high-impact AI projects from concept to production Strong communication skills – able to translate complex insights into business More ❯
Posted:

Senior AI Engineer

Cambridge, England, United Kingdom
DeepRec.ai
models using techniques like quantization and pruning. Leverage GPU power (CUDA, Python, Jax) for accelerated training. Qualifications: Expertise in Python/C++ and deep learning (PyTorch). Experience building Deep Reinforcement Learning Models. Experience with Robotics and/or Simulation. If you want to join a company More ❯
Posted:

Senior AI Engineer

cambridge, east anglia, united kingdom
DeepRec.ai
models using techniques like quantization and pruning. Leverage GPU power (CUDA, Python, Jax) for accelerated training. Qualifications: Expertise in Python/C++ and deep learning (PyTorch). Experience building Deep Reinforcement Learning Models. Experience with Robotics and/or Simulation. If you want to join a company More ❯
Posted:

Senior AI Engineer

Cambridge, south west england, united kingdom
DeepRec.ai
models using techniques like quantization and pruning. Leverage GPU power (CUDA, Python, Jax) for accelerated training. Qualifications: Expertise in Python/C++ and deep learning (PyTorch). Experience building Deep Reinforcement Learning Models. Experience with Robotics and/or Simulation. If you want to join a company More ❯
Posted:

Applied Scientist III, Amzn Shipping-Prd & Tech, Amzn Shipping-Prd & Tech

London, United Kingdom
Amazon
help improve our ability to plan and execute package movements. As an Applied Scientist in Amazon Shipping, you will work on multiple challenging machine learning problems spread across a wide spectrum of business problems. You will build ML models to help our transportation cost auditing platforms effectively audit off … require you to demonstrate Think Big and Invent and Simplify, by refining and translating Transportation domain-related business problems into one or more Machine Learning problems. You will use techniques from a wide array of machine learning paradigms, such as supervised, unsupervised, semi-supervised and reinforcement learning. … Your model choices will include, but not be limited to, linear/logistic models, tree based models, deep learning models, ensemble models, and Q-learning models. You will use techniques such as LIME and SHAP to make your models interpretable for your customers. You will employ a family More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Robotics Manipulation Engineer

London, England, United Kingdom
Humanoid
of grant proposals and funding applications to secure external research grants and sponsorships. Expertise Ph.D. or equivalent experience in computer science, artificial intelligence, machine learning, or related field, with a focus on manipulation, robotics, or AI/ML. Strong theoretical and practical knowledge in areas such as reinforcement learning, motion planning, and perception, with a specific emphasis on manipulation tasks. Proficiency in programming languages such as Python, C++, or MATLAB, as well as experience with relevant libraries and frameworks for AI/ML development. Experience in conducting independent research and publishing in top-tier conferences and More ❯
Posted:

Senior Robotics Manipulation Engineer

london, south east england, United Kingdom
Humanoid
of grant proposals and funding applications to secure external research grants and sponsorships. Expertise Ph.D. or equivalent experience in computer science, artificial intelligence, machine learning, or related field, with a focus on manipulation, robotics, or AI/ML. Strong theoretical and practical knowledge in areas such as reinforcement learning, motion planning, and perception, with a specific emphasis on manipulation tasks. Proficiency in programming languages such as Python, C++, or MATLAB, as well as experience with relevant libraries and frameworks for AI/ML development. Experience in conducting independent research and publishing in top-tier conferences and More ❯
Posted:

Program Manager, ACES, OPTIMA

London, United Kingdom
Amazon
a global team and enables Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the … shopping feature teams deliver superior CX quality by providing them reliable and comprehensive insights and ground truth data to measure and train ML (Machine Learning) models and handle annotation and Root Cause Analysis (RCA) across 10 different languages. Minimum Qualifications Minimum 2 years Project/Program management experience in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Robotics control Engineer Locomotion

London Area, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipeline, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors. More ❯
Posted:

Senior Robotics control Engineer Locomotion

london, south east england, United Kingdom
Humanoid
while manipulating, fall recovery, and other advanced mobility tasks. The ideal candidate will have strong expertise in classic locomotion pipeline, whole-body control and reinforcement learning. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into … for biped robots, focusing on locomotion. Proficiency with model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Knowledge of advanced topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Familiarity with real-time control systems and integration with hardware, including actuators and sensors. More ❯
Posted:

Robotics Control Engineer - Locomotion

London Area, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Posted:

Robotics Control Engineer - Locomotion

london, south east england, united kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Posted:

Robotics Control Engineer - Locomotion

London, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual
Posted:

Robotics Control Engineer - Locomotion

London, Paddington, United Kingdom
Proactive Global
biped robots, focusing on locomotion. Understanding of model predictive control (MPC), optimal control, and feedback control loops in dynamic robotic systems. Strong expertise in reinforcement learning for robotics Deep understanding of humanoid robot dynamics and balance control. Strong experience with hardware-in-the-loop testing and deployment on … Isaac Sim or similar environments. Proficiency in Python and C++ for algorithm development, testing, and deployment. Experience in topics like model-free RL, imitation learning, or hybrid control systems that combine classic and modern methods. Desirable Skills and Experience Familiarity with real-time control systems and integration with hardware More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum
Posted:

Sr. Data Associate with English and German, OPTIMA - Alexa Shopping

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. By joining us, you will play a pivotal role in shaping the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Program Manager, OPTIMA

London, United Kingdom
Amazon
a global team and enables Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the … feature teams to deliver superior CX quality by providing them reliable and comprehensive insights and ground truth data to measure and train ML (Machine Learning) models and handle annotation and Root Cause Analysis (RCA) across 10 different languages. We are seeking a highly organized, detail-oriented Program Manager to More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead Technologist- AI & ML

Chelmsford, Essex
Hybrid / WFH Options
BAE Systems
strong background in the mathematics, statistics and computational techniques which underpin AI/ML. The Data and Decision Support Capability has teams working in: reinforcement learning; NLP/LLMs; knowledge graphs and graph based neural networkss; AI for RF and EW, radar, sonar, acoustics, AI for image and … interest are candidates with experience in one or more of the following domains: AI/ML for imagery including applied to remote sensing applications. Reinforcement learning. Natural Language Processing. Large Language Models. Knowledge graphs and graph-based neural nets. AI/ML for RF and EW, radar, sonar, acoustics. More ❯
Employment Type: Permanent
Posted:
Reinforcement Learning
England
10th Percentile
£76,727
25th Percentile
£90,801
Median
£150,000
75th Percentile
£175,000