Permanent Reinforcement Learning Jobs in England

1 to 25 of 398 Permanent Reinforcement Learning Jobs in England

Research Scientist | Reinforcement Learning | Python | Machine Learning | Deep Learning | Hybri[...]

London, England, United Kingdom
Enigma
Research Scientist | Reinforcement Learning | Python | Machine Learning | Deep Learning | Hybrid, London Research Scientist | Reinforcement Learning | Python | Machine Learning | Deep Learning | Hybrid, London 1 day ago Be among the first 25 applicants Direct message the job poster from Enigma Founder @ Enigma | Creator of … Vision to Venture Podcast & The Neural Network Community Research Scientist | Reinforcement Learning | Python | Machine Learning | Deep Learning | Hybrid, London Overview Today’s internet is misaligned with its users – at best one-size-fits-all and static, and at worst adversarial. We’re building AI-powered tools … s goals, needs, and values. In this new human-aligned internet, generative user interfaces will augment our capacity for discovery, understanding, and expression by learning from and responding to users in real time. We’re powering these experiences with our bespoke foundation model, enabling intelligent, highly personalized user experiences More ❯
Posted:

NLP / LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence | London, UK

London, England, United Kingdom
JPMorgan Chase & Co
NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence JPMorgan Chase & Co. London, United Kingdom NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence JPMorgan Chase … Co. London, United Kingdom Apply now Posted 3 months ago Permanent Competitive NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence Job Description NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites … the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, speech analytics, time series, reinforcement learning and recommendation systems. The candidate must excel in working in a highly collaborative environment together with the business, technologists and control More ❯
Posted:

Applied Scientist

London, England, United Kingdom
Hybrid / WFH Options
Wayve
us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and … to autonomous driving or similar robotics or decision making domain, inclusive, but not limited to the following specific areas: Model-free and model-based reinforcement learning Offline reinforcement learning Planning with learned models, model predictive control and tree search Imitation learning, inverse reinforcement learning … of real-world driving data How to architect our models to best employ the latest advances in foundation models, transformers, world models, etc. Which learning algorithms to use (e.g. reinforcement learning, behavioural cloning) How to leverage simulation for controlled experimental insight, training data augmentation, and re-simulation More ❯
Posted:

Research Engineer - Bayesian Optimisation (Contractor)

London, England, United Kingdom
Huawei Technologies Research & Development (UK) Ltd
UK) Ltd Get AI-powered advice on this job and more exclusive features. Job Description Job Title Research Engineer – Bayesian Optimization Department/Group Reinforcement Learning Team Reports To Haitham Ammar Manager Position No Location London Position Type Full time 37.5 hours per week About Huawei Research And … looking for a Research Engineer with experience in Bayesian optimization . We wish to recruit exceptional candidates to pursue research in various aspects of reinforcement learning and Bayesian optimization for autonomous decision-making under uncertainty. The successful applicant is expected to develop novel contributions allowing the field to … world applications. Key research questions include but are not limited to scalable high-dimensional Bayesian optimization, Gaussian processes, Bayesian neural networks, safe and robust reinforcement learning, multi-agent reinforcement learning, and model-based reinforcement learning. This is an exceptional opportunity for research while collaborating with More ❯
Posted:

Research Engineer: Graph Machine Learning

London, United Kingdom
Atmanlabs
Research Engineer: Graph Machine Learning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. As an applied research and commercialization … proactive shopping assistance, to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Machine Reasoning: Web-scale Knowledge Graph … Exploration using Reinforcement Learning Human experts can form and explore structured mental models in their heads to solve open-ended problems across different domains. Our research seeks to emulate this process through a novel combination of using reinforcement learning agents to perform exploration through a knowledge More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer, Machine Learning (Horizons)

London, England, United Kingdom
Hybrid / WFH Options
Anthropic
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We aim for AI to be safe and beneficial for users and society. Our team is a growing group of researchers, engineers, policy experts, and business leaders working … together to build beneficial AI systems. About the role: As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, enhancing reasoning abilities in … areas like code generation and mathematics, and exploring reinforcement learning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models. Create tools and environments for models to interact with, enabling complex More ❯
Posted:

Research Engineer, Machine Learning (Horizons)

London, United Kingdom
Hybrid / WFH Options
Menlo Ventures
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers … policy experts, and business leaders working together to build beneficial AI systems. About the role: As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental … research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer, Machine Learning (Horizons) London, UK

London, England, United Kingdom
Hybrid / WFH Options
Alcides Fonseca
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers … policy experts, and business leaders working together to build beneficial AI systems. About the role: As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental … research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models. More ❯
Posted:

NLP / LLM Scientist - Applied AI ML Senior Associate - Machine Learning Centre of Excellence

London, England, United Kingdom
JPMorgan Chase & Co
/Redefined, the UK's leading over-50s age inclusive jobs board. Job Description NLP/LLM Scientist - Applied AI ML Senior Associate - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites applications for a role focused on applying sophisticated machine learning methods to … complex tasks such as natural language processing, speech analytics, time series, reinforcement learning, and recommendation systems. The ideal candidate will excel in a highly collaborative environment, working with business teams, technologists, and control partners to deploy solutions into production. A strong passion for machine learning, along with … dedication to continuous learning, research, and experimentation, is essential. Hands-on expertise in Deep Learning, analytical thinking, motivation, and a desire to innovate are also required. Job Responsibilities Research and explore new machine learning methods through independent study, industry conferences, experimentation, and knowledge sharing. Develop state-of More ❯
Posted:

Machine Learning & Reinforcement Learning Lead

Slough, England, United Kingdom
JR United Kingdom
Social network you want to login/join with: Machine Learning & Reinforcement Learning Lead, Slough Client: Opus Recruitment Solutions Location: Slough, United Kingdom Job Category: Other EU work permit required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description: Hot Opportunity Alert! Central London Office … working with one of the most exciting robotics R&D companies out there. We’re looking for a Senior Engineer with deep expertise in reinforcement learning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental More ❯
Posted:

Machine Learning & Reinforcement Learning Lead

London, England, United Kingdom
JR United Kingdom
Social network you want to login/join with: Machine Learning & Reinforcement Learning Lead, london col-narrow-left Client: Opus Recruitment Solutions Location: london, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 col-wide … working with one of the most exciting robotics R&D companies out there ? We’re looking for a Senior Engineer with deep expertise in reinforcement learning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental More ❯
Posted:

Applied AI ML Senior Associate - Machine Learning Center of Excellence - Time Series Reinforcem[...]

London, England, United Kingdom
JPMorgan Chase & Co
Applied AI ML Senior Associate - Machine Learning Center of Excellence - Time Series Reinforcement Learning Job Description The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. This includes ensuring the quality, integrity, and security of the company … making. The CDAO is also responsible for developing and implementing solutions that support the firm's commercial goals by harnessing artificial intelligence and machine learning technologies to develop new products, improve productivity, and enhance risk management effectively and responsibly. As an Applied AI ML Senior Associate in Machine Learning Center of Excellence, you will have the opportunity to apply sophisticated machine learning methods to complex tasks including time series analysis, reinforcement learning, causal inference, and natural language processing. You will collaborate with various teams and actively participate in our knowledge sharing community. We are looking More ❯
Posted:

Member of Technical Staff, AI - Reinforcement Systems

London, England, United Kingdom
Microsoft
Member of Technical Staff, AI - Reinforcement Systems Join to apply for the Member of Technical Staff, AI - Reinforcement Systems role at Microsoft Member of Technical Staff, AI - Reinforcement Systems 1 week ago Be among the first 25 applicants Join to apply for the Member of Technical Staff … AI - Reinforcement Systems role at Microsoft Help build the world’s most advanced reinforcement learning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics More ❯
Posted:

Applied AI ML Associate - Machine Learning Scientist - Machine Learning for Technology

London, England, United Kingdom
JPMorgan Chase & Co
Job Description Join the elite Applied Innovation of AI (AI2) team at JP Morgan Chase, strategically located within the CTO office. As a Machine Learning Specialist within the JPMC businesses, you will be responsible for addressing business-critical priorities using innovative machine learning techniques. You will work closely … the growth of the business and explore novel challenges that could revolutionize the way the bank operates. Your role will involve applying advanced machine learning methods to a range of complex tasks, such as data mining, text understanding, anomaly detection, and generative AI. You will collaborate with business, technologists … Additionally, your responsibilities will include researching new methods, developing models, and contributing to reusable code and components. Job Responsibilities: Research and explore new machine learning methods through independent study, attending conferences, and experimentation. Develop state-of-the-art machine learning models to solve real-world problems in Cybersecurity More ❯
Posted:

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
InstaDeep Ltd
AI revolution! About DeepPCB: DeepPCB is InstaDeep's AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a … Machine Learning Engineer to join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB design problems, working closely with researchers … and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning algorithms to tackle PCB place-and-route challenges. Adapt and optimize ML models for large-scale distributed computing environments (e.g., GPUs, multi-node clusters). Build, test, and deploy robust production-level ML systems integrated More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer

London, England, United Kingdom
Hybrid / WFH Options
InstaDeep
AI revolution! About DeepPCB: DeepPCB is InstaDeep's AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. Learn more at deeppcb.ai. Role … Overview: We are looking for a Machine Learning Engineer to join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB … design problems, working closely with researchers and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning algorithms to tackle PCB place-and-route challenges. Adapt and optimize ML models for large-scale distributed computing environments (e.g., GPUs, multi-node clusters). Build, test, and deploy More ❯
Posted:

Member of Technical Staff, AI - Reinforcement Systems

London, United Kingdom
Microsoft
Help build the world's most advanced reinforcement learning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you'll help advance state-of-the-art model … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics … competitive programming, and related domains are a plus. Responsibilities Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments. Embody our culture of collaboration, innovation, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist / Research Engineer, Pre-training

London, United Kingdom
Hybrid / WFH Options
Menlo Ventures
Contribute to the entire stack, from low-level optimizations to high-level model design Qualifications: Advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field Strong software engineering skills with a proven track record of building complex systems Expertise in Python and experience with deep … learning frameworks (PyTorch preferred) Familiarity with large-scale machine learning, particularly in the context of language models Ability to balance research goals with practical engineering constraints Strong problem-solving skills and a results-oriented mindset Excellent communication skills and ability to work in a collaborative environment Care about … Work on high-performance, large-scale ML systems Familiarity with GPUs, Kubernetes, and OS internals Experience with language modeling using transformer architectures Knowledge of reinforcement learning techniques Background in large-scale ETL processes You'll thrive in this role if you: Have significant software engineering experience Are results More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist - Creative Optimization

London, England, United Kingdom
GroupM
to VP Data Science and be part of a small (but growing!) team of Data Scientists. The ideal candidate will have a background in Reinforcement Learning (or related disciplines), with hands-on cloud technology experience. Whilst commercial experience is highly desirable, given the deployment of RL at scale … Develop and optimize the ACO algorithm(s) and related Data Science components for the product Design and contribute to the end-to-end machine learning pipeline from data collection, reprocessing to model training, simulation, evaluation, deployment and experimentation/testing Implement and interpret explainability frameworks to provide clear insights … or master's degree in Data Science, Computer Science, Engineering, Statistics, or a related quantitative field Hands-on (academic/commercial) experience in implementing Reinforcement Learning (or a related displicine). Please note: We use the term Reinforcement Learning as an umbrella term rather than a More ❯
Posted:

Robotics Control Engineer - Reinforcement Learning

London, England, United Kingdom
Humanoid
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior Reinforcement Learning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcement learning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯
Posted:

Robotics Control Engineer - Reinforcement Learning

London Area, United Kingdom
Humanoid
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior Reinforcement Learning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcement learning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯
Posted:

Robotics Control Engineer - Reinforcement Learning

City of London, London, United Kingdom
Humanoid
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior Reinforcement Learning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcement learning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯
Posted:

Machine Learning Engineer - Deep Learning

Slough, England, United Kingdom
JR United Kingdom
Social network you want to login/join with: Machine Learning Engineer - Deep Learning, slough col-narrow-left Client: Location: slough, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 col-wide Job Description: RAPP is … and causal AI projects within a fast-moving, agency-style environment. What You'll Be Doing: Designing and building custom forecasting models (XGBoost, deep learning, RL) for real-world retail scenarios Applying causal and graph-based methods to understand and optimise customer behaviour Working across the full stack of … wrangling to deployment Operating within a modern MLOps setup (Docker, CI/CD, AWS) What We're Looking For: Strong practical knowledge of deep learning fundamentals - ideally with PyTorch Experience building bespoke models for time series, tabular, image, or text data Hands-on forecasting experience with retail or consumer More ❯
Posted:

Machine Learning Engineer - Deep Learning

London, England, United Kingdom
JR United Kingdom
Social network you want to login/join with: Machine Learning Engineer - Deep Learning, london col-narrow-left Client: Location: london, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 col-wide Job Description: RAPP is … and causal AI projects within a fast-moving, agency-style environment. What You'll Be Doing: Designing and building custom forecasting models (XGBoost, deep learning, RL) for real-world retail scenarios Applying causal and graph-based methods to understand and optimise customer behaviour Working across the full stack of … wrangling to deployment Operating within a modern MLOps setup (Docker, CI/CD, AWS) What We're Looking For: Strong practical knowledge of deep learning fundamentals - ideally with PyTorch Experience building bespoke models for time series, tabular, image, or text data Hands-on forecasting experience with retail or consumer More ❯
Posted:

Research Engineer - Machine Learning

London, England, United Kingdom
PlayStation Global
is dedicated to creating immersive and unforgettable gaming experiences. As we continue to push the boundaries of technology, we are seeking a talented Machine Learning Engineer specialising in Imitation Learning to join our dynamic R&D team. A passion for pushing the boundaries of artificial intelligence and a … commitment to publishing in top-tier conferences and journals are essential. Our team is at the forefront of Imitation Learning and Reinforcement Learning techniques for game-playing agents, and we seek an expert in the field to join us in developing new technologies. As a Machine Learning … our research team, supporting the technical lead in driving technical excellence and innovation. You will be responsible for designing, implementing and optimising novel machine learning models and algorithms to perform challenging and diverse tasks. You will perform a central role in a highly collaborative and fast-moving environment where More ❯
Posted:
Reinforcement Learning
England
10th Percentile
£76,727
25th Percentile
£90,801
Median
£130,000
75th Percentile
£175,000