Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London 3 days ago Be among the first 25 applicants Direct message the job poster from Enigma Founder @ Enigma | Creator of … Vision to Venture Podcast & The Neural Network Community Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London Overview Today’s internet is misaligned with its users – at best one-size-fits-all and static, and at worst adversarial. We’re building AI-powered tools … s goals, needs, and values. In this new human-aligned internet, generative user interfaces will augment our capacity for discovery, understanding, and expression by learning from and responding to users in real time. We’re powering these experiences with our bespoke foundation model, enabling intelligent, highly personalized user experiences More ❯
Founding Engineer: ReinforcementLearning for Expert Reasoning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate human expertise. We believe our research poses a credible path to emulate true human cognition and interaction with deep knowledge and proactive reasoning. Our unique research … is inspired by biological priors, lies at the intersection of custom ReinforcementLearning environments and Large-Scale Knowledge Representation, and is evolved and compounded with commercial application. We are deploying our platform in products across a number of commercial domains to demonstrate the value of our approach – starting … with building proactive shopping concierges for e-commerce, to eventually launching expert systems across travel, healthcare, education, science and more. The Next Frontier of ReinforcementLearning: Emulating How Humans can Reason about Knowledge to Solve Complex, Long-Horizon Tasks We are hiring for a founding engineer that will More ❯
London, England, United Kingdom Hybrid / WFH Options
Wayve Technologies Ltd
us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and … to autonomous driving or similar robotics or decision making domain, inclusive, but not limited to the following specific areas: Model-free and model-based reinforcementlearning Offline reinforcementlearning Planning with learned models, model predictive control and tree search Imitation learning, inverse reinforcementlearning … of real-world driving data How to architect our models to best employ the latest advances in foundation models, transformers, world models, etc. Which learning algorithms to use (e.g. reinforcementlearning, behavioural cloning) How to leverage simulation for controlled experimental insight, training data augmentation, and re-simulation More ❯
Social network you want to login/join with: Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London, london col-narrow-left Client: Enigma Location: london, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 3 Posted: 10.06.2025 … Expiry Date: 25.07.2025 col-wide Job Description: Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London Overview Today’s internet is misaligned with its users – at best one-size-fits-all and static, and at worst adversarial. We’re building AI-powered tools to … s goals, needs, and values. In this new human-aligned internet, generative user interfaces will augment our capacity for discovery, understanding, and expression by learning from and responding to users in real time. We’re powering these experiences with our bespoke foundation model, enabling intelligent, highly personalized user experiences More ❯
Huawei Technologies Research & Development (UK) Ltd
UK) Ltd Get AI-powered advice on this job and more exclusive features. Job Description Job Title Research Engineer – Bayesian Optimization Department/Group ReinforcementLearning Team Reports To Haitham Ammar Manager Position No Location London Position Type Full time 37.5 hours per week About Huawei Research And … looking for a Research Engineer with experience in Bayesian optimization . We wish to recruit exceptional candidates to pursue research in various aspects of reinforcementlearning and Bayesian optimization for autonomous decision-making under uncertainty. The successful applicant is expected to develop novel contributions allowing the field to … world applications. Key research questions include but are not limited to scalable high-dimensional Bayesian optimization, Gaussian processes, Bayesian neural networks, safe and robust reinforcementlearning, multi-agent reinforcementlearning, and model-based reinforcement learning. This is an exceptional opportunity for research while collaborating with More ❯
Research Engineer: Graph Machine Learning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. As an applied research and commercialization … proactive shopping assistance, to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of ReinforcementLearning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Machine Reasoning: Web-scale Knowledge Graph … Exploration using ReinforcementLearning Human experts can form and explore structured mental models in their heads to solve open-ended problems across different domains. Our research seeks to emulate this process through a novel combination of using reinforcementlearning agents to perform exploration through a knowledge More ❯
London, England, United Kingdom Hybrid / WFH Options
Anthropic
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We aim for AI to be safe and beneficial for users and society. Our team is a growing group of researchers, engineers, policy experts, and business leaders working … together to build beneficial AI systems. About the role: As a Research Engineer on the ReinforcementLearning Fundamentals team, you will collaborate with researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcementlearning, enhancing reasoning abilities in … areas like code generation and mathematics, and exploring reinforcementlearning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcementlearning techniques to improve the performance and safety of large language models. Create tools and environments for models to interact with, enabling complex More ❯
London, England, United Kingdom Hybrid / WFH Options
Alcides Fonseca
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers … policy experts, and business leaders working together to build beneficial AI systems. About the role: As a Research Engineer on the ReinforcementLearning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental … research in reinforcementlearning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcementlearning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcementlearning techniques to improve the performance and safety of large language models. More ❯
/Redefined, the UK's leading over-50s age inclusive jobs board. Job Description NLP/LLM Scientist - Applied AI ML Senior Associate - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites applications for a role focused on applying sophisticated machine learning methods to … complex tasks such as natural language processing, speech analytics, time series, reinforcementlearning, and recommendation systems. The ideal candidate will excel in a highly collaborative environment, working with business teams, technologists, and control partners to deploy solutions into production. A strong passion for machine learning, along with … dedication to continuous learning, research, and experimentation, is essential. Hands-on expertise in Deep Learning, analytical thinking, motivation, and a desire to innovate are also required. Job Responsibilities Research and explore new machine learning methods through independent study, industry conferences, experimentation, and knowledge sharing. Develop state-of More ❯
Social network you want to login/join with: Machine Learning & ReinforcementLearning Lead, Slough Client: Opus Recruitment Solutions Location: Slough, United Kingdom Job Category: Other EU work permit required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description: Hot Opportunity Alert! Central London Office … working with one of the most exciting robotics R&D companies out there. We’re looking for a Senior Engineer with deep expertise in reinforcementlearning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental More ❯
Social network you want to login/join with: Machine Learning & ReinforcementLearning Lead, london col-narrow-left Client: Opus Recruitment Solutions Location: london, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 col-wide … working with one of the most exciting robotics R&D companies out there ? We’re looking for a Senior Engineer with deep expertise in reinforcementlearning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental More ❯
Social network you want to login/join with: Research Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London, Slough Client: Enigma Location: Slough, United Kingdom Job Category: Other EU work permit required: Yes Job Views: 3 Posted: 10.06.2025 Expiry Date: 25.07.2025 Job Description: Research … Scientist | ReinforcementLearning | Python | Machine Learning | Deep Learning | Hybrid, London Overview Today’s internet is misaligned with its users – at best one-size-fits-all and static, and at worst adversarial. We’re building AI-powered tools to create new digital experiences that are fundamentally aligned … s goals, needs, and values. In this new human-aligned internet, generative user interfaces will augment our capacity for discovery, understanding, and expression by learning from and responding to users in real time. We’re powering these experiences with our bespoke foundation model, enabling intelligent, highly personalized user experiences More ❯
Social network you want to login/join with: Robotics Control Engineer - ReinforcementLearning, slough col-narrow-left Client: Humanoid Location: slough, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 3 Posted: 04.06.2025 Expiry Date: 19.07.2025 col-wide Job Description: Humanoid … next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior ReinforcementLearning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcementlearning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world More ❯
We are a VC-backed startup focused on behavioural AI, currently in stealth. We are building for retail with a focus on Recommendation Systems, ReinforcementLearning and GenAI. We are looking for a Senior Research Engineer with experience in advanced machine learning systems to work with our … up — no legacy, no patchwork systems — just a clean slate and a clear vision. Our mission is to bring the intelligence of modern machine learning directly to the in-store shopping experience. Key Responsibilities: Solution development : Translate latest ML advances into impactful solutions and products, from MVPs to fully … deployed systems. Novel Applied Research : Innovate upon state-of-the-art ML research applied to consumer personalisation. Performance optimisation : Optimise machine learning models for performance in modern environments (e.g., distributed clusters, GPUs). Essential Qualifications: Hands on experience: 3-5+ years implementing applied machine learning research in More ❯
Member of Technical Staff, AI - Reinforcement Systems Join to apply for the Member of Technical Staff, AI - Reinforcement Systems role at Microsoft Member of Technical Staff, AI - Reinforcement Systems 1 week ago Be among the first 25 applicants Join to apply for the Member of Technical Staff … AI - Reinforcement Systems role at Microsoft Help build the world’s most advanced reinforcementlearning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics More ❯
AI revolution! About DeepPCB: DeepPCB is InstaDeep's AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcementlearning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a … Machine Learning Engineer to join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcementlearning models focused on automating complex PCB design problems, working closely with researchers … and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning algorithms to tackle PCB place-and-route challenges. Adapt and optimize ML models for large-scale distributed computing environments (e.g., GPUs, multi-node clusters). Build, test, and deploy robust production-level ML systems integrated More ❯
Help build the world's most advanced reinforcementlearning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you'll help advance state-of-the-art model … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics … competitive programming, and related domains are a plus. Responsibilities Collaborate with research teams to advance state-of-the-art algorithms for reinforcementlearning in LLMs Develop the core systems for adapting reinforcementlearning to unprecedented scales and heterogeneous environments. Embody our culture of collaboration, innovation, and More ❯
Join the elite Applied Innovation of AI (AI2) team at JP Morgan Chase, strategically located within the CTO office. As a Machine Learning Specialist within the JPMC businesses, you will be responsible for addressing business-critical priorities using innovative machine learning techniques. You will work closely with stakeholders … the growth of the business and explore novel challenges that could revolutionize the way the bank operates. Your role will involve applying advanced machine learning methods to a range of complex tasks, such as data mining, text understanding, anomaly detection, and generative AI. You will collaborate with business, technologists … Additionally, your responsibilities will include researching new methods, developing models, and contributing to reusable code and components. Job Responsibilities: Research and explore new machine learning methods through independent study, attending conferences, and experimentation. Develop state-of-the-art machine learning models to solve real-world problems in Cybersecurity More ❯
London, England, United Kingdom Hybrid / WFH Options
Anthropic
Contribute to the entire stack, from low-level optimizations to high-level model design Qualifications: Advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field Strong software engineering skills with a proven track record of building complex systems Expertise in Python and experience with deep … learning frameworks (PyTorch preferred) Familiarity with large-scale machine learning, particularly in the context of language models Ability to balance research goals with practical engineering constraints Strong problem-solving skills and a results-oriented mindset Excellent communication skills and ability to work in a collaborative environment Care about … Work on high-performance, large-scale ML systems Familiarity with GPUs, Kubernetes, and OS internals Experience with language modeling using transformer architectures Knowledge of reinforcementlearning techniques Background in large-scale ETL processes You'll thrive in this role if you: Have significant software engineering experience Are results More ❯
to VP Data Science and be part of a small (but growing!) team of Data Scientists. The ideal candidate will have a background in ReinforcementLearning (or related disciplines), with hands-on cloud technology experience. Whilst commercial experience is highly desirable, given the deployment of RL at scale … Develop and optimize the ACO algorithm(s) and related Data Science components for the product Design and contribute to the end-to-end machine learning pipeline from data collection, reprocessing to model training, simulation, evaluation, deployment and experimentation/testing Implement and interpret explainability frameworks to provide clear insights … or master's degree in Data Science, Computer Science, Engineering, Statistics, or a related quantitative field Hands-on (academic/commercial) experience in implementing ReinforcementLearning (or a related displicine). Please note: We use the term ReinforcementLearning as an umbrella term rather than a More ❯
working with one of the most exciting robotics R&D companies out there 🤖✨ We’re looking for a Senior Engineer with deep expertise in reinforcementlearning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental … Collaborate with multidisciplinary teams to integrate control systems into a unified robotic platform. Required Experience & Skills: MSc or PhD in Robotics, Control Engineering, Machine Learning, or a related field. 3+ years of experience developing control systems for legged robotic platforms. Strong background in reinforcementlearning applied to More ❯
working with one of the most exciting robotics R&D companies out there 🤖✨ We’re looking for a Senior Engineer with deep expertise in reinforcementlearning to help drive the development of intelligent, full-body motion capabilities. This role is ideal for someone passionate about building robust, real … world solutions for dynamic locomotion and manipulation in complex environments. Key Responsibilities: Design and implement learning-based control strategies for advanced locomotion tasks such as walking, balancing under load, stair climbing, and fall recovery. Develop high-fidelity simulation environments that reflect real-world dynamics, including actuator constraints and environmental … Collaborate with multidisciplinary teams to integrate control systems into a unified robotic platform. Required Experience & Skills: MSc or PhD in Robotics, Control Engineering, Machine Learning, or a related field. 3+ years of experience developing control systems for legged robotic platforms. Strong background in reinforcementlearning applied to More ❯
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior ReinforcementLearning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcementlearning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior ReinforcementLearning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcementlearning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯
next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior ReinforcementLearning (RL) Control Engineer to develop locomotion and whole body control skills for our humanoid robots. You’ll be at the cutting edge … on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcementlearning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world … and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications : Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3+ years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. More ❯