Reinforcement Learning Jobs in London

1 to 25 of 62 Reinforcement Learning Jobs in London

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Locus Robotics
to sorting and pack-out, Locus Robotics empowers businesses to meet peak demands and adapt to ever-changing operational needs. Are you a Machine Learning Engineer with a passion for reinforcement learning, multi-agent systems, and simulation at scale? We want to hear from you! At Locus … tools and ML systems to optimize the behavior of large autonomous fleets in dynamic environments. In this role, you will work on cutting-edge reinforcement learning (RL) models, multi-agent systems, and faster-than-real-time simulations to drive innovation in logistics, robotics, and beyond. You'll collaborate … real-time modelling of 1,000+ autonomous agents for various use cases such as fleet optimization, logistics, or robotics. Develop, deploy, and maintain machine learning models, with a strong focus on reinforcement learning (RL) and multi-agent systems to optimize fleet behavior in dynamic environments. Implement and More ❯
Employment Type: Permanent
Salary: GBP Annual

Research Engineer: Graph Machine Learning

London, United Kingdom
Atmanlabs
Research Engineer: Graph Machine Learning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. As an applied research and commercialization … proactive shopping assistance, to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Machine Reasoning: Web-scale Knowledge Graph … Exploration using Reinforcement Learning Human experts can form and explore structured mental models in their heads to solve open-ended problems across different domains. Our research seeks to emulate this process through a novel combination of using reinforcement learning agents to perform exploration through a knowledge More ❯
Employment Type: Permanent
Salary: GBP Annual

NLP / LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence

London, United Kingdom
NLP PEOPLE
NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, speech analytics, time series, reinforcement learning … environment together with the business, technologists and control partners to deploy solutions into production. The candidate must also have a strong passion for machine learning and invest independent time towards learning, researching and experimenting with new innovations in the field. The candidate must have solid expertise in Deep … Learning with hands-on implementation experience and possess strong analytical thinking, a deep desire to learn and be highly motivated. Job Responsibilities Research and explore new machine learning methods through independent study, attending industry-leading conferences, experimentation and participating in our knowledge sharing community. Develop state-of-the More ❯
Employment Type: Permanent
Salary: GBP Annual

Applied Scientist, Prime Tech, Prime Science - ML

London, United Kingdom
Amazon
and directly apply your modeling solutions. There are numerous scientific and technical challenges you will get to tackle in this role, such as deep learning and reinforcement learning, and their application to various types of contextual, multi-step optimization of the customer journey. We employ techniques from … supervised learning, multi-armed bandits, optimization, and RL - while this role is focused on the space of discriminative and generative recommender systems. As the central science team within Prime, our expertise gets routinely called upon to weigh in on a variety of topics. We also emphasize the need and … technologies (EMR/Spark, Redshift, SageMaker, DynamoDB, S3), various ML algorithms and techniques (Random Forests, Neural Networks, supervised/unsupervised/semi-supervised/reinforcement learning, LLMs), and statistical modeling techniques. Major responsibilities - Build and develop machine learning models and supporting infrastructure at TB scale, in coordination More ❯
Employment Type: Permanent
Salary: GBP Annual

Member of Technical Staff, AI - Reinforcement Systems

London, United Kingdom
Microsoft
Help build the world's most advanced reinforcement learning systems at Microsoft AI. We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you'll help advance state-of-the-art model … priorities Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics … competitive programming, and related domains are a plus. Responsibilities Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments. Embody our culture of collaboration, innovation, and More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Data Scientist, Data and Insights (London, Hybrid Remote)

London, United Kingdom
Hybrid / WFH Options
loveholidays
Data Scientists, four Data Scientists, and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning … planning/prioritisation to delivery including monitoring and alerting Designing experiments and modelling to generate actionable insights and enhance business performance Proficient in machine learning and statistical methods for predictive modelling and forecasting Experience deploying ML models to production at scale Solid understanding of SQL Proficiency in unit testing … CI/CD, model management and experiment tracking Desirable Experience with Deep Learning, Generative AI and Reinforcement Learning Experience with Time Series Forecasting and Recommender Systems Previous experience working in e-commerce, retail, or the travel industry. Conducted and analysed large scale A/B experiments Experience More ❯
Employment Type: Permanent
Salary: GBP Annual

AI Researcher Scientist

London, United Kingdom
Autodesk, Inc
imagine, design, and make a better world. We are a team of scientists, researchers, engineers, and designers working together on projects that range from learning-based design systems, computer vision, graphics, robotics, human-computer interaction, sustainability, simulation, manufacturing, architectural design and construction. As a member of the AI Lab … in Autodesk Research you will be an expert in research areas such as artificial intelligence, deep learning, generative AI, machine learning, computer vision, reinforcement learning, information retrieval, and natural language processing. Autodesk's AI Lab is active in the wider research community, targeting publications at CVPR … related to AI/ML such as: Computer Science, Mathematics, Statistics, Physics, Linguistics, Mechanical Engineering, Architecture or related disciplines Publication track record in machine learning conferences and/or journals Significant post-graduate research experience, or 5 or greater years of work experience (actual job title/position will More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Software Development Manager, Amazon Music

London, United Kingdom
Amazon
audio entertainment. We are revolutionizing how customers experience music, podcasts and audiobooks through personalized recommendations and novel listening features. We are innovating with Machine Learning technologies, including Large Language Models, to enhance content recommendations and create more engaging listening experiences. As a technical leader, you'll have ownership of … day in the life You will lead a team of applied scientists and engineers crafting delightful audio experiences. You'll leverage AI and machine learning to revolutionize how customers engage with audio content. Your day will involve leading large-scale, customer-facing services that directly impact the listening experience … the coming year. BASIC QUALIFICATIONS Bachelor's degree in Computer Science, Mathematics, Statistics, or a related quantitative field, or equivalent experience. Experience applying machine learning to solve complex problems for large-scale applications. Experience leading applied science and engineering teams in a software development organization. Highly proficient in both More ❯
Employment Type: Permanent
Salary: GBP Annual

Founding AI Engineer - Reinforcement Learning & LLMs

London Area, United Kingdom
algo1
Founding AI Engineer - Reinforcement Learning & LLMs (London, In-person) VC-backed generative AI startup building platform to take LLM-powered tasks to 99%+ accuracy. Early-stage, engineering-led, everyone codes. Role: Build, train and optimise LLM systems capable of beating industry benchmarks. Deploy robust generative AI systems … ability with startup experience. Curious and loves to solve hard problems with first-principles thinking. Requirements: Experience deploying LLMs into production. Experience with deep reinforcement learning. Strong ML/data science and data engineering fundamentals. Excellent Python skills. More ❯

Senior Applied Scientist, Alexa

London, United Kingdom
Amazon
intuitive, and adaptive content recommendations across all Amazon surfaces. We aim to facilitate seamless reasoning and customer experiences, surpassing the capabilities of previous machine learning models. We are looking for a passionate, talented, and resourceful Senior Applied Scientist in the field of Natural Language Processing (NLP), Large Language Model … Retrieval, to invent and build scalable solutions for a state-of-the-art context-aware personal assistant. A successful candidate will have a strong machine learning background and a desire to push the envelope in one or more of the above areas. The ideal candidate would also enjoy operating in … delightful recommendations and conversations. Your work will directly impact our customers in the form of products and services that make use of various machine learning, deep learning and language model technologies. You will leverage Amazon's heterogeneous data sources and large-scale computing resources to accelerate advances in More ❯
Employment Type: Permanent
Salary: GBP Annual

Research Engineer: Superhuman Visual Generation

London, United Kingdom
Atmanlabs
proactive shopping assistance to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Visual Foundation Models: Unlocking Human Engagement … in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning. You can easily distinguish authentic and high integrity thinkers from 'posers', while also critically evaluating truth from fiction in your own work. Your … dampen the mood. Your intensity goes from 0 to 1000 when you become authentically interested in a topic. You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative More ❯
Employment Type: Permanent
Salary: GBP Annual

CTO - Entourage

London, United Kingdom
Moonsong Labs, Inc
insight that code serves as a universal, learnable language for agents to interact with their environment, we are developing a protocol that enables collective learning through a shared memory layer where mutually distrusting agents can exchange and validate their experiences. This protocol transforms individual code actions, trajectories and experiences … while reducing costs. This approach, combined with carefully designed token incentives and memory curation mechanisms, creates a self-reinforcing ecosystem where each agent's learning contributes to and benefits from the network's growing capability. Our approach goes beyond existing frameworks, creating the connective tissue for the emerging agent … integrations, scalability, and security. Recruiting, hiring, mentoring, and managing a world-class AI development team. Cultivating a culture of ownership, innovation, collaboration, and continuous learning essential for the fast-moving AI agent ecosystem. Engaging with developers across frameworks and tech stacks to understand their needs and incorporate feedback into More ❯
Employment Type: Permanent
Salary: GBP Annual

AI Engineer

London, United Kingdom
Hybrid / WFH Options
Birdie
upskill our broader engineering teams. Build and maintain AI infrastructure leveraging the latest technology approaches: Investigate cutting-edge approaches like generative AI or reinforcement learning to drive adoption and impact. Ensure seamless integration of AI into existing platform and customer-facing processes. We'd love to hear … from you if you have: At least 2 years of experience working in an AI engineering or machine learning role supporting customer-facing processes and infrastructure. Experience working in a cross functional product engineering squad with deep interest in understanding and solving end user problems. Proven expertise in building … scalable machine learning models and deploying them into production environments. Passionate about enabling others with tools, frameworks, and documentation, ensuring teams can independently leverage AI without creating bottlenecks. A technical leader who stays curious about emerging technologies (e.g., generative AI, reinforcement learning) and rapidly evaluates their pros More ❯
Employment Type: Permanent
Salary: GBP Annual

Staff Research Engineer, Applied ML

London, United Kingdom
Google Inc
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred Qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Data Scientist London, UK (hybrid)

London, United Kingdom
Hybrid / WFH Options
Rakuten Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Permanent
Salary: GBP Annual

Staff Research Engineer, Applied ML

London, United Kingdom
WeAreTechWomen
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual

Specialist, Applied AI Engineer

London, United Kingdom
Hybrid / WFH Options
Industrial and Financial Systems
and iterate AI solutions in collaboration with customers and use case leads is a must. You will be responsible for applying generative AI techniques, reinforcement learning, and multi-agent systems to solve real-world industrial challenges. This role demands a high level of creativity in developing new AI … and optimization of AI models to ensure reliability, safety, and regulatory compliance in industrial settings. Stay ahead of the latest advancements in generative AI, reinforcement learning, and multi-agent systems to drive innovation within the company. Qualifications Master's degree in Data Science. Thought leader and expert in applied More ❯
Employment Type: Permanent
Salary: GBP Annual

Process Improvement Specialist - 24 Month FTE, EU CF ACES PSE

London, United Kingdom
ENGINEERINGUK
Fulfillment Technology) Software, FC Launch, Supply Chain, and EU fulfillment center leaders Create standards for your process and training material in collaboration with the Learning & Development team, to secure high level of knowledge within the FC operations Collaborate with the extended EU ACES teams to define process standards and … process excellence and continuous improvement culture in a large organization Strong Process/benchmark auditing and non-compliance management experience Experience with modern machine learning: deep learning, online/reinforcement learning, semi-supervised and transfer learning Experience working with technology and software teams to align More ❯
Employment Type: Permanent
Salary: GBP Annual

Sr. Data Associate with Spanish, OPTIMA - Alexa Shopping

London, United Kingdom
Amazon
Large Language Models (LLMs), enabling Amazon to deliver a superior shopping experience to customers worldwide. Our mission is to empower Amazon's LLMs through Reinforcement Learning from Human Feedback (RLHF) across various categories at high speed. We aspire to provide an end-to-end data solution for the More ❯
Employment Type: Permanent
Salary: GBP Annual

Process Improvement Specialist - 24 Month FTE, EU CF ACES PSE

London, United Kingdom
Amazon
Fulfillment Technology) Software, FC Launch, Supply Chain, and EU fulfillment center leaders Create standards for your process and training material in collaboration with the Learning & Development team, to secure high level of knowledge within the FC operations Collaborate with the extended EU ACES teams to define process standards and … process excellence and continuous improvement culture in a large organization Strong Process/benchmark auditing and non-compliance management experience Experience with modern machine learning: deep learning, online/reinforcement learning, semi-supervised and transfer learning Experience working with technology and software teams to align More ❯
Employment Type: Permanent
Salary: GBP Annual

Machine Learning Engineer, Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify AB
environment. What You'll Do: Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are: 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual

Machine Learning Engineer - Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify
environment. What You'll Do Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual

Machine Learning (ML) Engineer II

Uxbridge, Middlesex, United Kingdom
Hybrid / WFH Options
NLP PEOPLE
office is open planned, and you will be working in an innovative and collaborative environment with your international peers and colleagues. As the Machine Learning (ML) Engineer II, you will be responsible for developing AI Labs' machine learning platform and creating production-ready AI-based solutions for key … guidance to other parts of the business. Key Accountabilities: ML and Natural Language Processing (NLP) Technology: • Leverage proprietary technology stack to build custom machine learning models • Design, implement, and document new ML/NLP modeling techniques and strategies • Develop Back-end/server-side software to support AI solution … development and serving • Build internal frameworks, libraries, and infrastructure to improve machine learning and NLP capabilities to allow for rapid prototyping and new product delivery • Review and adapt recent research in ML and NLP into prototypes and production solutions • Review and improve the code of other engineers to enhance More ❯
Employment Type: Permanent
Salary: GBP Annual

Software Engineer (GenAI)

London, United Kingdom
Barlowe LLP
and data pipelines Conducting experiments and evaluating model performance to ensure high-quality results Staying current with the latest developments in AI and machine learning technologies Influencing the technical direction of AI within G-Research Working with the latest Large Language Models (LLMs), utilising both commercial APIs and on … A degree in Computer Science, Artificial Intelligence, or a related field, or equivalent professional experience Strong programming skills in Python, with experience in machine learning frameworks such as PyTorch, TensorFlow or JAX Hands-on experience fine-tuning LLMs and optimising hyperparameters for improved performance Skilled in building LLM-powered … applications for real-time decision-making and automation across various domains Familiarity with RAG and agentic workflows Solid understanding of machine learning algorithms, deep learning architectures, and their applications Strong problem-solving skills and the ability to translate complex business requirements into technical solutions Desirable skills: Experience with More ❯
Employment Type: Permanent
Salary: GBP Annual

Research Engineer, GenAI, Llama Speech

London, United Kingdom
Meta
generative AI and NLP, with experience in areas like language model evaluation; data processing for pre-training and fine-tuning; responsible LLMs; LLM alignment; reinforcement learning for language model tuning; efficient training and inference; and/or multilingual and multimodal modeling. Responsibilities Design methods, tools, and infrastructure to … goals informed by practical engineering concerns. Contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results. Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU). Work with a large and globally distributed team. Contribute … open-sourcing efforts. Minimum Qualifications Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Research experience in machine learning, deep learning, and/or natural language processing. Experience with developing machine learning models at scale from inception to business impact. Programming More ❯
Employment Type: Permanent
Salary: GBP Annual

Reinforcement Learning salary percentiles, London
25th Percentile: £125,000
Median: £150,000
75th Percentile: £175,000