Reinforcement Learning Jobs in the UK

26 to 50 of 153 Reinforcement Learning Jobs in the UK

Research Engineer, Multimodal (Hiring Immediately)

London, UK
The Rundown AI, Inc
Snapshot Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific … Engineer at Google DeepMind, we look for the following skills and experience: BSc/BEng degree in computer science, mathematics, physics, electrical engineering, machine learning or equivalent (MSc/MEng preferable). Proven experience, either in industry or a research lab, working on complex ML problems and engineering workflows. … Strong knowledge and experience of Python and/or C++. Proven knowledge of machine learning and/or statistics e.g. Neural Nets, Deep Learning, Reinforcement Learning etc. Strong knowledge of algorithm design - with a proven ability to write ML algos from scratch. In addition, the following More ❯
Employment Type: Full-time
Posted:

Founding AI Engineer - Reinforcement Learning & LLMs

London Area, United Kingdom
algo1
Founding AI Engineer - Reinforcement Learning & LLMs (London, In-person) VC-backed generative AI startup building platform to take LLM-powered tasks to 99%+ accuracy. Early-stage, engineering-led, everyone codes. Role: Build, train and optimise LLM systems capable of beating industry benchmarks. Deploy robust generative AI systems … ability with startup experience. Curious and loves to solve hard problems with first-principles thinking. Requirements: Experience deploying LLMs into production. Experience with deep reinforcement learning. Strong ML/data science and data engineering fundamentals. Excellent Python skills. More ❯
Posted:

Founding AI Engineer - Reinforcement Learning & LLMs (Hiring Immediately)

London, UK
algo1
Job Description Founding AI Engineer - Reinforcement Learning & LLMs (London, In-person) VC-backed generative AI startup building platform to take LLM-powered tasks to 99%+ accuracy. Early-stage, engineering-led, everyone codes. Role: Build, train and optimise LLM systems capable of beating industry benchmarks. Deploy robust generative … ability with startup experience. Curious and loves to solve hard problems with first-principles thinking. Requirements: Experience deploying LLMs into production. Experience with deep reinforcement learning. Strong ML/data science and data engineering fundamentals. Excellent Python skills. #J-18808-Ljbffr More ❯
Employment Type: Full-time
Posted:

Senior Applied Scientist, Alexa

London, United Kingdom
Amazon
intuitive, and adaptive content recommendations across all Amazon surfaces. We aim to facilitate seamless reasoning and customer experiences, surpassing the capabilities of previous machine learning models. We are looking for a passionate, talented, and resourceful Senior Applied Scientist in the field of Natural Language Processing (NLP), Large Language Model … Retrieval, to invent and build scalable solutions for a state-of-the-art context-aware personal assistant. A successful candidate will have strong machine learning background and a desire to push the envelope in one or more of the above areas. The ideal candidate would also enjoy operating in … delightful recommendations and conversations. Your work will directly impact our customers in the form of products and services that make use of various machine learning, deep learning and language model technologies. You will leverage Amazon's heterogeneous data sources and large-scale computing resources to accelerate advances in More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer: Superhuman Visual Generation

London, United Kingdom
Atmanlabs
proactive shopping assistance to personal teachers to healthcare concierges - and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Visual Foundation Models: Unlocking Human Engagement … in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning. You can easily distinguish authentic and high integrity thinkers from 'posers', while also critically evaluating truth from fiction in your own work. Your … dampen the mood. Your intensity goes from 0 to 1000 when you become authentically interested in a topic. You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer: Superhuman Visual Generation (Hiring Immediately)

London, UK
Atmanlabs
proactive shopping assistance to personal teachers to healthcare concierges – and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Visual Foundation Models: Unlocking Human Engagement … in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning. You can easily distinguish authentic and high integrity thinkers from ‘posers’, while also critically evaluating truth from fiction in your own work. Your … dampen the mood. Your intensity goes from 0 to 1000 when you become authentically interested in a topic. You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative More ❯
Employment Type: Full-time
Posted:

Machine Learning/Semantic Web - FTC (6 Months) - West London

Egham, Surrey, United Kingdom
microTECH Global Limited
Machine Learning/Semantic Web - FTC (6 Months) - West London We have an opportunity available for an experienced AI/ML Engineer to join a team based in Egham. This position requires immediate start so we politely request that you only apply if you are available to start within … support. Focus on AI-driven energy management systems advancing sustainability and intelligent energy solutions for homes and businesses. Main Responsibilities: Develop and optimize machine learning models for disease prediction and early diagnosis. Process and analyze structured and unstructured health data - Implement deep learning for predictive healthcare applications. Contribute … AI-driven user personalization for visually impaired individuals. Develop AI-powered accessibility solutions. Ensure compliance with data privacy and ethical AI guidelines. Essential: Machine learning & deep learning - Proficiency in TensorFlow (2.x), PyTorch, Scikit-Learn. Strong skills in Python, experience with R or JavaScript is a plus. Experience with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

CTO - Entourage

London, United Kingdom
Moonsong Labs, Inc
insight that code serves as a universal, learnable language for agents to interact with their environment, we are developing a protocol that enables collective learning through a shared memory layer where mutually distrusting agents can exchange and validate their experiences. This protocol transforms individual code actions, trajectories and experiences … while reducing costs. This approach, combined with carefully designed token incentives and memory curation mechanisms, creates a self-reinforcing ecosystem where each agent's learning contributes to and benefits from the network's growing capability. Our approach goes beyond existing frameworks, creating the connective tissue for the emerging agent … integrations, scalability, and security. Recruiting, hiring, mentoring, and managing a world-class AI development team. Cultivating a culture of ownership, innovation, collaboration, and continuous learning essential for the fast-moving AI agent ecosystem. Engaging with developers across frameworks and tech stacks to understand their needs and incorporate feedback into More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

CTO - Entourage (Hiring Immediately)

London, UK
Web3 Foundation
insight that code serves as a universal, learnable language for agents to interact with their environment, we are developing a protocol that enables collective learning through a shared memory layer where mutually distrusting agents can exchange and validate their experiences. This protocol transforms individual code actions, trajectories and experiences … while reducing costs. This approach, combined with carefully designed token incentives and memory curation mechanisms, creates a self-reinforcing ecosystem where each agent's learning contributes to and benefits from the network's growing capability. Our approach goes beyond existing frameworks, creating the connective tissue for the emerging agent … integrations, scalability, and security. Recruiting, hiring, mentoring, and managing a world-class AI development team. Cultivating a culture of ownership, innovation, collaboration, and continuous learning essential for the fast-moving AI agent ecosystem. Engaging with developers across frameworks and tech stacks to understand their needs and incorporate feedback into More ❯
Employment Type: Full-time
Posted:

AI Engineer

London, United Kingdom
Hybrid / WFH Options
Birdie
up skill our broader engineering teams. Build and maintain AI infrastructure leveraging the latest technology approaches: Investigate cutting-edge approaches like generative AI or reinforcement learning to drive adoption and impact. Ensure seamless integration of AI into existing platform and customer-facing processes. We'd love to hear … from you if you have: At least 2 years of experience working in an AI engineering or machine learning role supporting customer-facing processes and infrastructure. Experience working in a cross functional product engineering squad with deep interest in understanding and solving end user problems. Proven expertise in building … scalable machine learning models and deploying them into production environments. Passionate about enabling others with tools, frameworks, and documentation-ensuring teams can independently leverage AI without creating bottlenecks. A technical leader who stays curious about emerging technologies (e.g., generative AI, reinforcement learning) and rapidly evaluates their pros More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Research Engineer, Applied ML

London, United Kingdom
Google Inc
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred Qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Artificial Intelligence Engineer

United Kingdom
Hybrid / WFH Options
WNTD
autonomous AI agents that can reason, plan, and take actions independently in complex environments. The ideal candidate will have experience with LLM-based agents, Reinforcement Learning (RL), event-driven architectures, and multi-agent systems . You will work on next-generation AI solutions that enhance automation, decision-making … CrewAI, OpenAI Function Calling, etc.). Hands-on experience with LLMs ( GPT, LLaMA, Claude, Mistral , etc.), diffusion model or GANs Hands-on experience with Reinforcement Learning (RL), Evolutionary Algorithms, or Graph Neural Networks . Strong experience with AI/ML framework such as TensorFlow, PyTorch or JAX . More ❯
Posted:

Senior Data Scientist London, UK (hybrid)

London, United Kingdom
Hybrid / WFH Options
Rakuten Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist London, UK (hybrid) (Hiring Immediately)

London, UK
Hybrid / WFH Options
Rakuten Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Full-time
Posted:

Senior Data Scientist (Hiring Immediately)

London, UK
Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Full-time
Posted:

Senior Data Scientist - Recommendations (Hiring Immediately)

London, UK
Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Full-time
Posted:

Staff Research Engineer, Applied ML

London, United Kingdom
WeAreTechWomen
years of experience in software development and with data structures/algorithms. 5 years of experience building and architecting large-scale, production quality Machine Learning (ML) systems. 5 years of experience in distributed development and large-scale data processing. Experience coding in either C++ or Python. Experience with ML … fundamentals, algorithms, and techniques, including supervised, unsupervised, and reinforcement learning, and experience in areas like natural language processing (NLP), computer vision, and generative AI. Preferred qualifications: Experience with generative models (e.g., diffusion models, GANs, transformers) for various media formats (e.g., text, image, video, audio), including prompt engineering, fine … tuning, and evaluation techniques. Experience with RL algorithms and frameworks, including policy gradient methods, Q-learning, and actor-critic architectures. Experience building and leading high-performing research or engineering teams, fostering a positive and inclusive culture. Experience being published in ML/AI conferences or journals, demonstrating a strong More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior AI Research Engineer (Hiring Immediately)

London, UK
Lawhive
complex multi-agent systems. Collaborate with engineers to transition experimental models into production-ready systems . Qualifications: Strong background in AI research, applied machine learning, and NLP . Experience with LLM model adaptation, fine-tuning, and inference optimization . Proficiency in Python, Pydantic, FastAPI, and working with LLM APIs … databases, embeddings, and structured AI retrieval . Hands-on experience with LLM-based planning, reasoning, and autonomous task execution . Familiarity with self-supervised learning, reinforcement learning, or adaptive AI techniques . Ability to translate academic AI research into practical experiments and working prototypes . Experience deploying More ❯
Employment Type: Full-time
Posted:

Specialist, Applied AI Engineer (Hiring Immediately)

London, UK
TN United Kingdom
Rapid prototyping and iteration skills, along with collaboration with customers and use case leads, are essential. You will apply techniques such as generative AI, reinforcement learning, and multi-agent systems to address real-world industrial challenges. Creativity in developing new AI use cases, rapid prototyping, and working closely … in industrial settings. Conduct testing, validation, and optimization of AI models to ensure reliability, safety, and compliance. Stay updated on advancements in generative AI, reinforcement learning, and multi-agent systems to foster innovation. Qualifications: Master's degree in Data Science. Proven thought leadership in applied Generative AI, demonstrated More ❯
Employment Type: Full-time
Posted:

Specialist, Applied AI Engineer

London, United Kingdom
Hybrid / WFH Options
Industrial and Financial Systems
and iterate AI solutions in collaboration with customers and use case leads is a must. You will be responsible for applying generative AI techniques, reinforcement learning, and multi-agent systems to solve real-world industrial challenges. This role demands a high level of creativity in developing new AI … and optimization of AI models to ensure reliability, safety, and regulatory compliance in industrial settings. Stay ahead of the latest advancements in generative AI, reinforcement learning, and multi-agent systems to drive innovation within the company. Qualifications Masters degree in Data Science. Thought leader and expert in applied More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Specialist, Applied AI Engineer (Hiring Immediately)

London, UK
Hybrid / WFH Options
IFS
and iterate AI solutions in collaboration with customers and use case leads is a must. You will be responsible for applying generative AI techniques, reinforcement learning, and multi-agent systems to solve real-world industrial challenges. This role demands a high level of creativity in developing new AI … and optimization of AI models to ensure reliability, safety, and regulatory compliance in industrial settings. Stay ahead of the latest advancements in generative AI, reinforcement learning, and multi-agent systems to drive innovation within the company. Qualifications Masters degree in Data Science. Thought leader and expert in applied More ❯
Employment Type: Full-time
Posted:

Head of Engineering (Hands-on 50%+ coding) (Hiring Immediately)

London, UK
algo1
VC-backed stealth startup revolutionising retail with cutting-edge AI. We're pioneering a platform that redefines the customer experience, leveraging the power of reinforcement learning, generative AI, and advanced recommendation systems. We're seeking a hands-on Head of Engineering with deep backend expertise and a passion … You will own the technical roadmap and drive its execution, focusing on building systems that can efficiently process and leverage large datasets for machine learning applications. Technical Leadership: Provide technical guidance and mentorship to other engineers as the team grows, fostering a culture of innovation and high-quality code. … Drive Innovation: Translate cutting-edge machine learning research into impactful products and solutions, leading the development process from concept to production. Collaborate closely with the data science team to bring AI-powered features to life. Technology Selection & Evaluation: Continuously evaluate and select the best technologies and tools to support More ❯
Employment Type: Full-time
Posted:

Research Software Engineer

Guildford, Surrey, United Kingdom
Hybrid / WFH Options
Ecm Selection
experience will also be considered) Full rights to work in the UK without limitation While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Engineer / Research Scientist, Multimodal (Hiring Immediately)

London, UK
Hybrid / WFH Options
Anthropic
multimodal data and study how they interact with text-only models at scale. Building Infrastructure We work on many infrastructure projects including: Complex multimodal reinforcement learning environments. High-performance RPC servers for processing image inputs. Sandboxing infrastructure for securely collecting data. Data Ingestion We are more interested in … Pick up slack, even if it goes outside your job description Enjoy pair programming (we love to pair!) Want to learn more about machine learning research Care about the societal impacts of your work Strong candidates may also have experience with: High performance, large-scale ML systems GPUs, Kubernetes … Pytorch, or OS internals Language modeling with transformers Reinforcement learning Large-scale ETL The expected salary range for this position is: Annual Salary: £250,000-£270,000 GBP Logistics Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience. Location-based More ❯
Employment Type: Full-time
Posted:

Process Improvement Specialist - 24 Month FTE, EU CF ACES PSE

London, United Kingdom
ENGINEERINGUK
Fulfillment Technology) Software, FC Launch, Supply Chain, and EU fulfillment center leaders Create standards for your process and training material in collaboration with the Learning & Development team, to secure high level of knowledge within the FC operations Collaborate with the extended EU ACES teams to define process standards and … process excellence and continuous improvement culture in a large organization Strong Process/benchmark auditing and non-compliance management experience Experience with modern machine learning: deep learning, online/reinforcement learning, semi-supervised and transfer learning Experience working with technology and software teams to align More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Reinforcement Learning
10th Percentile
£76,727
25th Percentile
£90,801
Median
£150,000
75th Percentile
£175,000