Permanent Reinforcement Learning Jobs in London

22 of 22 Permanent Reinforcement Learning Jobs in London

Power Platform - London, UK

London, United Kingdom
Hybrid / WFH Options
Randstad Technologies Recruitment
experience in Investment Banking environment would be a plus Spanish would be a plus Mandatory Skills : Python, ServiceNow Orchestrator, Azure Cognitive Services, GenAI - LLMOps, RPA - Microsoft Power Automate, Machine Learning - AIOPS, Deep Learning - AIOPS, Reinforcement Learning - AIOPS Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to More ❯
Employment Type: Permanent
Salary: £60000 - £65000/annum
Posted:

Power Platform - London, UK

London, South East, England, United Kingdom
Hybrid / WFH Options
Randstad Technologies
experience in Investment Banking environment would be a plus Spanish would be a plus Mandatory Skills : Python, ServiceNow Orchestrator, Azure Cognitive Services, GenAI - LLMOps, RPA - Microsoft Power Automate, Machine Learning - AIOPS, Deep Learning - AIOPS, Reinforcement Learning - AIOPS Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to More ❯
Employment Type: Full-Time
Salary: £60,000 - £65,000 per annum
Posted:

Machine Learning and AI Engineer

London, United Kingdom
Nature Metrics
cohort members, we have the potential and the opportunity to change the way organisationsoperate. As we continue to grow,we'relooking for a highly motivated and skilled Machine Learning (ML) and Artificial Intelligence (AI) Engineer to contribute to the development and deployment of our advanced ML and AI solutions. In this hands-on role,you'llbe involved in building … recordof successfully contributing to the design, building and deployment of ML models into production environments. Expertisein Python and relevant ML libraries Experience with various ML algorithms (e.g. supervised, unsupervised, reinforcement learning) and deep learning frameworks (e.g. TensorFlow,PyTorch). Experience developing generative AI applications and deploying them into production. Experience working with cloud platforms, ideally Google Cloud More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior ML Engineer, GenAIID

London, United Kingdom
Amazon
Data Parallel (FSDP) and DeepSpeed, ensuring scalability and efficiency • LLM Customization & Fine-Tuning: Adapt LLMs for new languages, domains, and vision applications through continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF) • Model Optimization on AWS Silicon: Optimize AI models for deployment on AWS Inferentia and Trainium, leveraging the AWS Neuron SDK and developing custom … our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We're continuously raising … we feel supported in the workplace and at home, there's nothing we can't achieve in the cloud. BASIC QUALIFICATIONS • 5+ years of professional software development and machine learning experience • Proficiency in at least one programming language • Experience mentoring engineers, leading technical initiatives, or managing an engineering team • Hands-on experience with deep learning and machine learning More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Machine Learning Scientist

London, United Kingdom
Hybrid / WFH Options
Intercom
service. Driven by our core values, we push boundaries, build with speed and intensity, and consistently deliver incredible value to our customers. What's the opportunity? Intercom's Machine Learning team is responsible for defining new ML features, researching appropriate algorithms and technologies, and rapidly getting first prototypes in our customers' hands. We are an extremely product focussed team. … dedicated ML product engineers enable us to move to production fast, often shipping to beta in weeks after a successful offline test. We are very passionate about applying machine learning technology, and have productized everything from classic supervised models, to cutting-edge unsupervised clustering algorithms, to novel applications of transformer neural networks. We test and measure the real customer … field (e.g. MSc) Scientific thinking skills Track record shipping ML products PhD or other experience in a research environment Deep experience in an applicable ML area - E.g. NLP, Deep learning, Bayesian methods, Reinforcement learning, clustering Strong stats or math background Benefits We are a well treated bunch, with awesome benefits! If there's something important to you More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Engineer II, ROW AOP

London, United Kingdom
Amazon
world. As a Data Engineer, you will play a crucial role in supporting the team by creating and maintaining the data infrastructure necessary for the advanced analytics and machine learning solutions. Our team solves a broad range of problems that can be scaled across ROW (Rest of the World including countries like India, Australia, Singapore, MENA and LATAM). … Last Mile Channel Allocation • Using LLMs to automate analytical processes and insight generation • Ops research to optimize middle mile truck routes • Working with global partner science teams to affect Reinforcement Learning based pricing models and estimating Shipments Per Route for $MM savings • Deep Learning models to synthesize attributes of addresses • Abuse detection models to reduce network losses … attention to detail. - Excellent communication skills and ability to work in a collaborative team environment. PREFERRED QUALIFICATIONS - Experience working in a scientific or research-oriented environment. - Familiarity with machine learning workflows and model deployment. - Experience with Infrastructure as Code (IaC) by tools such as CDK. - Experience with streaming data processing and real-time analytics. - Experience with big data technologies More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist - Fixed Term Contract

London, United Kingdom
Hybrid / WFH Options
Faculty
roundtables, or by contributing to large-scale open-source projects. You will also have the opportunity to teach on the fellowship about topics that range from basic statistics to reinforcement learning, and to mentor the fellows through their 6-week project. Thanks to Faculty platform, you will have access to powerful computational resources, and you will enjoy the … become a fluent Python programmer in a short timeframe An excellent command of the basic libraries for data science (e.g. NumPy, Pandas, Scikit-Learn) and familiarity with a deep-learning framework (e.g. TensorFlow, PyTorch, Caffe) A high level of mathematical competence and proficiency in statistics A solid grasp of essentially all of the standard data science techniques, for example … supervised/unsupervised machine learning, model cross validation, Bayesian inference, time-series analysis, simple NLP, effective SQL database querying, or using/writing simple APIs for models. We regard the ability to develop new algorithms when an innovative solution is needed as a fundamental skill An appreciation for the scientific method as applied to the commercial world; a talent More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist - Fixed Term Contract

London, United Kingdom
Hybrid / WFH Options
Faculty
roundtables, or by contributing to large-scale open-source projects. You will also have the opportunity to teach on the fellowship about topics that range from basic statistics to reinforcement learning, and to mentor the fellows through their 6-week project. Thanks to Faculty platform, you will have access to powerful computational resources, and you will enjoy the … become a fluent Python programmer in a short timeframe An excellent command of the basic libraries for data science (e.g. NumPy, Pandas, Scikit-Learn) and familiarity with a deep-learning framework (e.g. TensorFlow, PyTorch, Caffe) A high level of mathematical competence and proficiency in statistics A solid grasp of essentially all of the standard data science techniques, for example … supervised/unsupervised machine learning, model cross validation, Bayesian inference, time-series analysis, simple NLP, effective SQL database querying, or using/writing simple APIs for models. We regard the ability to develop new algorithms when an innovative solution is needed as a fundamental skill A leadership mindset focussed on growing the technical capabilities of the team; a caring More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

West London, London, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Employment Type: Permanent
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

London, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

london, south east england, united kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

west london, south east england, united kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Posted:

Applied Scientist, Generative AI Innovation Center

London, United Kingdom
Amazon
Applied Scientist, Generative AI Innovation Center Job ID: Amazon Web Services Singapore Private Limited Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational … our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We're continuously raising our …/ML/NLP conferences or journals PREFERRED QUALIFICATIONS 2+ years demonstrated experience with Large Language Model (LLM) and Foundational Model post-training, continual pre-training, fine-tuning, or reinforcement learning techniques. Demonstrated experience with building LLM-powered agentic workflow, orchestration, and agent customization Track record of building and deploying ML models at scale Experience with model optimization More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Applied Scientist, Generative AI Innovation Center

London, United Kingdom
Amazon
Senior Applied Scientist, Generative AI Innovation Center Job ID: Amazon Web Services Singapore Private Limited Overview Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and … data mining, parallel and distributed computing, high- performance computing 2+ years demonstrated experience with Large Language Model (LLM) and Foundational Model post-training, continual pre-training, fine-tuning, or reinforcement learning techniques. Scientific publication track record at top-tier AI/ML/NLP conferences or journals PREFERRED QUALIFICATIONS Demonstrated experience with building LLM-powered agentic workflow, orchestration More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Data Scientist

London, United Kingdom
Hybrid / WFH Options
ECM Selection (Holdings) Limited
experimental, and it is understood that not all projects succeed, even failed projects contain valuable insights. You will be building upon cutting-edge ML techniques such as transformers and reinforcement learning to create novel multi-modal solutions. Examples include sensor fusion systems, physics-informed neural networks for simulations, and multi-purpose autonomous robots. Projects will be defence focused … surrounding area. Initially this is an 18-month contract with the expectation of extending this as more funding is released. Keywords: AI, ML, RF, EM, GNN, Transformer, Autoencoder, Reinforced Learning, Multi-Modal AI, Sensor Fusion, Python, PyTorch, Radio Frequency, RF Another top job from ECM, the high-tech recruitment experts. Even if this job's not quite right, do More ❯
Employment Type: Permanent
Salary: £50000 - £60000/annum DoE + Benefits
Posted:

Global Head of Data Science

London Area, United Kingdom
Harnham
We are working with a global lottery/gambling provider on a 'Group Head of Data Science' role - you’ll be responsible for building and scaling their global Machine Learning and AI function that drives commercial performance, enhances player engagement, and enables smarter, real-time decisioning across lottery, iGaming, and adjacent ventures. You’ll set the strategic vision, while … automated testing, monitoring, feature stores. Partner with Marketing, Digital, Technology, Innovation and local business units to embed intelligence into customer and operational workflows. Champion advanced analytics techniques such as reinforcement learning, causal inference, and generative AI. Build, mentor, and retain a diverse team of data science talent, fostering a culture of experimentation and excellence. Track and report on More ❯
Posted:

Global Head of Data Science

City of London, London, United Kingdom
Harnham
We are working with a global lottery/gambling provider on a 'Group Head of Data Science' role - you’ll be responsible for building and scaling their global Machine Learning and AI function that drives commercial performance, enhances player engagement, and enables smarter, real-time decisioning across lottery, iGaming, and adjacent ventures. You’ll set the strategic vision, while … automated testing, monitoring, feature stores. Partner with Marketing, Digital, Technology, Innovation and local business units to embed intelligence into customer and operational workflows. Champion advanced analytics techniques such as reinforcement learning, causal inference, and generative AI. Build, mentor, and retain a diverse team of data science talent, fostering a culture of experimentation and excellence. Track and report on More ❯
Posted:

Systems Optimisation Engineer

London, United Kingdom
Advanced Technical Recruitment
in software development for test automation (Python, C++, or C#). Experience with metaheuristic optimisation (e.g., GA, simulated annealing, particle swarm). Experience with AI/ML techniques (e.g., reinforcement learning, predictive modelling) for test optimisation. You should have a collaborative mindset to work closely with hardware engineers and manufacturing teams. A familiarity with production test time optimisation More ❯
Employment Type: Permanent
Salary: £60000 - £120000/annum Competitive £plus generous benefit
Posted:

AI Architect

City of London, London, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain, AutoGen, CrewAI, or similar Experience supporting More ❯
Posted:

AI Architect

London Area, United Kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain, AutoGen, CrewAI, or similar Experience supporting More ❯
Posted:

LLM Researcher

London, South East, England, United Kingdom
Hybrid / WFH Options
MicroTECH Global Ltd
and regulatory requirements in fintech (SOC2, PCI-DSS, GDPR). Ability to thrive in a fast-moving startup environment. Desirables: Background in fintech, payments, or treasury systems. Experience with reinforcement learning with human feedback (RLHF). More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:

Customer Success Manager

London, United Kingdom
Anima
you'll help us extend the 3 existing product lines we have, that millions of patients use, and build out new ones at the very cutting edge of healthcare reinforcement learning and agentic AI. Your work will save countless lives. Do you want to save lives with every clinic you help to deploy Anima? Use your influence and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Reinforcement Learning
London
10th Percentile
£86,675
25th Percentile
£92,500
Median
£122,500
75th Percentile
£175,000