Reinforcement Learning Jobs in the City of London

21 of 21 Reinforcement Learning Jobs in the City of London

Generative AI Architect (Open AI) Professional Services

City of London, London, United Kingdom
HCLTech
grade workloads, focusing on efficiency, adaptability, and security. Algorithm & Model Development: Develop and optimize AI algorithms for predictive analytics, recommendation engines, and automation in professional services. Apply machine learning (ML) and deep learning (DL) techniques to enhance decision-making and business intelligence. Data Engineering & Integration: Architect robust data pipelines to support AI model training and inference. Implement … data inputs. Collaborate with data engineering teams to align AI solutions with real-world business applications. AI Model Training, Evaluation & Performance Optimization: Train AI models using supervised, unsupervised, and reinforcement learning techniques, refining them based on business insights. Define model performance metrics to ensure AI solutions deliver tangible value. Real-Time AI Applications & System Integration: Work alongside … and collaboration skills in cross-functional teams. Preferred Skills: High-Performance Computing (HPC) and AI workloads for large-scale enterprise solutions. NVIDIA CUDA, cuDNN, TensorRT experience for deep learning acceleration. Big Data platforms (Hadoop, Spark) for AI-driven analytics in professional services. Please share your CV at payal.c@hcltech.com

Experienced CRM Data Scientist

City of London, London, United Kingdom
Hybrid / WFH Options
EveryMatrix
Develop predictive models to forecast customer activities and trends. • Collaborate with cross-functional teams to implement data-driven strategies. • Define and track key performance indicators (KPIs). • Utilize machine learning techniques to optimize CRM strategies, especially related to automated campaign optimization. • Communicate findings and recommendations to stakeholders in a clear and concise manner. Requirements: • Master's or PhD degree …/B tests. • Familiarity with Bayesian statistics, especially for hypothesis testing. Extra points for good knowledge of Bayesian programming (PyMC3, etc.). • Experience with application of optimization theory or reinforcement learning based on automated A/B testing. Projects you might expect to work on: • Predicting future expected value and future potential value of individual customers. • Real-time … work from home per quarter (during your first 3 months, you'll be in the office full-time). Benefits & Perks: • Monthly meal allowance. • Gym allowance. • Access to online learning platforms like Udemy for Business and LinkedIn Learning, and a budget for external training. • Support for New Parents: - 52 weeks of maternity leave, with 21 weeks being fully paid.

Senior Data Scientist

City of London, London, United Kingdom
algo1
About Us We are a VC-backed startup focused on behavioural AI, currently in stealth. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and Generative AI. Our platform is designed from the ground up — no legacy, no patchwork systems — just a clean slate and a clear vision. Our mission is to build products … customers love by harnessing cutting-edge AI to transform their shopping experience. About the Job We are looking for a Senior Data Scientist with experience in bringing advanced machine learning and data science systems to production to work with our team of industry-leading domain experts and engineers. You'll be working across our entire data science stack, from … advanced recommender systems to comprehensive performance analytics. Key Responsibilities: Design and implement scalable machine learning for complex data analysis, optimised recommendations, and predictive modelling. Translate the latest advances in machine learning into impactful solutions and products, from rapid MVPs to fully deployed, production-ready systems. Bring your models to production and optimise for inference in edge computing environments.

Senior Research Engineer

City of London, London, United Kingdom
algo1
About Us We are a VC-backed startup focused on behavioural AI, currently in stealth. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and Generative AI. Our platform is designed from the ground up — no legacy, no patchwork systems — just a clean slate and a clear vision. Our mission is to build products … that customers love by harnessing cutting-edge AI to transform their shopping experience. About the Job We are looking for a Senior Research Engineer with experience in advanced machine learning systems to work with our team of industry-leading domain experts and engineers to build foundation models for retail shopping. Key Responsibilities: Translate latest ML advances into impactful solutions … validate their effectiveness, and deploy them to production systems. Innovate upon state-of-the-art ML research applied to consumer personalisation. Optimise machine learning models for performance in cloud environments (e.g., distributed clusters, GPUs). Essential Qualifications: 3-5+ years implementing applied machine learning research in a production setting. MSc in Computer Science, Machine Learning, or …

AI Research Residency

City of London, London, United Kingdom
Hybrid / WFH Options
MediaTek
pursuit. Program Highlights: Mentorship: Collaborate with experienced researchers and engineers on innovative AI projects. Research Focus: Engage in areas such as large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models. Professional Development: Gain hands-on experience in AI research with real-world applications, contributing to both academic publications and product development … leading universities and contribute to the broader scientific community. Work Arrangement: Benefit from a hybrid work model, combining remote and on-site collaboration. Responsibilities: Conduct innovative research in machine learning and artificial intelligence. Develop and implement algorithms. Collaborate with cross-functional teams to integrate AI solutions. Publish research findings in top-tier conferences and journals. Stay up to date … with the latest advancements in AI and related fields. Qualifications Required: PhD in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field. Proficiency in programming languages such as Python, C++, or similar. Strong problem-solving skills and the ability to work both independently and collaboratively. Excellent communication skills, with the ability to present complex ideas clearly …

Quant Developer

City Of London, England, United Kingdom
Hybrid / WFH Options
Glocomms
tooling, and models that will drive alpha generation and risk management for years to come. As a Quant Developer, you will work at the intersection of quantitative research, machine learning, and software engineering. You'll collaborate with quants, data scientists, and portfolio managers to design and implement scalable systems for data ingestion, model training, and real-time signal deployment. … to architectural decisions and technology stack selection for the greenfield platform. Required Skills & Experience: Strong programming skills in Python, with experience in production-grade systems. Solid understanding of machine learning workflows, including model training, validation, and deployment. Experience with quantitative finance, including time series analysis, alpha modelling, or risk analytics. Familiarity with cloud infrastructure (e.g., AWS, GCP) and containerisation … engineering tools. Experience working in fast-paced, collaborative environments with agile methodologies. Nice to Have: Prior experience in a hedge fund, prop trading firm, or investment bank. Exposure to reinforcement learning, deep learning, or LLMs in financial contexts. Knowledge of market microstructure and execution algorithms. Contract Details: Duration: 12 months. Start Date: July 2025. Location: London (Hybrid …

Machine Learning Engineer (PhD)

City of London, London, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
leader in advanced computing, with a dedicated research team focused on applying artificial intelligence to next-generation semiconductor design and optimization. Role Overview: We're seeking a motivated Machine Learning Researcher with a strong background in machine learning, AI, or related fields. You’ll contribute to innovative projects in areas such as large language models (LLMs), reinforcement learning, and optimization for chip design and AI system integration. Responsibilities: Conduct and publish cutting-edge AI/ML research. Design algorithms for chip optimization and intelligent systems. Collaborate with engineering teams to integrate AI into real-world tools. Stay current on AI trends and contribute to open research. Requirements: PhD or equivalent experience in ML, AI, CS … physics, or mathematics. Strong publication record (NeurIPS, ICML, ICLR, etc.). Proficient in Python, C++, and deep learning frameworks (e.g., PyTorch, TensorFlow). Solid grasp of ML techniques; independent and team-oriented mindset. Preferred: Experience in LLMs, reinforcement learning, or chip design. Familiarity with JAX and optimization frameworks. Why Join Us: Work on impactful AI research with real-world …

Machine Learning Researcher

City of London, London, United Kingdom
Adamas Knight
the Role We’re looking for smart and curious individuals from industry and academia to join our client's growing AI Lab and push the boundaries of applied deep learning in trading. On their AI team, you’ll build and train deep learning models that directly power their trading strategies, supported by a massive and rapidly expanding compute … next, you’re analysing how a model trades in production or designing a new architecture to capture subtle market signals. They will rely on your deep knowledge of deep learning, whether your background is in LLMs, recsys, image models, RL agents, or classical methods, to help shape the next generation of their ML-driven trading. You’ll also contribute … and conference travel. Who We’re Looking For We’re open to a range of backgrounds and experiences, but the ideal candidate will have: An advanced degree in machine learning, statistics, applied math, or a related discipline; or equivalent experience in industry applying ML to challenging problems Expertise in one or more of: deep learning, reinforcement learning …

Machine Learning Engineer – Computer Vision Focus

City of London, London, United Kingdom
Hybrid / WFH Options
Brio Digital
Job Title: Machine Learning Engineer – Computer Vision Focus. Overview: We’re looking for a skilled Machine Learning Engineer to join a growing technology company building cutting-edge solutions for real-world automation. You’ll be part of a small, collaborative team applying computer vision to improve performance, efficiency, and user experience across multiple sectors. This role offers the … chance to work on high-impact machine learning problems, shape production-ready models, and contribute to the development of a platform that’s democratising access to AI-driven automation. Key Responsibilities: Model Development: Design, train, and deploy machine learning models for computer vision use cases such as object detection, classification, and segmentation. Data Handling: Collaborate with data engineers … Research & Innovation: Keep current with advances in ML and CV, and apply new methods to solve business problems. Your Profile: Master’s degree (or equivalent) in Computer Science, Machine Learning, or a related field. 3+ years of experience deploying ML models in production. Proficient in Python and ML frameworks (e.g., TensorFlow, PyTorch). Experience working with cloud platforms and …

Full Stack Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world …

Robotics Control Engineer - Reinforcement Learning

City of London, London, United Kingdom
Humanoid
robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. We’re seeking a highly skilled Senior Reinforcement Learning (RL) Control Engineer to develop locomotion and whole-body control skills for our humanoid robots. You’ll be at the cutting edge of robotics, responsible for developing … and adaptability. This role focuses on designing robust controllers for walking, balancing while manipulating, fall recovery, and other advanced mobility tasks. We’re seeking candidates with deep expertise in reinforcement learning and a strong track record of deploying control systems on physical robots. Our Mission: At Humanoid we strive to create the world’s leading, commercially scalable, safe … conditions. Collaborate closely with software and perception teams to integrate control strategies into the full-stack robotic system. Required Qualifications: Master’s or PhD in Robotics, Control Systems, Machine Learning, or a related field. At least 3 years of experience in the design and implementation of control systems for legged robots, focusing on locomotion. Strong expertise in reinforcement …

Machine Learning Engineer

City of London, London, Finsbury Square, United Kingdom
The Portfolio Group
An exceptional opportunity for a Machine Learning Engineer (with Full-Stack experience) to join an innovative market leader at the forefront of developing next-generation solutions that transform digital interactions. The role will focus on projects to leverage state-of-the-art generative AI, retrieval-augmented generation (RAG), and reasoning frameworks to build intelligent and context-aware systems. We … are seeking talented Machine Learning Engineers with full-stack software development experience to join our client's team and help shape the future of AI-powered automation. Within this dynamic role varied duties will include: Search relevancy engineering. Conversational AI Development: Design, train, fine-tune, and deploy LLMs with reasoning capabilities. Retrieval-Augmented Generation (RAG): Implement, optimise, and scale … RAG pipelines for effective information retrieval from structured and unstructured sources. Model Fine-Tuning & Training: Train domain-specific models using techniques like LoRA, QLoRA, PEFT, reinforcement learning, and supervised fine-tuning (SFT). Model Deployment & Inferencing: Optimise model serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration frameworks. Multi-Agent Systems: Develop and integrate agentic capabilities …
Employment Type: Permanent

AI Research Scientist

City of London, London, United Kingdom
Hybrid / WFH Options
Anson Mccade
highly motivated Research Scientist to lead work in knowledge and semantics. This role offers the opportunity to lead a specialist R&D team working at the intersection of machine learning, NLP, and graph technologies, contributing to real-world challenges across defence, security, and commercial sectors. Key Responsibilities: Lead a team of researchers and engineers across projects involving NLP, knowledge … graphs, and LLM applications. Design and deliver research programmes, collaborating with universities and innovation partners. Develop prototypes, proofs of concept, and technical solutions in graph machine learning and semantic technologies. Secure funding through successful proposal writing and manage stakeholder engagement. Contribute to multi-disciplinary teams including work in reinforcement learning, computer vision, RF & EW, and sonar. Essential …
Employment Type: Permanent, Work From Home

Research Scientist

City of London, London, United Kingdom
Adamas Knight
team. This lab is composed of researchers from elite institutions and industry labs including OpenAI, Google DeepMind, and Microsoft AI, and is focused on advancing the SOTA in LLMs, reinforcement learning, and deep learning for complex systems. The team is building the models with the goal of powering the next generation of AI supercomputing systems. The Role … As a Research Scientist, you’ll be at the heart of groundbreaking research, working on core problems in deep learning, generative modelling, and RL. With larger compute resources per capita than any other tech company, you’ll also have the opportunity to conduct and publish your research in top-tier conferences, such as NeurIPS, ICML, and ICLR, and … collaborate with world-renowned researchers across multiple DL disciplines. You might be a fit if you have... PhD in Machine Learning, Computer Science, Mathematics or related field; Strong track record in research in Big Tech; Deep technical expertise in training, fine-tuning, or scaling deep learning models; Experience developing models in language modelling, reinforcement learning, or …

Head of Applied AI - Robotics

City of London, London, United Kingdom
Hybrid / WFH Options
Acquired Talent Ltd
opening up possibilities to build far richer, more capable intelligent behaviours. This is a unique opportunity to help build a dedicated AI function focused on embedding cutting-edge multimodal learning models into next-gen robotic platforms. The role will involve defining the long-term roadmap for how intelligence is deployed across the stack, from perception through to action—combining … unified, responsive behaviours. To be successful in this role, you’ll need to bring: Strong technical leadership across applied AI/ML, with deep hands-on experience in robotic learning or embodied intelligence. A solid background in multimodal model development—especially in areas that combine computer vision, language understanding, and interactive learning (LLM, VLM or VLA). Real-world … deployment experience of learning-based systems, ideally within robotic or physical environments (embodied systems (AI) & reinforcement learning). Comfort collaborating across functions, especially with engineering, hardware, and system design teams. Solid programming and prototyping skills using modern deep learning frameworks (e.g. PyTorch, TensorFlow, JAX). Location: London (hybrid, with 4 days onsite and flexible hours). This is a …

AI Solution Architect (Agentic & Autonomous Systems)

City of London, London, United Kingdom
Hybrid / WFH Options
Staffworx
pipelines. Knowledge of AI chatbot frameworks (Rasa, BotPress, Dialogflow) or custom LLM-based UIs. Awareness of AI governance, model auditing, and data privacy regulation (GDPR, DPA, etc.). Familiarity with reinforcement learning, retrieval-augmented generation (RAG), or dynamic reasoning workflows. Offer Details 💼 Role Type: Contract (outside IR35) or Permanent 💸 Compensation: Market rates or six-figure salary 📍 Location: London EC4

Robotics & Machine Learning Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Opus Recruitment Solutions
Machine Learning Engineer – Robotic Manipulation | Cutting-Edge Space Tech | Equity Opportunity 🌌 Location: Hybrid (UK-based) Salary: Competitive + Equity Join a stealth-mode innovator at the forefront of autonomous robotics, building next-gen systems that operate in the most extreme environments—space. This is your chance to shape the future of orbital infrastructure and space domain defence. We're … on the hunt for a Machine Learning Engineer with a passion for robotic manipulation. You’ll be the mastermind behind intelligent grasping systems—designing, training, and deploying ML models that enable robots to interact with unknown objects in unpredictable environments. This isn’t just another ML role. It’s a chance to take your work from simulation to spaceflight. … across robotics, vision, hardware, and software teams to bring your models to life. Own performance benchmarking and real-world validation. 🧠 What You Bring 5+ years in ML, Robotics, or Reinforcement Learning (or equivalent hands-on experience). Deep understanding of robot manipulation—both classical and ML-based approaches. Proficiency in Python & C++, and experience with ML frameworks like …

Senior AI Engineer

City of London, London, United Kingdom
Nume
tool-using agents (MCP, API integration, data processing) Methods for agent monitoring, logging, and performance analysis Experience with model fine-tuning and evaluation for domain-specific applications Background in reinforcement learning or agent training methodologies Why you? You dream big You want to be part of a highly skilled and passionate team striving to create an amazing product …

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on …

Head of Product

City of London, London, United Kingdom
algo1
by a top-tier VC, building products that will redefine the retail experience. Currently in stealth mode, we’re blending cutting-edge advances in behavioural science, Generative AI, and reinforcement learning to create systems that anticipate and serve customer needs in radically new ways. We’re looking for a Head of Product to support our next phase of …

Senior Software Engineer

City of London, London, United Kingdom
Humanoid
directly with robots in both simulation and real hardware environments. You will be part of a focused team responsible for the application-level software that connects control, navigation, perception, learning, and platform systems. Your work will ensure that these components operate as a coherent and reliable system that users can interact with seamlessly. This role involves close collaboration with … practical, cross-disciplinary challenges. What You Will Do: You will develop and maintain application-level software for humanoid robots. You will integrate software components from controls, navigation, computer vision, reinforcement learning, and platform teams. You will contribute to the structure and evolution of the application architecture and its interfaces. You will work closely with the product and engineering … closely related field. You are highly proficient in C++ and have experience delivering production-grade software. You have a solid understanding of robotic subsystems including control, perception, navigation, and learning. You are familiar with ROS or ROS2 or equivalent middleware platforms. You are comfortable reading, understanding, and integrating code from a range of other engineering teams. You have experience …
Reinforcement Learning, the City of London
25th Percentile: £125,000
Median: £150,000
75th Percentile: £175,000