Remote Reinforcement Learning Jobs in London

25 of 25 Remote Reinforcement Learning Jobs in London

Founding Engineer

London Area, United Kingdom
Hybrid / WFH Options
Explore Group
Founding Software Engineer (React and Python) About the Opportunity You will be joining an elite founding engineering team that includes the world’s number one authority in Multi Agent Reinforcement Learning, seasoned Meta Product leaders, and one of the only people in the UK who has trained a trillion token language model from scratch. This role is based … a pivotal role in shaping and scaling the technology platform from the ground up. You will work alongside leading researchers and product strategists at the intersection of cutting edge reinforcement learning and large scale foundational models. You will enjoy a rare opportunity to influence direction, architecture and go to market execution, with both immediate impact and long term … product strategists to refine MVPs, prototype innovative workflows and accelerate iterations to production • Scale and optimise infrastructure to support high throughput training, inference and deployment of both multi agent reinforcement learning systems and enormous language models • Implement robust MLOps best practices including data pipelines, model versioning, monitoring, automated deployment and continuous integration • Drive end to end ownership from More ❯
Posted:

Founding Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Explore Group
Founding Software Engineer (React and Python) About the Opportunity You will be joining an elite founding engineering team that includes the world’s number one authority in Multi Agent Reinforcement Learning, seasoned Meta Product leaders, and one of the only people in the UK who has trained a trillion token language model from scratch. This role is based … a pivotal role in shaping and scaling the technology platform from the ground up. You will work alongside leading researchers and product strategists at the intersection of cutting edge reinforcement learning and large scale foundational models. You will enjoy a rare opportunity to influence direction, architecture and go to market execution, with both immediate impact and long term … product strategists to refine MVPs, prototype innovative workflows and accelerate iterations to production • Scale and optimise infrastructure to support high throughput training, inference and deployment of both multi agent reinforcement learning systems and enormous language models • Implement robust MLOps best practices including data pipelines, model versioning, monitoring, automated deployment and continuous integration • Drive end to end ownership from More ❯
Posted:

AI Research Internship - Students Pursing PhD

London, England, United Kingdom
Hybrid / WFH Options
MediaTek
the other dedicated to fundamental research that supports both our applications and the broader scientific community. Current areas of interest include large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models. Responsibilities:- Contribute to ongoing research in machine learning and artificial intelligence Help develop and implement algorithms Collaborate with researchers and … preparing publications and technical reports Stay up to date with the latest advancements in AI and related fields Requirement Qualifications Required: Currently enrolled in a PhD program in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field Strong interest in research and a background in machine learning or a related area Experience with programming languages … or similar Strong problem-solving skills and ability to work independently and collaboratively Good communication skills and ability to present complex ideas clearly Nice-to-have Experience in optimization, reinforcement learning, and/or large language models (LLMs) Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch, JAX) Previous research publications or submitted papers Why Join Us Gain More ❯
Posted:

Artificial Intelligence Researcher

City of London, London, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
or London, UK This is a permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring … LLMs, and generative modelling, helping us build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. Design experiments to align … and human feedback teams to build scalable alignment pipelines. Publish findings in top-tier AI conferences and contribute to open-source frameworks. Key Requirements: PhD in Computer Science, Machine Learning, or related field. Publications at NeurIPS, ICML, ICLR, ACL, or related venues. Deep expertise in Reinforcement Learning (policy optimisation, reward modelling, RLHF). Hands-on experience training More ❯
Posted:

Artificial Intelligence Researcher

London Area, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
or London, UK This is a permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring … LLMs, and generative modelling, helping us build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. Design experiments to align … and human feedback teams to build scalable alignment pipelines. Publish findings in top-tier AI conferences and contribute to open-source frameworks. Key Requirements: PhD in Computer Science, Machine Learning, or related field. Publications at NeurIPS, ICML, ICLR, ACL, or related venues. Deep expertise in Reinforcement Learning (policy optimisation, reward modelling, RLHF). Hands-on experience training More ❯
Posted:

Machine Learning Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Experis UK
Job Title: Machine Learning Engineer Location: London, UK (Hybrid – 2–3 days onsite per week) Contract Type: Contract Duration: 6–12 months (possibility of extension) Start Date: ASAP Overview We are seeking an experienced Machine Learning Engineer to join our data science and AI engineering team on a contract basis in London. The ideal candidate will be responsible … for designing, developing, and deploying machine learning models and scalable data pipelines that support advanced analytics and intelligent automation initiatives. This role offers a hybrid work arrangement , combining flexibility with collaboration, and is ideal for a contractor who thrives in fast-paced, data-driven environments. Key Responsibilities Design, build, and deploy machine learning models and AI-driven solutions … environments. Stay current with emerging trends in AI/ML technologies and contribute to innovation within the organisation. Required Skills & Experience Proven experience (3–5+ years) as a Machine Learning Engineer , Data Scientist , or similar role. Strong programming skills in Python (experience with libraries such as TensorFlow, PyTorch, scikit-learn, pandas, NumPy). Solid understanding of machine learning More ❯
Posted:

Machine Learning Engineer

London Area, United Kingdom
Hybrid / WFH Options
Experis UK
Job Title: Machine Learning Engineer Location: London, UK (Hybrid – 2–3 days onsite per week) Contract Type: Contract Duration: 6–12 months (possibility of extension) Start Date: ASAP Overview We are seeking an experienced Machine Learning Engineer to join our data science and AI engineering team on a contract basis in London. The ideal candidate will be responsible … for designing, developing, and deploying machine learning models and scalable data pipelines that support advanced analytics and intelligent automation initiatives. This role offers a hybrid work arrangement , combining flexibility with collaboration, and is ideal for a contractor who thrives in fast-paced, data-driven environments. Key Responsibilities Design, build, and deploy machine learning models and AI-driven solutions … environments. Stay current with emerging trends in AI/ML technologies and contribute to innovation within the organisation. Required Skills & Experience Proven experience (3–5+ years) as a Machine Learning Engineer , Data Scientist , or similar role. Strong programming skills in Python (experience with libraries such as TensorFlow, PyTorch, scikit-learn, pandas, NumPy). Solid understanding of machine learning More ❯
Posted:

Machine Learning Engineer (0–3 Years Experience)

London, South East, England, United Kingdom
Hybrid / WFH Options
IT Graduate Recruitment
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking for curious minds to help us shape the future of … edge of ML/AI research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning, LLMs, and MLOps, vector databases … Flexible working — remote-first culture with in-person team sessions for collaboration. Career acceleration — opportunities to own projects, lead development, and shape the product roadmap. An environment that values learning, creativity, and personal growth over bureaucracy. Perfect For Graduates or junior engineers with a passion for AI/ML looking to break into applied LLM engineering. Researchers or data More ❯
Employment Type: Full-Time
Salary: £45,000 - £75,000 per annum, OTE
Posted:

Machine Learning Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
We are partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers . This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded in 2023, the company has already secured significant early … OpenAI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid) : London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using AI technology to solve complex … product team, to identify and capitalise on high-impact opportunities Keep the company at the forefront of innovation by exploring and applying the latest research Requirements: MSc in Machine Learning, Computer Science, or a related field 5+ years of experience in machine learning, with an expertise in modelling, prototyping, and evaluation Strong software engineering skills with a focus More ❯
Posted:

Machine Learning Engineer

London Area, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
We are partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers . This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded in 2023, the company has already secured significant early … OpenAI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid) : London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using AI technology to solve complex … product team, to identify and capitalise on high-impact opportunities Keep the company at the forefront of innovation by exploring and applying the latest research Requirements: MSc in Machine Learning, Computer Science, or a related field 5+ years of experience in machine learning, with an expertise in modelling, prototyping, and evaluation Strong software engineering skills with a focus More ❯
Posted:

Artificial Intelligence Engineer

London Area, United Kingdom
Hybrid / WFH Options
Intellect Group
artificial intelligence, large language models (LLMs), and cloud-native applications . This is an exciting opportunity for an engineer with solid experience in AI development, software engineering, or machine learning systems to work on projects that deliver real business impact across the banking and finance sector . You’ll collaborate with an international team of AI engineers, data scientists … to Have Experience with prompt engineering , fine-tuning , or AI agents Understanding of retrieval-augmented generation (RAG) systems and semantic search Exposure to enterprise AI security , multimodal AI , or reinforcement learning (RLHF) Benefits 💰 Competitive Salary: Up to £70,000 + annual performance bonus 🏡 Hybrid Working: Flexible blend of office and remote work 📈 Career Development: Continuous learning, technical More ❯
Posted:

Artificial Intelligence Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Intellect Group
artificial intelligence, large language models (LLMs), and cloud-native applications . This is an exciting opportunity for an engineer with solid experience in AI development, software engineering, or machine learning systems to work on projects that deliver real business impact across the banking and finance sector . You’ll collaborate with an international team of AI engineers, data scientists … to Have Experience with prompt engineering , fine-tuning , or AI agents Understanding of retrieval-augmented generation (RAG) systems and semantic search Exposure to enterprise AI security , multimodal AI , or reinforcement learning (RLHF) Benefits 💰 Competitive Salary: Up to £70,000 + annual performance bonus 🏡 Hybrid Working: Flexible blend of office and remote work 📈 Career Development: Continuous learning, technical More ❯
Posted:

Full Stack Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world More ❯
Posted:

Full Stack Engineer

London Area, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world More ❯
Posted:

AI Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Lorien
equivalent architectures. Work closely with data engineers, researchers, and platform teams to ensure robust deployment. Continuously research and integrate emerging techniques in agent-based AI, multi-agent systems, and reinforcement learning. Required Skills & Experience: 4+ years of experience in AI/ML engineering or data-intensive systems. Strong proficiency in Python for AI, ML, and data engineering tasks. Deep More ❯
Employment Type: Contractor
Rate: Salary negotiable
Posted:

Computer Vision Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Computer Vision Engineer

London Area, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Full Stack Engineer (AI)

London Area, United Kingdom
Hybrid / WFH Options
Euphoric
contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯
Posted:

Full Stack Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options
Euphoric
contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯
Posted:

Artificial Intelligence Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Oho Group Ltd
you'll develop and deploy cutting-edge AI systems that help thousands of runners train smarter, stay motivated, and reach their goals. You’ll work across time-series analysis, reinforcement learning, and recommendation systems to bring true personalisation to our coaching engine. What You’ll Do: Design and deploy ML models Work closely with product and engineering teams More ❯
Posted:

Artificial Intelligence Engineer

London Area, United Kingdom
Hybrid / WFH Options
Oho Group Ltd
you'll develop and deploy cutting-edge AI systems that help thousands of runners train smarter, stay motivated, and reach their goals. You’ll work across time-series analysis, reinforcement learning, and recommendation systems to bring true personalisation to our coaching engine. What You’ll Do: Design and deploy ML models Work closely with product and engineering teams More ❯
Posted:

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

AI Engineer

London Area, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

Geospatial Analyst - 51201

City of London, London, United Kingdom
Hybrid / WFH Options
Turing
Role Overview: We’re looking for Geospatial Experts to help advance AI systems through Supervised Fine-Tuning (SFT), Reinforcement Learning with Human Feedback (RLHF), and Evaluation (Evals). In this role, you’ll apply your domain expertise to assess, improve, and validate AI performance on complex geospatial reasoning and decision-making tasks, contributing to real-world applications with More ❯
Posted:

Geospatial Analyst - 51201

London Area, United Kingdom
Hybrid / WFH Options
Turing
Role Overview: We’re looking for Geospatial Experts to help advance AI systems through Supervised Fine-Tuning (SFT), Reinforcement Learning with Human Feedback (RLHF), and Evaluation (Evals). In this role, you’ll apply your domain expertise to assess, improve, and validate AI performance on complex geospatial reasoning and decision-making tasks, contributing to real-world applications with More ❯
Posted:
Reinforcement Learning
London
10th Percentile
£66,650
25th Percentile
£88,750
Median
£95,000
75th Percentile
£100,000
90th Percentile
£123,000