Reinforcement Learning Jobs in the South East

26 to 50 of 56 Reinforcement Learning Jobs in the South East

Full Stack Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world More ❯
Posted:

Full Stack Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world More ❯
Posted:

Lead ML Engineer (London)

london, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Lead our ‘AI Powered Education’ meet-up in London, building a network of ML specialists Essential skills 🙏 Experience leading and mentoring other ML More ❯
Posted:

Lead ML Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Lead our ‘AI Powered Education’ meet-up in London, building a network of ML specialists Essential skills 🙏 Experience leading and mentoring other ML More ❯
Posted:

Lead ML Engineer (London)

london (city of london), south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Lead our ‘AI Powered Education’ meet-up in London, building a network of ML specialists Essential skills 🙏 Experience leading and mentoring other ML More ❯
Posted:

Senior ML Engineer (London)

london, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … looking for a Senior ML Engineer with a proven track record of delivering ML models to production. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in production Experience working with More ❯
Posted:

Senior ML Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … looking for a Senior ML Engineer with a proven track record of delivering ML models to production. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in production Experience working with More ❯
Posted:

Senior ML Engineer (London)

london (city of london), south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … looking for a Senior ML Engineer with a proven track record of delivering ML models to production. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in production Experience working with More ❯
Posted:

Computer Vision Engineer

London, South East, England, United Kingdom
MicroTECH Global Ltd
Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. Contribute to NeRF-based systems with GPU-optimized kernel development, screen space sampling … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:

Computer Vision Engineer

london, south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Computer Vision Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Computer Vision Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Senior NLP Engineer (London)

london, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Senior NLP Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Senior NLP Engineer (London)

london (city of london), south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Member of Technical Staff

london, south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

Member of Technical Staff

slough, south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

Member of Technical Staff

london (city of london), south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

AI Engineer

london, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

AI Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

Deep Learning Engineer

slough, south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with More ❯
Posted:

Deep Learning Engineer

london, south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with More ❯
Posted:

Deep Learning Engineer

london (city of london), south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with More ❯
Posted:

AI SME

slough, south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:
Reinforcement Learning
the South East
10th Percentile
£72,750
25th Percentile
£76,875
Median
£102,500
75th Percentile
£121,250