Reinforcement Learning Jobs in London

76 to 87 of 87 Reinforcement Learning Jobs in London

New Trading Team's 1st C++ Quant Developer | HFT

London Area, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on …

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on …

AI SME

London Area, United Kingdom
Lorien
non-technical stakeholders. Experience: Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance. Experience working with pension administration systems. Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes. Experience with AWS, Azure, or Google Cloud AI services.

AI SME

City of London, London, United Kingdom
Lorien
non-technical stakeholders. Experience: Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance. Experience working with pension administration systems. Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes. Experience with AWS, Azure, or Google Cloud AI services.

AI SME

London, South East England, United Kingdom
Lorien
non-technical stakeholders. Experience: Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance. Experience working with pension administration systems. Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes. Experience with AWS, Azure, or Google Cloud AI services.

AI SME

London (City of London), South East England, United Kingdom
Lorien
non-technical stakeholders. Experience: Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance. Experience working with pension administration systems. Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes. Experience with AWS, Azure, or Google Cloud AI services.

AI SME - DB Pensions

London, South East, England, United Kingdom
Lorien
non-technical stakeholders. Experience: Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance. Experience working with pension administration systems. Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes. Experience with AWS, Azure, or Google Cloud AI services. Carbon60, Lorien & SRG - The Impellam Group STEM …
Employment Type: Contractor
Rate: Salary negotiable

AI Engineer

London Area, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

AI Engineer

London, South East England, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

AI Engineer

London (City of London), South East England, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.

Project Engineer - English Language

London, United Kingdom
Deep Learning Engineer Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … construction, and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you'll be solving cutting-edge challenges in deep learning and embodied intelligence. You'll work across the full lifecycle: training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with data and teleoperations teams to define quality standards and drive diverse data collection. Integrate multimodal inputs (vision, audio, proprioception, LiDAR/point clouds). Collaborate with MLOps teams to scale distributed training and deliver efficient models for real-time deployment. 3+ years' experience building and deploying deep learning systems, with …

Reinforcement Learning, London
10th Percentile: £66,650
25th Percentile: £88,750
Median: £95,000
75th Percentile: £100,000
90th Percentile: £123,000