Employment Type

Remote Jobs

Hybrid/WFH 19

Sort By

Relevance
Date

Locations

Job Titles

26 to 38 of 38 Reinforcement Learning Jobs in Central London

Computer Vision Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options

microTECH Global LTD

Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯

Posted: 11 days ago

Full Stack Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options

Euphoric

contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯

Posted: Yesterday

Founding Software Engineer

City of London, London, United Kingdom

SEEKR

high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯

Posted: Yesterday

Founding Software Engineer

london (city of london), south east england, united kingdom

SEEKR

Posted: Yesterday

Member of Technical Staff

City of London, London, United Kingdom

Cubiq Recruitment

founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯

Posted: 2 days ago

Member of Technical Staff

london (city of london), south east england, united kingdom

Cubiq Recruitment

Posted: 12 days ago

Artificial Intelligence Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Oho Group Ltd

you'll develop and deploy cutting-edge AI systems that help thousands of runners train smarter, stay motivated, and reach their goals. You’ll work across time-series analysis, reinforcement learning, and recommendation systems to bring true personalisation to our coaching engine. What You’ll Do: Design and deploy ML models Work closely with product and engineering teams More ❯

Posted: Yesterday

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom

Augmentti

optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯

Posted: Yesterday

New Trading Team's 1st C++ Quant Developer | HFT

london (city of london), south east england, united kingdom

Augmentti

Posted: Yesterday

AI SME

City of London, London, United Kingdom

Lorien

non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯

Posted: 2 days ago

AI SME

london (city of london), south east england, united kingdom

Lorien

Posted: 2 days ago

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Amber Labs

the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯

Posted: 4 days ago

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options

Amber Labs

Posted: 9 days ago

Salary Guide

Reinforcement Learning
Central London

10th Percentile: £111,500
25th Percentile: £113,750
Median: £130,000
75th Percentile: £158,750
90th Percentile: £168,500

More Reinforcement Learning insights