Reinforcement Learning Jobs in Central London

26 to 38 of 38 Reinforcement Learning Jobs in Central London

Computer Vision Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Full Stack Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options
Euphoric
contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯
Posted:

Founding Software Engineer

City of London, London, United Kingdom
SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
Posted:

Founding Software Engineer

london (city of london), south east england, united kingdom
SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
Posted:

Member of Technical Staff

City of London, London, United Kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

Member of Technical Staff

london (city of london), south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

Artificial Intelligence Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Oho Group Ltd
you'll develop and deploy cutting-edge AI systems that help thousands of runners train smarter, stay motivated, and reach their goals. You’ll work across time-series analysis, reinforcement learning, and recommendation systems to bring true personalisation to our coaching engine. What You’ll Do: Design and deploy ML models Work closely with product and engineering teams More ❯
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

london (city of london), south east england, united kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
Posted:

AI SME

City of London, London, United Kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:

AI SME

london (city of london), south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:
Reinforcement Learning
Central London
10th Percentile
£111,500
25th Percentile
£113,750
Median
£130,000
75th Percentile
£158,750
90th Percentile
£168,500