Employment Type

Remote Jobs

Hybrid/WFH 18

Sort By

Relevance
Date

Locations

Central London 37

Job Titles

26 to 37 of 37 Reinforcement Learning Jobs in the City of London

Computer Vision Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options

microTECH Global LTD

Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯

Posted: 11 days ago

Senior NLP Engineer (London)

City of London, London, United Kingdom

Glite Tech

a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯

Posted: 2 days ago

Senior NLP Engineer (London)

london (city of london), south east england, united kingdom

Glite Tech

Posted: 17 days ago

Full Stack Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options

Euphoric

contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯

Posted: Yesterday

Founding Software Engineer

City of London, London, United Kingdom

SEEKR

high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯

Posted: Yesterday

Founding Software Engineer

london (city of london), south east england, united kingdom

SEEKR

Posted: Yesterday

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom

Augmentti

optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯

Posted: Yesterday

New Trading Team's 1st C++ Quant Developer | HFT

london (city of london), south east england, united kingdom

Augmentti

Posted: Yesterday

AI SME

City of London, London, United Kingdom

Lorien

non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯

Posted: 2 days ago

AI SME

london (city of london), south east england, united kingdom

Lorien

Posted: 2 days ago

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Amber Labs

the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯

Posted: 4 days ago

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options

Amber Labs

Posted: 9 days ago

Salary Guide

Reinforcement Learning
the City of London

10th Percentile: £111,500
25th Percentile: £113,750
Median: £130,000
75th Percentile: £158,750
90th Percentile: £168,500

More Reinforcement Learning insights