Reinforcement Learning Jobs in the City of London

26 to 37 of 37 Reinforcement Learning Jobs in the City of London

Computer Vision Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Senior NLP Engineer (London)

City of London, London, United Kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Senior NLP Engineer (London)

london (city of london), south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Full Stack Engineer (AI)

City of London, London, United Kingdom
Hybrid / WFH Options
Euphoric
contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯
Posted:

Founding Software Engineer

City of London, London, United Kingdom
SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
Posted:

Founding Software Engineer

london (city of london), south east england, united kingdom
SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

City of London, London, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
Posted:

New Trading Team's 1st C++ Quant Developer | HFT

london (city of london), south east england, united kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
Posted:

AI SME

City of London, London, United Kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:

AI SME

london (city of london), south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:

AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:

AI Engineer

london (city of london), south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted:
Reinforcement Learning
the City of London
10th Percentile
£111,500
25th Percentile
£113,750
Median
£130,000
75th Percentile
£158,750
90th Percentile
£168,500