|
|
26 to 37 of 37 Reinforcement Learning Jobs in the City of London
london (city of london), south east england, united kingdom Hybrid / WFH Options microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
City of London, London, United Kingdom Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
london (city of london), south east england, united kingdom Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
City of London, London, United Kingdom Hybrid / WFH Options Euphoric
contribute to the design, deployment, and iteration of our next-generation benefits platform features that leverage personalisation, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript) on the frontend and Python … it-takes attitude to deliver against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural non-negotiables. To do well here More ❯
City of London, London, United Kingdom SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
london (city of london), south east england, united kingdom SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have More ❯
City of London, London, United Kingdom Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
london (city of london), south east england, united kingdom Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on More ❯
City of London, London, United Kingdom Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
london (city of london), south east england, united kingdom Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
City of London, London, United Kingdom Hybrid / WFH Options Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
|
Salary Guide Reinforcement Learning the City of London - 10th Percentile
- £111,500
- 25th Percentile
- £113,750
- Median
- £130,000
- 75th Percentile
- £158,750
- 90th Percentile
- £168,500
|