15 of 15 Permanent Reinforcement Learning Jobs in the City of London

Artificial Intelligence Engineer

Hiring Organisation
Addition
Location
City of London, London, United Kingdom
live deployment. What You’ll Be Doing: Designing, building and deploying production-ready AI and data science solutions Training and shipping advanced machine learning, deep learning and reinforcement learning models Developing AI systems using Large Language Models Building robust data pipelines for extraction, cleaning and transformation … data science solutions end to end Advanced Python skills with libraries such as Pandas, NumPy and scikit-learn Hands-on experience with machine learning techniques (e.g. tree-based models, regression) Practical exposure to deep learning and reinforcement learning approaches Experience working with SQL and structured data ...

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
City of London, London, United Kingdom
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Machine Learning Engineer

Hiring Organisation
mthree
Location
City of London, London, United Kingdom
Machine Learning Engineer/ML Engineer Machine Learning Development Design and implement machine learning models for financial applications, with a focus on pricing and risk analytics Build scalable ML pipelines for processing large-scale financial data Develop deep learning architectures for time series prediction, anomaly detection … data-driven approaches to complement traditional quantitative finance models Support implementation of ML solutions for derivatives pricing and risk management Core Technical Skills Machine Learning Expertise : Deep understanding of ML algorithms (supervised/unsupervised learning, reinforcement learning) Extensive experience with neural networks, including RNNs, LSTMs, Transformers ...

Principal AI Research & Engineering Leader

Hiring Organisation
HCLTech
Location
City of London, London, United Kingdom
engineering roadmap for the AI platform, prioritizing projects that solve critical, high-complexity technical challenges (e.g., model efficiency, interpretability, real-time inference). Deep Learning & Modeling: Lead the design, implementation, and optimization of advanced deep learning, Generative AI, and Reinforcement Learning models from scratch, pushing … prototypes suitable for mission-critical production deployment. 2. MLOps & Production Engineering Excellence MLOps Leadership: Establish and enforce industry-leading best practices for MLOps (Machine Learning Operations), ensuring automation, reproducibility, version control, and continuous integration/continuous delivery (CI/CD) for all models. Architecture Review: Personally review and approve ...

Senior Data Scientist

Hiring Organisation
algo1
Location
City of London, London, United Kingdom
backed startup focused on behavioural AI, currently in stealth. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and Generative AI. Our platform is designed from the ground up - no legacy, no patchwork systems - just a clean slate and a clear vision. Our mission … edge AI to transform their shopping experience. About the Job We are looking for a Senior Data Scientist with experience in bringing advanced machine learning and data science systems to production to work with our team of industry leading domain experts and engineers. You'll be working across ...

AI/ML Engineer

Hiring Organisation
Brio Digital
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£75000 - £100000/annum
Senior Machine Learning Engineer (Generative AI/LLMs) Location: Fully Remote (UK-based) Salary: £75,000 - £100,000 (depending on experience) The Role We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI , including fine-tuning, vector search, and low-latency ...

Machine Learning Engineer

Hiring Organisation
JLA Resourcing Ltd
Location
City of London, London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£70,000
Role - Machine Learning Engineer Location Hybrid, 1 day on site in London (with flex) Salary - £70,000 to £75,000 The Opportunity Were partnering with a client in the financial services sector who are looking to bring on a Machine Learning Engineer to their growing Intelligent Automation Team … position is hybrid, typically one day per week on site in London (with flexibility around which day). The Role Reporting to the Machine Learning Lead, youll be a hands-on Machine Learning Engineer with a strong track record of building and deploying ML solutions at scaleparticularly ...

Machine Learning Engineer

Hiring Organisation
Xcede
Location
City of London, London, United Kingdom
Machine Learning Engineer ~2 days a week in the London office (hybrid, there is a little flexibility on this) About the Company Join a fast-growing platform business that connects buyers and sellers across a diverse marketplace. With deep investments in data and personalisation, the company is building advanced … machine learning capabilities to improve discovery, user experience, and recommendation quality at scale. The environment is fast-moving, collaborative, and focused on bringing practical AI into live systems that support millions of transactions across unique and varied inventory types. You’ll work at the intersection of engineering, product ...

Machine Learning Engineer

Hiring Organisation
Higher - AI recruitment
Location
City of London, London, United Kingdom
partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers . This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded … sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid) : London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using ...

Senior Data Scientist

Hiring Organisation
La Fosse
Location
City of London, London, United Kingdom
preparing for an international launch, we’re now looking for a Senior Data Scientist. What you’ll be doing: Build, deploy, and scale machine learning systems that forecast demand, optimise staffing, and improve operational performance across thousands of venues. Lead projects end-to-end, from data design and modelling … Confident working in AWS or similar cloud environments (SageMaker, Lambda, Docker, etc.). Experienced in (or eager to explore) areas such as forecasting, optimisation, reinforcement learning, generative AI, or computer vision. Solid engineering mindset, you know how to take models from research to production and keep them running ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
City of London, London, United Kingdom
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents : LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve : You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Software Engineer

Hiring Organisation
bloom
Location
City of London, London, United Kingdom
Software Engineer - Stealth AI Startup (London/Paris/Europe) 🚀 (RL/Reinforcement Learning) 💸 Package: £100,000-£200,000 + strong early-stage equity 🌍 Location: London/Paris hubs, or elsewhere in Europe with occasional travel to be with the team (remote friendly/hybrid - however you like ...

AI Engineer

Hiring Organisation
bloom
Location
City of London, London, United Kingdom
Engineer - Stealth AI Startup (London/Paris/Europe) 🚀 (RL/Reinforcement Learning) 💸 Package: £100,000-£200,000 + strong early-stage equity 🌍 Location: London/Paris hubs, or elsewhere in Europe with occasional travel to be with the team (remote friendly/hybrid - however you like ...

New Trading Team's 1st C++ Quant Developer | HFT

Hiring Organisation
Augmentti
Location
City of London, London, United Kingdom
frequency, low-latency trading. Work with a Humble Leader : You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies ...