21 of 21 Reinforcement Learning Jobs in London

Artificial Intelligence Engineer

Hiring Organisation
Vareon
Location
South London, UK
Employment Type
Full-time
Reinforcement Learning Specialist — Machine Autonomy Division (R&D Prototypes) — Contract-to-Hire — UK & Germany Preferred — Remote About Vareon Vareon is a systems architecture and engineering company building reliable, deterministic AI for physical systems. We develop transparent, steerable intelligence that can be validated under real-world constraints—latency, safety … sensor noise, and imperfect dynamics. Our approach blends controlled, physics-inspired methods with modern machine learning to create systems that are robust, debuggable, and deployable on real machines. Machine Autonomy Division Machine Autonomy is Vareon's R&D division focused on rapid prototyping and demonstration development in robotics ...

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
City of London, London, United Kingdom
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Machine Learning Engineer

Hiring Organisation
mthree
Location
City, London, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
Machine Learning Engineer/ML Engineer Machine Learning Development Design and implement machine learning models for financial applications, with a focus on pricing and risk analytics Build scalable ML pipelines for processing large-scale financial data Develop deep learning architectures for time series prediction, anomaly detection … data-driven approaches to complement traditional quantitative finance models Support implementation of ML solutions for derivatives pricing and risk management Core Technical Skills Machine Learning Expertise : Deep understanding of ML algorithms (supervised/unsupervised learning, reinforcement learning) Extensive experience with neural networks, including RNNs, LSTMs, Transformers ...

Machine Learning Engineer

Hiring Organisation
mthree
Location
EC4M, Old Bailey, Greater London, United Kingdom
Employment Type
Permanent, Contract
Machine Learning Engineer/ML Engineer Machine Learning Development Design and implement machine learning models for financial applications, with a focus on pricing and risk analytics Build scalable ML pipelines for processing large-scale financial data Develop deep learning architectures for time series prediction, anomaly detection … data-driven approaches to complement traditional quantitative finance models Support implementation of ML solutions for derivatives pricing and risk management Core Technical Skills Machine Learning Expertise : Deep understanding of ML algorithms (supervised/unsupervised learning, reinforcement learning) Extensive experience with neural networks, including RNNs, LSTMs, Transformers ...

AI Engineer

Hiring Organisation
Microtech Global Ltd
Location
West London, London, United Kingdom
Employment Type
Contract, Work From Home
these AI solutions to ensure they meet user needs and drive meaningful impact in both healthcare and accessibility domains. Responsibilities: Develop and optimize machine learning models for disease prediction, early diagnosis and personalised healthcare solutions. Process and analyze structured and unstructured health data (EHR, Wearables, HL7/FHIR … implement deep learning algorithms for predictive healthcare applications. Contribute to research on AI-driven personalization strategies to empower users in managing their health effectively. Develop AI-powered accessibility solutions for our products, leveraging multi-modal AI (text, image, audio). Adhere to data privacy regulations (GDPR, MDR, HIPPA, EHDS ...

AI/ML Architect

Hiring Organisation
Stott & May Professional Search Limited
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£460 - £510 per day
Inside IR35) Role Overview We are seeking an experienced AI/ML Architect to design, build, and lead the deployment of advanced machine learning and AI-driven solutions. The successful consultant will enable real-world value from data by architecting models, pipelines, and integration patterns that scale across … This role combines technical depth, applied ML engineering, solution design, and hands-on development. Key Responsibilities Design, build, and train end-to-end machine learning models across NLP, predictive analytics, classification, and computer vision use cases Experiment with algorithms, optimise hyperparameters, and evaluate model performance Collect, process, and prepare ...

AI / ML Architect

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £450 to £500 per day
Hybrid): 2 days from office Number of Positions: 4 The Role An AI/ML Developer is responsible for designing, building, and deploying machine learning models and AI solutions that solve business problems. This role focuses on coding, data preparation, and integrating models into production systems. Your Responsibilities … Model Development Design, build, and train machine learning models for predictive analytics, classification, NLP, computer vision, or other AI applications. Experiment with algorithms and optimize hyperparameters for performance. Data Preparation Collect, clean, and preprocess large datasets for training and validation. Implement feature engineering and data augmentation techniques. Integration & Deployment ...

Machine Learning Engineer (0–3 Years Experience).

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

AI/ML Engineer

Hiring Organisation
Brio Digital
Location
City of London, London, United Kingdom
Employment Type
Permanent
Salary
£75000 - £100000/annum
Senior Machine Learning Engineer (Generative AI/LLMs) Location: Fully Remote (UK-based) Salary: £75,000 - £100,000 (depending on experience) The Role We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI , including fine-tuning, vector search, and low-latency ...

Machine Learning Engineer

Hiring Organisation
Higher - AI recruitment
Location
City of London, London, United Kingdom
partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers . This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded … sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid) : London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
London, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

Applied Scientist

Hiring Organisation
Cubiq Recruitment
Location
City of London, London, United Kingdom
internal research papers, technical memos, and (where appropriate) external publications. Who You Are Currently completing or recently completed a PhD in Physics, Mathematics, Machine Learning, Computer Science, or a related field from a top university. Strong publication record at leading venues such as NeurIPS, ICML, ICLR, ACL, CVPR, ICCV … EMNLP. Solid understanding of modern ML architectures (transformers, diffusion, retrieval-augmented systems, reinforcement learning, etc.). Strong coding skills in Python and experience with at least one major ML framework (PyTorch, JAX, TensorFlow). Ability to bridge high-level research with practical implementation. Curious, humble, and excited ...

Senior ML Infrastructure Engineer Robotics

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£600 - £1,000 per day
heart of their technology. This is a hands-on engineering role focused on scale, performance, and reliability. You will work across the full machine learning lifecycle, from distributed training pipelines to highly optimised inference systems deployed into production robotics environments. The Role You will join a highly technical team … working at the intersection of software engineering, machine learning infrastructure, and robotics. Your focus will be on turning cutting-edge models into robust, production-ready systems that run efficiently across cloud and constrained hardware environments. You will collaborate closely with researchers and ML engineers, help shape architectural decisions ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
City of London, Greater London, UK
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents : LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve : You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
London Area, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

AI Developer/Engineer

Hiring Organisation
Damia Group Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £650/day
Models (LLMs) and other generative architectures. Optimise pre-trained models (OpenAI, Anthropic, or open-source LLMs) for business use cases using prompt engineering and reinforcement learning. Experiment with model configurations to balance performance, cost, and scalability. Build robust data pipelines for continuous model improvement and retraining. Ensure compliance with ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
London, UK
Employment Type
Full-time
English to intermediate and advanced learners. We're on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal 'next task' Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

Senior AI Engineer

Hiring Organisation
Wilgot
Location
London, UK
Employment Type
Full-time
from AI theory to solving tangible, high-scale engineering challenges. You will work alongside our founding team and our Founding AI Scientist (PhD in Reinforcement Learning) to translate cutting-edge research into enterprise-scale production features. Key Responsibilities Scale Agentic Architectures: Advance existing workflows into multi-agent systems ...

Senior AI Engineer - Remote-first (Stockholm/London hubs)

Hiring Organisation
Wilgot
Location
City of London, London, United Kingdom
from AI theory to solving tangible, high-scale engineering challenges. You will work alongside our founding team and our Founding AI Scientist (PhD in Reinforcement Learning) to translate cutting-edge research into enterprise-scale production features. Key Responsibilities Scale Agentic Architectures: Advance existing workflows into multi-agent systems ...

Senior AI Engineer - Remote-first (Stockholm/London hubs)

Hiring Organisation
Wilgot
Location
East London, London, United Kingdom
from AI theory to solving tangible, high-scale engineering challenges. You will work alongside our founding team and our Founding AI Scientist (PhD in Reinforcement Learning) to translate cutting-edge research into enterprise-scale production features. Key Responsibilities Scale Agentic Architectures: Advance existing workflows into multi-agent systems ...

C# .Net Developer

Hiring Organisation
Damia Group Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £650/day
Models (LLMs) and other generative architectures. Optimise pre-trained models (OpenAI, Anthropic, or open-source LLMs) for business use cases using prompt engineering and reinforcement learning. Experiment with model configurations to balance performance, cost, and scalability. Build robust data pipelines for continuous model improvement and retraining. Ensure compliance with ...