13 of 13 Reinforcement Learning Jobs in the South East

AI Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £85,000 per annum
currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning ...

Machine Learning Quant Engineer - Investment banking/ XVA

Hiring Organisation
Harvey Nash
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£1,000 - £1,200 per day
Senior Quant Machine Learning Engineer sought by leading investment bank based in the city of London. **Inside IR35, 4 days a week on site** The role:To lead the design and deployment of ML-driven models across our trading and investment platforms. This is a high-impact, front-office … production deployment Mentor junior quants and engineers; contribute to knowledge-sharing and model governance processes Stay current with cutting-edge ML research (e.g., deep learning, generative models, reinforcement learning) and assess applicability to financial markets Collaborate closely with cross-functional teams, including traders, data engineers, and software ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Senior Data Scientist

Hiring Organisation
La Fosse
Location
Slough, Berkshire, UK
Employment Type
Full-time
preparing for an international launch, we're now looking for a Senior Data Scientist. What you'll be doing: Build, deploy, and scale machine learning systems that forecast demand, optimise staffing, and improve operational performance across thousands of venues. Lead projects end-to-end, from data design and modelling … Confident working in AWS or similar cloud environments (SageMaker, Lambda, Docker, etc.). Experienced in (or eager to explore) areas such as forecasting, optimisation, reinforcement learning, generative AI, or computer vision. Solid engineering mindset, you know how to take models from research to production and keep them running ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Dartford, Kent, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Portsmouth, Hampshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Guildford, Surrey, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Brighton, East Sussex, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Crawley, West Sussex, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Director - Finance & Procurement Product Management

Hiring Organisation
GSK
Location
Slough, Berkshire, UK
Employment Type
Full-time
vendor relationships and ensure service delivery meets contractual obligations. Technology Innovation Architect next-generation solutions that support dynamic business needs. Design and implement self-learning agents for cross-process optimization. Integrate AI solutions with existing ERP and financial platforms. Operational Excellence Provide L3 support and manage major incidents, including … agile teams and manage backlogs for multiproduct assets. Performance & Continuous Improvement Define and monitor KPIs to assess digital solution effectiveness. Establish feedback loops and reinforcement learning mechanisms. Foster a culture of innovation and continuous improvement. Required Qualifications: University degree or equivalent. Director level of experience in finance ...

Principal Data Scientist

Hiring Organisation
SoTalent
Location
Slough, Berkshire, UK
Employment Type
Full-time
based analytics solutions in collaboration with technology and data engineering teams. Apply advanced methodologies including forecasting, predictive modelling, optimisation, clustering, NLP, Generative AI, and Reinforcement Learning. Translate business objectives into analytical solutions, testing and measuring value creation. Help shape and strengthen the data science practice by driving innovation, mentoring … talent, and building world-class capabilities. What you'll bring: Expertise in machine learning (supervised/unsupervised, regression, decision trees, random forests, boosting, clustering). Strong experience in Python, TensorFlow, SQL, and deploying models in cloud environments (B2C scale preferred). Proven track record in applying data science ...