16 of 16 Reinforcement Learning Jobs in the South East

AI Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £85,000 per annum
currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning ...

DevOps Engineer

Hiring Organisation
Matchtech
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£85,000 per annum, Negotiable
DevOps Engineer - Reinforcement Learning Platforms We are seeking an experienced DevOps Engineer to help build and scale a web-based platform for reinforcement learning (RL) training and RLOps. You will design, implement, and maintain the cloud infrastructure, CI/CD pipelines, and deployment systems that support … solving and communication skills Compensation & Benefits * Stock options* 30 days' holiday plus bank holidays* Flexible and remote working options* Enhanced parental leave* £500 annual learning and development budget* Pension scheme* Regular socials and quarterly gatherings* Bike-to-Work scheme ...

Principal Data Scientist

Hiring Organisation
Sky
Location
Middlesex, south east england, united kingdom
Experience building and deploying advanced analytics solutions in a large scale (preferably B2C) cloud environmen t Experience in deploying commercially viable applications using deep learning techniques combining structured and unstructured data is highly desirable Ability to quickly understand a business objective, problem solving to create an analytical solution …/or data analysis e.g. Python, Tensorflow (essential) Database experience, preferably SQL (essential) Expertise in cutting-edge AI methodologies, including Generative AI and Reinforcement Learning Machine learning - Supervised/unsupervised learning, regression, decision trees, random forests, boosting, clustering (essential) The rewards There's one thing people ...

Machine Learning Quant Engineer - Investment banking/ XVA

Hiring Organisation
Harvey Nash
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£1,000 - £1,200 per day
Senior Quant Machine Learning Engineer sought by leading investment bank based in the city of London. **Inside IR35, 4 days a week on site** The role:To lead the design and deployment of ML-driven models across our trading and investment platforms. This is a high-impact, front-office … production deployment Mentor junior quants and engineers; contribute to knowledge-sharing and model governance processes Stay current with cutting-edge ML research (e.g., deep learning, generative models, reinforcement learning) and assess applicability to financial markets Collaborate closely with cross-functional teams, including traders, data engineers, and software ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Senior AI Research Scientist (m/f/d)

Hiring Organisation
AMLZ Recruiting
Location
Slough, Berkshire, UK
Employment Type
Full-time
Explore methods in synthetic data generation, foundation-model adaptation, and model compression. Your Profile • Ideally holding a Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, Physics (or related quantitative field), or equivalent industry research experience. • Ideally 3+ years in machine learning/deep learning/LLMs … adjacent advanced AI research. • Proficient in Python; experience with PyTorch or JAX. • Understanding of reinforcement learning, multi-agent systems, probabilistic/Bayesian methods, or large-scale training. • Curiosity-driven mindset, strong problem solving, and the ability to work cross-functionally. • English is the working language; additional languages ...

Senior Data Scientist

Hiring Organisation
La Fosse
Location
Slough, Berkshire, UK
Employment Type
Full-time
preparing for an international launch, we're now looking for a Senior Data Scientist. What you'll be doing: Build, deploy, and scale machine learning systems that forecast demand, optimise staffing, and improve operational performance across thousands of venues. Lead projects end-to-end, from data design and modelling … Confident working in AWS or similar cloud environments (SageMaker, Lambda, Docker, etc.). Experienced in (or eager to explore) areas such as forecasting, optimisation, reinforcement learning, generative AI, or computer vision. Solid engineering mindset, you know how to take models from research to production and keep them running ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Slough, Berkshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Brighton, East Sussex, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

Lead AI Engineer

Hiring Organisation
Prolific
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
strong command of system design, infrastructure, and CI/CD pipelines, and you're comfortable taking a feature from concept to deployment. Machine Learning Engineering & Research: A comprehensive background in machine learning, with deep experience in ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and a strong grasp … Experience in agentic system design and tool calling (Adk, A2A, MCP, etc...) Prior work on human-in-the-loop systems, data annotation platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building ...

ML Solutions Architect (Data Agents)

Hiring Organisation
JetBrains
Location
Slough, Berkshire, UK
Employment Type
Full-time
long-term stability for external users). Evaluated LLM output quality in real-world applications. Worked with information retrieval or knowledge engineering. Worked with reinforcement learning for agents and/or multi-agent systems, especially in production or complex simulated environments. We process the data provided in your ...

Lead AI Researcher

Hiring Organisation
Lloyds Banking Group
Location
Slough, Berkshire, UK
Employment Type
Full-time
forefront of the Group's AI transformation. In this role, you will lead the research and development of cutting-edge artificial intelligence, machine learning, and generative AI technologies, guiding their application to drive business value while ensuring ethical and responsible implementation. You will work within the AI Centre … create innovative, customer-centric banking solutions. Role Responsibilities: AI Research & Innovation: Lead and conduct hands-on, end-to-end research, developing novel AI, Machine Learning (ML), Generative AI (GenAI), and Agentic AI solutions and patterns. Strategic Leadership: Set the technical direction and research roadmap for the AI team, translating ...

AI Lead

Hiring Organisation
Xcede
Location
Slough, Berkshire, UK
Employment Type
Full-time
Requirements: Previous experience leading, managing and growing teams of AI Researchers and Engineers Strong technical knowledge and experience around AI agents, LLMs, RAG systems, reinforcement learning, fine-tuning, etc Data Engineering and Infrastructure knowledge and experience Strong product mindset Experience working in a B2B SaaS start ...

Tech lead (C++/Python)

Hiring Organisation
Signify Technology
Location
Slough, Berkshire, UK
Employment Type
Full-time
systems. What You'll Be Doing Developing and maintaining top-level robot application code. Integrating modules from cross-functional teams: controls, navigation, computer vision, reinforcement learning, and platform. Designing and evolving application-side architecture and interfaces. Making pragmatic technical decisions to accelerate delivery. Collaborating with product teams ...