8 of 8 Reinforcement Learning Jobs in the South East

Lead AI Engineer

Hiring Organisation
Akixi
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Data Scientist

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£55,000 - £65,000 per annum
will join a collaborative cross-functional unit working across engineering, product and data. The Role: Develop and deploy production-grade Python and deep learning models. Build NLP and LLM features including embeddings, intent detection and conversational AI. Contribute to end-to-end pipelines using cloud services, microservices and containerisation. … Experiment with advanced techniques including reinforcement learning and RAG workflows. Collaborate closely with engineering and product on delivery and performance. Present work clearly in team sessions and contribute to technical decision-making. Your Skills and Experience: Strong Python skills and experience deploying ML models into production. Hands ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
Slough, Berkshire, UK
Employment Type
Full-time
English to intermediate and advanced learners. We're on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do: Build fundamental models for education, solving the ultimate learning task of predicting student knowledge and optimal 'next task'. Build fully automated pipelines for dictionary building, including span identification, word sense distribution, and sense ...

Software Engineer (Applied AI)

Hiring Organisation
TalentCo
Location
Slough, Berkshire, UK
Employment Type
Full-time
into production-ready code. Build AI-first features: Design and ship cutting-edge platform features using personalisation, LLMs, recommender systems, and reinforcement learning. Move fast with the latest tools: Leverage modern AI dev tools (Claude Code, Cursor) to experiment, prototype, and launch quickly. Be a collaborative, growth-minded engineer ...

Software Engineer (Applied AI)

Hiring Organisation
TalentCo
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
into production-ready code. Build AI-first features: Design and ship cutting-edge platform features using personalisation, LLMs, recommender systems, and reinforcement learning. Move fast with the latest tools: Leverage modern AI dev tools (Claude Code, Cursor) to experiment, prototype, and launch quickly. Be a collaborative, growth-minded engineer ...

Reinforcement Learning (RL) Engineer, Manipulation

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
talent to solve the most complex challenges in high-DOF autonomous systems and embodied AI. We are looking for experts across: AI/Machine Learning, MLOps, Software Engineering, Data Science. The Environment: The mission is driven by high-bandwidth, in-person collaboration. This is a 5-day-a-week ...

Data Scientist - Inside IR35 - Hybrid

Hiring Organisation
Halian Technology Limited
Location
Croydon, Surrey, South East, United Kingdom
Employment Type
Contract
Role: We are recruiting on behalf of a mobility technology business building intelligent fleet orchestration systems. This role suits an experienced Applied Machine Learning Engineer or Data Scientist comfortable working with messy real-world data, operational constraints, and production systems. You'll join a small, high-calibre team solving complex … years. Geospatial data experience (H3, GeoPandas, PostGIS or similar); optimisation/operations research exposure; logistics/mobility/marketplace domain experience. Nice to Have: reinforcement learning; simulation modelling; experience deploying models into cloud environments; experimentation frameworks (A/B testing, model validation at scale). How to Apply ...