1 to 25 of 32 Reinforcement Learning Jobs in the South East

Senior Reinforcement Learning expert

Hiring Organisation
Barrington James
Location
Slough, Berkshire, UK
Employment Type
Full-time
development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you'll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play a pivotal role … infrastructure and becoming a core pillar of the research organization. What You'll Do Design and implement training pipelines that blend Imitation Learning and Reinforcement Learning (both offline and online) to teach robotic behaviors. Collect high-quality demonstration data by teleoperating robots (around 4–10 hours ...

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Staff Data Scientist

Hiring Organisation
loveholidays
Location
Slough, Berkshire, UK
Employment Type
Full-time
this. About The Team Our Data Science team comprises eight members, including four Senior Data Scientists, two Data Scientists, a Machine Learning Engineer and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning techniques to create solutions to challenging business problems. We prioritise clean, well-tested code with a culture of documentation and knowledge sharing. Our tech stack includes GCP, Python, GitHub, PyTorch ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Reading, Berkshire, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Woking, Surrey, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Dartford, Kent, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Slough, Berkshire, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Maidstone, Kent, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Guildford, Surrey, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

Senior Data Scientist - Healthcare

Hiring Organisation
Kainos
Location
Brighton, East Sussex, UK
Employment Type
Full-time
others to deliver advanced AI and data solutions at citizen scale. Our 150-strong AI and Data Practice brings together deep expertise in machine learning, generative AI, agentic AI and data. We are pioneers in responsible AI, having authored the UK government's AI Cyber Security Code of Practice … RESPONSIBILITIES IN THE BUSINESS: As a Senior Data Scientist at Kainos, you will be building advanced AI solutions leveraging state-of-the-art machine learning, generative and agentic AI technologies. You will drive the adoption of modern AI frameworks, AIOps best practices and scalable cloud-native architectures. Your role ...

AI Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £85,000 per annum
currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning ...

Sr. Machine Learning Engineer

Hiring Organisation
Waymo
Location
Slough, Berkshire, UK
Employment Type
Full-time
public roads and tens of billions in simulation across 15+ U.S. states. The DUE ML Core London team will build and operate scalable machine learning and data systems, simulation workflow and insight tools, improve and speed up the evaluation and onboard developer journeys. It will combine expert human judgements … advanced machine learning models to deliver training and evaluation data for hundreds of metrics and components that make up the Waymo driver. We are looking for researchers and software engineers who are passionate about developing machine learning techniques for the Evaluation systems on our autonomous vehicles, and have ...

Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation
Thomson Reuters
Location
Slough, Berkshire, UK
Employment Type
Full-time
curious and open-minded individual with an interest in conducting state-of-theart foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … Dedicated Core Machine Learning Research Division Of Thomson Reuters. We Are Focused On Research And Development, With a Particular Focus On Advanced Algorithms And Training Techniques For Large Language Models (LLMs). We Are Building a Strong Foundation Of Research Capabilities Across Different Areas And Are Looking For Scientists ...

Machine Learning Quant Engineer - Investment banking/ XVA

Hiring Organisation
Harvey Nash
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£1,000 - £1,200 per day
Senior Quant Machine Learning Engineer sought by leading investment bank based in the city of London. **Inside IR35, 4 days a week on site** The role:To lead the design and deployment of ML-driven models across our trading and investment platforms. This is a high-impact, front-office … production deployment Mentor junior quants and engineers; contribute to knowledge-sharing and model governance processes Stay current with cutting-edge ML research (e.g., deep learning, generative models, reinforcement learning) and assess applicability to financial markets Collaborate closely with cross-functional teams, including traders, data engineers, and software ...

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

Senior AI Research Scientist (m/f/d)

Hiring Organisation
AMLZ Recruiting
Location
Slough, Berkshire, UK
Employment Type
Full-time
Explore methods in synthetic data generation, foundation-model adaptation, and model compression. Your Profile • Ideally holding a Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, Physics (or related quantitative field), or equivalent industry research experience. • Ideally 3+ years in machine learning/deep learning/LLMs … adjacent advanced AI research. • Proficient in Python; experience with PyTorch or JAX. • Understanding of reinforcement learning, multi-agent systems, probabilistic/Bayesian methods, or large-scale training. • Curiosity-driven mindset, strong problem solving, and the ability to work cross-functionally. • English is the working language; additional languages ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Southampton, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Maidstone, Kent, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Reading, Berkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Woking, Surrey, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Portsmouth, Hampshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Guildford, Surrey, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...