17 of 17 Reinforcement Learning Jobs in the South East

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Reinforcement Learning RL control Engineer

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Machine Learning Engineer (0–3 Years Experience).

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning ...

AI Engineer

Hiring Organisation
Akixi
Location
Guildford, Surrey, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Dartford, Kent, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Slough, Berkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Crawley, West Sussex, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineer

Hiring Organisation
Akixi
Location
Brighton, East Sussex, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
Slough, Berkshire, UK
Employment Type
Full-time
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents: LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve: You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
Slough, Berkshire, UK
Employment Type
Full-time
English to intermediate and advanced learners. We're on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal 'next task' Work with a vast amount of unique data - we have data from over 1M language ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
Slough, Berkshire, UK
Employment Type
Full-time
English to intermediate and advanced learners. We're on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal 'next task' Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

Senior AI Engineer - Remote-first (Stockholm/London hubs)

Hiring Organisation
Wilgot
Location
Slough, Berkshire, UK
Employment Type
Full-time
from AI theory to solving tangible, high-scale engineering challenges. You will work alongside our founding team and our Founding AI Scientist (PhD in Reinforcement Learning) to translate cutting-edge research into enterprise-scale production features. Key Responsibilities Scale Agentic Architectures: Advance existing workflows into multi-agent systems ...

AI Engineers

Hiring Organisation
Stackstudio Digital Ltd
Location
Redhill, Surrey, South East, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £500 to £550 per day
vector search technology. Collaborate with cross-functional teams to integrate AI models into products. Stay updated with the latest advancements in AI and machine learning technologies. Conduct research to improve existing AI systems and develop new approaches. Knowledge of NLP, computer vision, or reinforcement learning. Experience deploying … models in production environments. Your Profile Essential Skills/Knowledge/Experience Proven experience in AI, machine learning, or deep learning. Proficiency in programming languages such as Python, R, or Java. Experience with AI frameworks like TensorFlow, PyTorch, or Keras. Experience with large language models (GPT, BERT, etc.). ...