8 of 8 Permanent Reinforcement Learning Jobs in Berkshire

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning from Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models. Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Machine Learning Engineer

Hiring Organisation
Brio Digital
Location
Slough, Berkshire, UK
Employment Type
Full-time
Senior Machine Learning Engineer (Generative AI/LLMs). Location: Fully Remote (UK-based). Salary: £75,000 – £100,000 (depending on experience). The Role: We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing: Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI, including fine-tuning, vector search, and low-latency ...

Machine Learning Engineer

Hiring Organisation
Brio Digital
Location
Reading, Berkshire, UK
Employment Type
Full-time
Senior Machine Learning Engineer (Generative AI/LLMs). Location: Fully Remote (UK-based). Salary: £75,000 – £100,000 (depending on experience). The Role: We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing: Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI, including fine-tuning, vector search, and low-latency ...

Senior Data Scientist

Hiring Organisation
algo1
Location
Slough, Berkshire, UK
Employment Type
Full-time
backed startup focused on behavioural AI, currently in stealth. We are building for retail with a focus on Recommendation Systems, Reinforcement Learning and Generative AI. Our platform is designed from the ground up - no legacy, no patchwork systems - just a clean slate and a clear vision. Our mission … edge AI to transform their shopping experience. About the Job: We are looking for a Senior Data Scientist with experience in bringing advanced machine learning and data science systems to production to work with our team of industry-leading domain experts and engineers. You'll be working across ...

Senior AI Research Scientist (m/f/d)

Hiring Organisation
AMLZ Recruiting
Location
Slough, Berkshire, UK
Employment Type
Full-time
Explore methods in synthetic data generation, foundation-model adaptation, and model compression. Your Profile • Ideally holding a Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, Physics (or related quantitative field), or equivalent industry research experience. • Ideally 3+ years in machine learning/deep learning/LLMs … adjacent advanced AI research. • Proficient in Python; experience with PyTorch or JAX. • Understanding of reinforcement learning, multi-agent systems, probabilistic/Bayesian methods, or large-scale training. • Curiosity-driven mindset, strong problem solving, and the ability to work cross-functionally. • English is the working language; additional languages ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Slough, Berkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
Slough, Berkshire, UK
Employment Type
Full-time
English to intermediate and advanced learners. We're on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do: Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal 'next task'. Work with a vast amount of unique data - we have data from over 1M language ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
Slough, Berkshire, UK
Employment Type
Full-time
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents: LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve: You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...