13 of 13 Reinforcement Learning Jobs in the Thames Valley

Artificial Intelligence Engineer

Hiring Organisation
Addition
Location
Slough, Berkshire, UK
Employment Type
Full-time
live deployment. What You'll Be Doing: Designing, building and deploying production-ready AI and data science solutions Training and shipping advanced machine learning, deep learning and reinforcement learning models Developing AI systems using Large Language Models Building robust data pipelines for extraction, cleaning and transformation … data science solutions end to end Advanced Python skills with libraries such as Pandas, NumPy and scikit-learn Hands-on experience with machine learning techniques (e.g. tree-based models, regression) Practical exposure to deep learning and reinforcement learning approaches Experience working with SQL and structured data ...

AI Research Internship - Students Pursing PhD

Hiring Organisation
MediaTek
Location
Slough, Berkshire, UK
Employment Type
Full-time
research that supports both our applications and the broader scientific community. Current areas of interest include large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models. Responsibilities:- Contribute to ongoing research in machine learning and artificial intelligence Help develop and implement … Stay up to date with the latest advancements in AI and related fields Requirement Qualifications Required: Currently enrolled in a PhD program in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field Strong interest in research and a background in machine learning or a related ...

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Machine Learning Engineer

Hiring Organisation
Brio Digital
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
Senior Machine Learning Engineer (Generative AI/LLMs) Location: Fully Remote (UK-based) Salary: £75,000 – £100,000 (depending on experience) The Role We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI, including fine-tuning, vector search, and low-latency ...

Machine Learning Engineer

Hiring Organisation
Brio Digital
Location
Slough, Berkshire, UK
Employment Type
Full-time
Senior Machine Learning Engineer (Generative AI/LLMs) Location: Fully Remote (UK-based) Salary: £75,000 – £100,000 (depending on experience) The Role We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI, including fine-tuning, vector search, and low-latency ...

Machine Learning Engineer

Hiring Organisation
Brio Digital
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
Senior Machine Learning Engineer (Generative AI/LLMs) Location: Fully Remote (UK-based) Salary: £75,000 – £100,000 (depending on experience) The Role We're hiring a Senior Machine Learning Engineer to lead the design and productionisation of Generative AI and Large Language Model (LLM) applications. This role … strong influence over architecture, tooling, and the future direction of LLM-powered products. What You'll Be Doing Design, develop, and deploy advanced machine learning and deep learning models into production. Architect scalable LLMOps pipelines on GCP/Vertex AI, including fine-tuning, vector search, and low-latency ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Reading, Berkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

AI Engineering Lead

Hiring Organisation
Akixi
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
Slough, Berkshire, UK
Employment Type
Full-time
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents: LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve: You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

AI Engineer

Hiring Organisation
bloom
Location
Slough, Berkshire, UK
Employment Type
Full-time
Engineer - Stealth AI Startup (London/Paris/Europe) (RL/Reinforcement Learning) Package: £100,000-£200,000 + strong early-stage equity Location: London/Paris hubs, or elsewhere in Europe with occasional travel to be with the team (remote friendly/hybrid - however you like ...

New Trading Team's 1st C++ Quant Developer | HFT

Hiring Organisation
Augmentti
Location
Slough, Berkshire, UK
Employment Type
Full-time
frequency, low-latency trading. Work with a Humble Leader: You'll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who's not only mastered complex trading strategies ...