20 of 20 Reinforcement Learning Jobs in England

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
City of London, London, United Kingdom
permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Senior Research Engineer

Hiring Organisation
algo1
Location
London Area, United Kingdom
backed startup focused on hyper-personalisation, currently in stealth. Inspired by the latest in recommender systems, we leverage transformers and graph learning alongside decision-making models to build the most engaging customer experiences for in-store retail. Our mission is to change retail forever through hyper-personalised experiences that … both simple and beautiful. About the Job - Foundation Models for Retail We are looking for a Senior Research Engineer with experience in advanced machine learning systems to work with our team of industry leading domain experts and engineers to build foundation models for retail shopping. Key Responsibilities: Translate latest ...

Reader in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, South West, United Kingdom
Employment Type
Permanent
Salary
£55,000
Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning. … particularly keen to recruit in Natural Language Processing, Reinforcement Learning and/or AI Security. Appointments would be at the Reader level. We are a highly collaborative team, working not only with other researchers in our department, but across the university and beyond. We will offer you support ...

Reinforcement Learning (RL) control Engineer

Hiring Organisation
Randstad Digital
Location
City of London, London, United Kingdom
Employment Type
Permanent
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Reinforcement Learning RL control Engineer

Hiring Organisation
Randstad Technologies
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £120,000 per annum
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

AI / ML Architect

Hiring Organisation
Stackstudio Digital Ltd
Location
London, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £450 to £500 per day
Hybrid): 2 days from office Number of Positions: 4 The Role An AI/ML Developer is responsible for designing, building, and deploying machine learning models and AI solutions that solve business problems. This role focuses on coding, data preparation, and integrating models into production systems. Your Responsibilities … Model Development Design, build, and train machine learning models for predictive analytics, classification, NLP, computer vision, or other AI applications. Experiment with algorithms and optimize hyperparameters for performance. Data Preparation Collect, clean, and preprocess large datasets for training and validation. Implement feature engineering and data augmentation techniques. Integration & Deployment ...

Reinforcement Learning (RL) control Engineer

Location
London, United Kingdom
Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world ...

Machine Learning Engineer

Hiring Organisation
Kingsgate Recruitment Ltd
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£30,000 - £40,000 per annum
Machine Learning Engineer (Graduate/Early Career) Location: London Type: Full-time About the Role Kickstart your AI career! This role is designed specifically for recent graduates or early-career professionals eager to launch their careers in machine learning and AI. You’ll work on exciting real-world … required—what matters is curiosity, problem-solving skills, and passion for AI. What You’ll Do Implement, train, and optimize ML models (supervised, unsupervised, reinforcement learning) Preprocess data, engineer features, and evaluate model performance Deploy models to production and monitor performance Collaborate with cross-functional teams to integrate ...

Senior Data Scientist

Hiring Organisation
Anson Mccade
Location
London, United Kingdom
Employment Type
Permanent
Responsibilities End-to-End Delivery: Lead the technical execution of AI projects, from initial problem discovery and hypothesis testing to deploying production-grade machine learning models. Strategic Advisory: Act as a "translator" between technical complexity and business value. You will work closely with C-suite stakeholders to identify … solve their most pressing strategic challenges. Technical Leadership: Architect robust, scalable data pipelines and state-of-the-art models (including LLMs, Reinforcement Learning, or Bayesian Inference) tailored to specific client needs. Mentorship: Guide and upskill junior Data Scientists, fostering a culture of rigorous peer review, clean coding standards ...

Reader in Artificial Intelligence

Hiring Organisation
University of Bath
Location
Bath, Somerset, United Kingdom
Employment Type
Permanent
Salary
GBP 55,000 Annual
Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning click ...

Reader in Artificial Intelligence

Location
Bath, Somerset, United Kingdom
Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning. ...

Research Software Scientist / Engineer

Hiring Organisation
ECM Selection (Holdings) Limited
Location
Guildford, Surrey, United Kingdom
Employment Type
Permanent
Salary
£50000 - £80000/annum DoE + Benefits
Sciences which has included demonstrable project experience in scientific programming using Python, C++ or C# (ideally all three). Novel research experience in machine learning, optimisation or probabilistic modelling which should have included the development of new algorithms, use of numerical methods and computational modelling. Any exposure to digital … twins, agent-based systems, reinforcement learning or advanced optimisations methods would be desirable. For senior candidates, additional experience in mentoring, project leadership on innovation-based projects, or strategic direction would be advantageous. The role is fully onsite at their Guildford offices with opportunities for international travel. Compensation includes ...

Software Engineer (Numerical Modelling, AI/ML, C++/Python)

Hiring Organisation
Hays
Location
Guildford, Surrey, South East, United Kingdom
Employment Type
Permanent
level scientific language (e.g., Python, Julia). We are particularly looking at experience with scientific computing, numerical methods, or computational modelling. Desirables are Machine learning, optimization, control, probabilistic modelling, or related fields. Familiarity digital twins, agentic systems, reinforcement learning or advanced optimisation. What ...

Lead ML Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation
governr
Location
London Area, United Kingdom
requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents : LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve : You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Senior NLP Engineer (London)

Hiring Organisation
Glite Tech
Location
City of London, London, United Kingdom
English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

AI Developer/Engineer

Hiring Organisation
Damia Group Ltd
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£500 - £650 per day
Models (LLMs) and other generative architectures. Optimise pre-trained models (OpenAI, Anthropic, or open-source LLMs) for business use cases using prompt engineering and reinforcement learning. Experiment with model configurations to balance performance, cost, and scalability. Build robust data pipelines for continuous model improvement and retraining. Ensure compliance with ...

AI Architect (Wealth)

Hiring Organisation
Teksystems
Location
Central London, London, United Kingdom
Employment Type
Contract, Work From Home
Title: AI Architect (Wealth) Job Description This position is pivotal in designing AI and Machine Learning solutions on cloud-based platforms, exploring emerging AI trends, developing proof-of-concepts, and collaborating with internal and external ecosystems to advance these concepts to production. The role demands expertise in designing … least 6-10 years of hands-on development and architectural experience. Proficiency in Python, PyTorch, TensorFlow, or similar frameworks. experience with supervised, unsupervised, and reinforcement learning. Solid grounding in Natural Language Processing (NLP) concepts such as tokenisation, embeddings, semantic search, text classification, and summarisation. Strong understanding of Large Language ...

Senior AI / ML Engineer - Bristol or Bath (hybrid)

Hiring Organisation
Method-Resourcing
Location
Bristol, Avon, South West, United Kingdom
Employment Type
Permanent
path toward deployment and commercial relevance. Initial focus areas include: Agentic AI systems Energy systems flexibility Creative industries applications Emerging work across generative AI, reinforcement learning, neuro-symbolic AI, verifiable AI, and MCP-style architectures You will work closely with academic partners, industry collaborators, and funding bodies, translating ...

AI Engineers

Hiring Organisation
Stackstudio Digital Ltd
Location
Redhill, Surrey, South East, United Kingdom
Employment Type
Contract, Work From Home
Contract Rate
From £500 to £550 per day
vector search technology. Collaborate with cross-functional teams to integrate AI models into products. Stay updated with the latest advancements in AI and machine learning technologies. Conduct research to improve existing AI systems and develop new approaches. Knowledge of NLP, computer vision, or reinforcement learning. Experience deploying … models in production environments. Your Profile Essential Skills/Knowledge/Experience Proven experience in AI, machine learning, or deep learning. Proficiency in programming languages such as Python, R, or Java. Experience with AI frameworks like TensorFlow, PyTorch, or Keras. Experience with large language models (GPT, BERT, etc.). ...