23 of 23 Reinforcement Learning Jobs in the UK

Artificial Intelligence Researcher

Hiring Organisation: microTECH Global LTD
Location: City of London, London, United Kingdom

permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models … build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. ...

Senior Research Engineer

Hiring Organisation: algo1
Location: London Area, United Kingdom

backed startup focused on hyper-personalisation, currently in stealth. Inspired by the latest in recommender systems, we leverage transformers and graph learning alongside decision-making models to build the most engaging customer experiences for in-store retail. Our mission is to change retail forever through hyper-personalised experiences that … both simple and beautiful. About the Job - Foundation Models for Retail We are looking for a Senior Research Engineer with experience in advanced machine learning systems to work with our team of industry leading domain experts and engineers to build foundation models for retail shopping. Key Responsibilities: Translate latest ...

Reader in Artificial Intelligence

Hiring Organisation: University of Bath
Location: Bath, Somerset, South West, United Kingdom
Employment Type: Permanent
Salary: £55,000

Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security) The Department of Computer Science wishes to appoint academics to strengthen our growing Artificial Intelligence and Machine Learning Research Group. We welcome strong applications from all areas of AI and machine learning. … particularly keen to recruit in Natural Language Processing, Reinforcement Learning and/or AI Security. Appointments would be at the Reader level. We are a highly collaborative team, working not only with other researchers in our department, but across the university and beyond. We will offer you support ...

Reinforcement Learning (RL) control Engineer

Hiring Organisation: Randstad Digital
Location: City of London, London, United Kingdom
Employment Type: Permanent

Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world … cloning. High-Performance Engineering: Designing and profiling research-grade PyTorch/JAX code to support large-scale, distributed RL infrastructure. Essential Skills Needed Deep Learning Mastery: 5+ years building and shipping models, with deep hands-on expertise in LLMs, VLMs, or generative architectures. Industry Experience: 3+ years of commercial ...

Reinforcement Learning RL control Engineer

Hiring Organisation: Randstad Technologies
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £120,000 per annum

AI / ML Architect

Hiring Organisation: Stackstudio Digital Ltd
Location: London, United Kingdom
Employment Type: Contract, Work From Home
Contract Rate: From £450 to £500 per day

Hybrid): 2 days from office Number of Positions: 4 The Role An AI/ML Developer is responsible for designing, building, and deploying machine learning models and AI solutions that solve business problems. This role focuses on coding, data preparation, and integrating models into production systems. Your Responsibilities … Model Development Design, build, and train machine learning models for predictive analytics, classification, NLP, computer vision, or other AI applications. Experiment with algorithms and optimize hyperparameters for performance. Data Preparation Collect, clean, and preprocess large datasets for training and validation. Implement feature engineering and data augmentation techniques. Integration & Deployment ...

Reinforcement Learning (RL) control Engineer

Location: London, United Kingdom

Machine Learning Engineer

Hiring Organisation: Kingsgate Recruitment Ltd
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £30,000 - £40,000 per annum

Machine Learning Engineer (Graduate/Early Career) Location: London Type: Full-time About the Role Kickstart your AI career! This role is designed specifically for recent graduates or early-career professionals eager to launch their careers in machine learning and AI. You’ll work on exciting real-world … required—what matters is curiosity, problem-solving skills, and passion for AI. What You’ll Do Implement, train, and optimize ML models (supervised, unsupervised, reinforcement learning) Preprocess data, engineer features, and evaluate model performance Deploy models to production and monitor performance Collaborate with cross-functional teams to integrate ...

Senior Data Scientist

Hiring Organisation: Anson Mccade
Location: London, United Kingdom
Employment Type: Permanent

Responsibilities End-to-End Delivery: Lead the technical execution of AI projects, from initial problem discovery and hypothesis testing to deploying production-grade machine learning models. Strategic Advisory: Act as a "translator" between technical complexity and business value. You will work closely with C-suite stakeholders to identify … solve their most pressing strategic challenges. Technical Leadership: Architect robust, scalable data pipelines and state-of-the-art models (including LLMs, Reinforcement Learning, or Bayesian Inference) tailored to specific client needs. Mentorship: Guide and upskill junior Data Scientists, fostering a culture of rigorous peer review, clean coding standards ...

Reader in Artificial Intelligence

Hiring Organisation: University of Bath
Location: Bath, Somerset, United Kingdom
Employment Type: Permanent
Salary: GBP 55,000 Annual

Reader in Artificial Intelligence

Location: Bath, Somerset, United Kingdom

Software Engineer - Large Language Models

Hiring Organisation: Fastino
Location: United Kingdom

overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

AI Engineer

Hiring Organisation: Akixi
Location: United Kingdom

similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (Azure Cognitive Services, AWS SageMaker, GCP Vertex AI). Experience deploying and monitoring AI workloads ...

Software Engineer (Applied AI)

Hiring Organisation: Euphoric
Location: United Kingdom

iteration of our next-generation benefits platform features that leverage personalization, experimentation, and AI/ML methods (e.g. agents/LLMs, recommender systems, reinforcement learning) to enhance user experience in a meaningful business domain. Contribute across the tech stack: You’ll work in React (JavaScript/TypeScript … against important business goals that help the entire team win Pragmatic Best Practices: An overarching desire to build efficient, scalable, and maintainable code, while learning the tradeoffs between technical debt and delivery speed What we look for: We’re a great bunch but we have some "Euph" cultural ...

Research Software Scientist / Engineer

Hiring Organisation: ECM Selection (Holdings) Limited
Location: Guildford, Surrey, United Kingdom
Employment Type: Permanent
Salary: £50000 - £80000/annum DoE + Benefits

Sciences which has included demonstrable project experience in scientific programming using Python, C++ or C# (ideally all three). Novel research experience in machine learning, optimisation or probabilistic modelling which should have included the development of new algorithms, use of numerical methods and computational modelling. Any exposure to digital … twins, agent-based systems, reinforcement learning or advanced optimisations methods would be desirable. For senior candidates, additional experience in mentoring, project leadership on innovation-based projects, or strategic direction would be advantageous. The role is fully onsite at their Guildford offices with opportunities for international travel. Compensation includes ...

Software Engineer (Numerical Modelling, AI/ML, C++/Python)

Hiring Organisation: Hays
Location: Guildford, Surrey, South East, United Kingdom
Employment Type: Permanent

level scientific language (e.g., Python, Julia). We are particularly looking at experience with scientific computing, numerical methods, or computational modelling. Desirables are Machine learning, optimization, control, probabilistic modelling, or related fields. Familiarity digital twins, agentic systems, reinforcement learning or advanced optimisation. What ...

Lead ML Engineer (London)

Hiring Organisation: Glite Tech
Location: City of London, London, United Kingdom

English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language ...

Agentic Developer - Building guardrails for autonomous AI

Hiring Organisation: governr
Location: London Area, United Kingdom

requirements through first principles • You communicate technical concepts clearly to non-technical stakeholders Highly Valued (Differentiated Candidates) • Publications or research in multi-agent systems, reinforcement learning, AI safety, or agent architectures • Experience at AI labs (Anthropic, OpenAI, DeepMind) or leading AI research groups • Production experience with agents : LangChain … Dr. Ayman Hindy, Marcel Cassard, and leading figures in AI, high frequency risk management and financial regulation. Early team of sharp, mission-driven builders. Learning Curve : You'll gain expertise in cutting-edge AI architectures, enterprise software, regulatory frameworks, and category creation simultaneously. This is one of those roles ...

Senior NLP Engineer (London)

Hiring Organisation: Glite Tech
Location: City of London, London, United Kingdom

English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting … models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense ...

AI Developer/Engineer

Hiring Organisation: Damia Group Ltd
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: £500 - £650 per day

Models (LLMs) and other generative architectures. Optimise pre-trained models (OpenAI, Anthropic, or open-source LLMs) for business use cases using prompt engineering and reinforcement learning. Experiment with model configurations to balance performance, cost, and scalability. Build robust data pipelines for continuous model improvement and retraining. Ensure compliance with ...

AI Architect (Wealth)

Hiring Organisation: Teksystems
Location: Central London, London, United Kingdom
Employment Type: Contract, Work From Home

Title: AI Architect (Wealth) Job Description This position is pivotal in designing AI and Machine Learning solutions on cloud-based platforms, exploring emerging AI trends, developing proof-of-concepts, and collaborating with internal and external ecosystems to advance these concepts to production. The role demands expertise in designing … least 6-10 years of hands-on development and architectural experience. Proficiency in Python, PyTorch, TensorFlow, or similar frameworks. experience with supervised, unsupervised, and reinforcement learning. Solid grounding in Natural Language Processing (NLP) concepts such as tokenisation, embeddings, semantic search, text classification, and summarisation. Strong understanding of Large Language ...

Senior AI / ML Engineer - Bristol or Bath (hybrid)

Hiring Organisation: Method-Resourcing
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Permanent

path toward deployment and commercial relevance. Initial focus areas include: Agentic AI systems Energy systems flexibility Creative industries applications Emerging work across generative AI, reinforcement learning, neuro-symbolic AI, verifiable AI, and MCP-style architectures You will work closely with academic partners, industry collaborators, and funding bodies, translating ...

AI Engineers

Hiring Organisation: Stackstudio Digital Ltd
Location: Redhill, Surrey, South East, United Kingdom
Employment Type: Contract, Work From Home
Contract Rate: From £500 to £550 per day

vector search technology. Collaborate with cross-functional teams to integrate AI models into products. Stay updated with the latest advancements in AI and machine learning technologies. Conduct research to improve existing AI systems and develop new approaches. Knowledge of NLP, computer vision, or reinforcement learning. Experience deploying … models in production environments. Your Profile Essential Skills/Knowledge/Experience Proven experience in AI, machine learning, or deep learning. Proficiency in programming languages such as Python, R, or Java. Experience with AI frameworks like TensorFlow, PyTorch, or Keras. Experience with large language models (GPT, BERT, etc.). ...