Permanent Reinforcement Learning Jobs in the UK excluding London

19 of 19 Permanent Reinforcement Learning Jobs in the UK excluding London

Power Platform - London, UK

London, South East, England, United Kingdom
Hybrid / WFH Options
Randstad Technologies
experience in Investment Banking environment would be a plus Spanish would be a plus Mandatory Skills : Python, ServiceNow Orchestrator, Azure Cognitive Services, GenAI - LLMOps, RPA - Microsoft Power Automate, Machine Learning - AIOPS, Deep Learning - AIOPS, Reinforcement Learning - AIOPS Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to More ❯
Employment Type: Full-Time
Salary: £60,000 - £65,000 per annum
Posted:

Machine Learning Researcher

london, south east england, united kingdom
Darcie Talent
A PhD level Machine Learning Scientist is needed to join this amazing research team based near King’s Cross. This is a unique opportunity to work at the forefront of AI research, contributing to projects that push the boundaries of what’s possible in machine learning. You will play a key role in designing, implementing, and advancing cutting-edge … methods in: Reinforcement Learning (RL) Large Language Models (LLMs) Optimisation for Deep Learning You’ll work alongside a team of world-class researchers, developing new approaches, publishing at top-tier venues, and translating research into impactful real-world applications. Experience Required: A PhD in Machine Learning, Computer Science, Mathematics, Physics, or a related field A track … of publications at leading AI conferences (e.g., NeurIPS, ICML, ICLR, ACL) (This is essential) Deep expertise in at least one of: RL, LLMs, or large-scale Optimisation for deep learning Good programming skills in Python and experience with deep learning frameworks A passion for advancing AI research and contributing to the global ML community Key selling points Work More ❯
Posted:

Machine Learning Researcher

slough, south east england, united kingdom
Darcie Talent
A PhD level Machine Learning Scientist is needed to join this amazing research team based near King’s Cross. This is a unique opportunity to work at the forefront of AI research, contributing to projects that push the boundaries of what’s possible in machine learning. You will play a key role in designing, implementing, and advancing cutting-edge … methods in: Reinforcement Learning (RL) Large Language Models (LLMs) Optimisation for Deep Learning You’ll work alongside a team of world-class researchers, developing new approaches, publishing at top-tier venues, and translating research into impactful real-world applications. Experience Required: A PhD in Machine Learning, Computer Science, Mathematics, Physics, or a related field A track … of publications at leading AI conferences (e.g., NeurIPS, ICML, ICLR, ACL) (This is essential) Deep expertise in at least one of: RL, LLMs, or large-scale Optimisation for deep learning Good programming skills in Python and experience with deep learning frameworks A passion for advancing AI research and contributing to the global ML community Key selling points Work More ❯
Posted:

Machine Learning Researcher

london (city of london), south east england, united kingdom
Darcie Talent
A PhD level Machine Learning Scientist is needed to join this amazing research team based near King’s Cross. This is a unique opportunity to work at the forefront of AI research, contributing to projects that push the boundaries of what’s possible in machine learning. You will play a key role in designing, implementing, and advancing cutting-edge … methods in: Reinforcement Learning (RL) Large Language Models (LLMs) Optimisation for Deep Learning You’ll work alongside a team of world-class researchers, developing new approaches, publishing at top-tier venues, and translating research into impactful real-world applications. Experience Required: A PhD in Machine Learning, Computer Science, Mathematics, Physics, or a related field A track … of publications at leading AI conferences (e.g., NeurIPS, ICML, ICLR, ACL) (This is essential) Deep expertise in at least one of: RL, LLMs, or large-scale Optimisation for deep learning Good programming skills in Python and experience with deep learning frameworks A passion for advancing AI research and contributing to the global ML community Key selling points Work More ❯
Posted:

Senior Research Scientist: Data Science and Machine Learning AIP

Chelmsford, Essex, United Kingdom
Hybrid / WFH Options
NLP PEOPLE
of interest to you. The Data and Decision Support Capability has teams working across AI/ML areas such as RF, EW, radar, sonar, distributed sensing-processing, data fusion, reinforcement learning, autonomy, image analysis and computer vision, generative AI, NLP, knowledge graphs and more. You will work with these colleagues in multi-disciplinary teams. Typical Responsibilities Lead technical … and/or statistical signal processing to sequential data and decision-making post PhD. Experience in software development for proof of concept in Python. Experience with machine and deep learning frameworks: TensorFlow, PyTorch, scikit-learn, etc. Domains of Particular Interest RF communications and CEMA Electronic or Electromagnetic Warfare (EW) Tracking and sensor data fusion Radar signal processing Acoustic data … and Project Management teams that design and implement defence solutions and digital transformation projects. Company BAE Systems Experience and Education Senior (5+ years of experience) Tagged as: Industry, Machine Learning, NLP, United Kingdom More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Science and Machine Learning Scientist

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
ML algorithms and statistical signal processing techniques , with applications across sectors including space, defence, security, and commercial industries. You'll work at the forefront of innovation, applying advanced machine learning and data science techniques to time-series, sensor and sequential data , delivering high-impact research, prototypes, and demonstrators. You'll also collaborate with academic partners and multidisciplinary teams working … across areas such as radar, sonar, RF, distributed sensing, reinforcement learning, computer vision, NLP, and generative AI. Key Responsibilities Lead delivery of technical research projects, mentoring junior researchers. Develop prototypes, proof-of-concepts, and novel inference algorithms. Produce technical reports, proposals, and present findings to technical and non-technical stakeholders. Contribute to publications, patents, and academic partnerships. Work … on cutting-edge AI/ML research that supports real-world applications. Essential Skills & Experience PhD (or equivalent industry experience) in a relevant discipline. Strong background in Machine Learning and/or statistical signal processing applied to sequential/sensor data. Proficiency in Python with experience in frameworks such as TensorFlow, PyTorch, or scikit-learn. Demonstrated expertise in developing More ❯
Employment Type: Permanent, Work From Home
Salary: £70,000
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

london, south east england, united kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Posted:

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

west london, south east england, united kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning About the Client Our client is a pioneering robotics startup developing the worlds most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01 , is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What Youll Do Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL) . Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment . Hire, mentor, and lead a high-calibre team of research scientists and engineers. What Were Looking For 6+ years experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning . Proven track record applying RL More ❯
Posted:

Python Developer

Glasgow, Scotland, United Kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc. - Coaching and mentoring other team members, as appropriate. Essential Skills: - OCR, Object Detection and LLM analysis implementation - Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer … Desirable Skills: - Custom model architecture design and implementation - Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods - Multi-modal AI systems combining text, image, and structured data - Reinforcement Learning from Human Feedback (RLHF) for model alignment - Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management - Model versioning and experiment tracking (MLflow, Weights & Biases More ❯
Posted:

Python Developer

paisley, central scotland, united kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc. - Coaching and mentoring other team members, as appropriate. Essential Skills: - OCR, Object Detection and LLM analysis implementation - Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer … Desirable Skills: - Custom model architecture design and implementation - Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods - Multi-modal AI systems combining text, image, and structured data - Reinforcement Learning from Human Feedback (RLHF) for model alignment - Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management - Model versioning and experiment tracking (MLflow, Weights & Biases More ❯
Posted:

Python Developer

milton, central scotland, united kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc. - Coaching and mentoring other team members, as appropriate. Essential Skills: - OCR, Object Detection and LLM analysis implementation - Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer … Desirable Skills: - Custom model architecture design and implementation - Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods - Multi-modal AI systems combining text, image, and structured data - Reinforcement Learning from Human Feedback (RLHF) for model alignment - Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management - Model versioning and experiment tracking (MLflow, Weights & Biases More ❯
Posted:

Applied Scientist, Generative AI Innovation Center

Cambridge, Cambridgeshire, United Kingdom
Amazon
Applied Scientist, Generative AI Innovation Center Job ID: Amazon Web Services Singapore Private Limited Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational … our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth We're continuously raising our …/ML/NLP conferences or journals PREFERRED QUALIFICATIONS 2+ years demonstrated experience with Large Language Model (LLM) and Foundational Model post-training, continual pre-training, fine-tuning, or reinforcement learning techniques. Demonstrated experience with building LLM-powered agentic workflow, orchestration, and agent customization Track record of building and deploying ML models at scale Experience with model optimization More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Python Engineer

Edinburgh, United Kingdom
Harvey Nash Group
and actively engage in team events and wider communities of practice. Essential Skills & Experience Commercial experience with AI/ML technology: OCR, Object Detection and LLM analysis implementation Machine Learning & AI Libraries including: Transformers/Hugging Face for working with pre-trained LLMs, fine-tuning, and inference PyTorch for deep learning model development and training OpenCV for computer … ML Technologies Custom model architecture design and implementation Advanced fine-tuning techniques including LoRA, QLoRA, and parameter-efficient methods Multi-modal AI systems combining text, image, and structured data Reinforcement Learning from Human Feedback (RLHF) for model alignment Production ML Systems Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management Model versioning and experiment tracking More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Tech Lead

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
Lead Technologist - Artificial Intelligence & Machine Learning Location: Great Baddow (Hybrid - 2 days onsite per week) Salary: Up to £90,000 + Benefits The Opportunity A leading digital intelligence organisation is seeking a Lead Technologist to join their rapidly growing Data and Decision Support capability . This is an exciting opportunity to work at the forefront of Artificial Intelligence and … Machine Learning (AI/ML) , delivering impactful solutions across defence, security, space, and commercial sectors. You'll play a pivotal role in shaping and guiding innovative AI/ML projects while mentoring colleagues, engaging with customers, and driving forward research collaborations with academic and industry partners. About the Role You will be responsible for: Leading the technical delivery of … Dstl, NS, EPSRC). Expertise in at least one of the following areas, with broad knowledge across AI/ML: AI/ML for imagery (including remote sensing applications) Reinforcement learning Natural Language Processing (NLP) and Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML for RF, EW, radar, sonar, or acoustics Autonomy More ❯
Employment Type: Permanent, Work From Home
Posted:

AI Tech Lead

Basildon, Essex, United Kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
Lead Technologist - Artificial Intelligence & Machine Learning Location: Great Baddow (Hybrid - 2 days onsite per week) Salary: Up to £90,000 + Benefits The Opportunity A leading digital intelligence organisation is seeking a Lead Technologist to join their rapidly growing Data and Decision Support capability . This is an exciting opportunity to work at the forefront of Artificial Intelligence and … Machine Learning (AI/ML) , delivering impactful solutions across defence, security, space, and commercial sectors. You'll play a pivotal role in shaping and guiding innovative AI/ML projects while mentoring colleagues, engaging with customers, and driving forward research collaborations with academic and industry partners. About the Role You will be responsible for: Leading the technical delivery of … Dstl, NS, EPSRC). Expertise in at least one of the following areas, with broad knowledge across AI/ML: AI/ML for imagery (including remote sensing applications) Reinforcement learning Natural Language Processing (NLP) and Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML for RF, EW, radar, sonar, or acoustics Autonomy More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Tech Lead

Chelmsford, Essex, United Kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
Lead Technologist - Artificial Intelligence & Machine Learning Location: Great Baddow (Hybrid - 2 days onsite per week) Salary: Up to £90,000 + Benefits The Opportunity A leading digital intelligence organisation is seeking a Lead Technologist to join their rapidly growing Data and Decision Support capability . This is an exciting opportunity to work at the forefront of Artificial Intelligence and … Machine Learning (AI/ML) , delivering impactful solutions across defence, security, space, and commercial sectors. You'll play a pivotal role in shaping and guiding innovative AI/ML projects while mentoring colleagues, engaging with customers, and driving forward research collaborations with academic and industry partners. About the Role You will be responsible for: Leading the technical delivery of … Dstl, NS, EPSRC). Expertise in at least one of the following areas, with broad knowledge across AI/ML: AI/ML for imagery (including remote sensing applications) Reinforcement learning Natural Language Processing (NLP) and Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML for RF, EW, radar, sonar, or acoustics Autonomy More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI Tech Lead

basildon, east anglia, united kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
Lead Technologist - Artificial Intelligence & Machine Learning Location: Great Baddow (Hybrid - 2 days onsite per week) Salary: Up to £90,000 + Benefits The Opportunity A leading digital intelligence organisation is seeking a Lead Technologist to join their rapidly growing Data and Decision Support capability . This is an exciting opportunity to work at the forefront of Artificial Intelligence and … Machine Learning (AI/ML) , delivering impactful solutions across defence, security, space, and commercial sectors. You'll play a pivotal role in shaping and guiding innovative AI/ML projects while mentoring colleagues, engaging with customers, and driving forward research collaborations with academic and industry partners. About the Role You will be responsible for: Leading the technical delivery of … Dstl, NS, EPSRC). Expertise in at least one of the following areas, with broad knowledge across AI/ML: AI/ML for imagery (including remote sensing applications) Reinforcement learning Natural Language Processing (NLP) and Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML for RF, EW, radar, sonar, or acoustics Autonomy More ❯
Posted:

AI Tech Lead

chelmsford, east anglia, united kingdom
Hybrid / WFH Options
Stott & May Professional Search Limited
Lead Technologist - Artificial Intelligence & Machine Learning Location: Great Baddow (Hybrid - 2 days onsite per week) Salary: Up to £90,000 + Benefits The Opportunity A leading digital intelligence organisation is seeking a Lead Technologist to join their rapidly growing Data and Decision Support capability . This is an exciting opportunity to work at the forefront of Artificial Intelligence and … Machine Learning (AI/ML) , delivering impactful solutions across defence, security, space, and commercial sectors. You'll play a pivotal role in shaping and guiding innovative AI/ML projects while mentoring colleagues, engaging with customers, and driving forward research collaborations with academic and industry partners. About the Role You will be responsible for: Leading the technical delivery of … Dstl, NS, EPSRC). Expertise in at least one of the following areas, with broad knowledge across AI/ML: AI/ML for imagery (including remote sensing applications) Reinforcement learning Natural Language Processing (NLP) and Large Language Models (LLMs) Knowledge graphs and graph-based neural networks AI/ML for RF, EW, radar, sonar, or acoustics Autonomy More ❯
Posted:

LLM Researcher

London, South East, England, United Kingdom
Hybrid / WFH Options
MicroTECH Global Ltd
and regulatory requirements in fintech (SOC2, PCI-DSS, GDPR). Ability to thrive in a fast-moving startup environment. Desirables: Background in fintech, payments, or treasury systems. Experience with reinforcement learning with human feedback (RLHF). More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:
Reinforcement Learning
the UK excluding London
10th Percentile
£68,000
25th Percentile
£71,875
Median
£80,000
75th Percentile
£109,237
90th Percentile
£121,250