Reinforcement Learning Jobs in the UK excluding London

11 of 11 Reinforcement Learning Jobs in the UK excluding London

Postdocs and Research Fellows: Probabilistic machine learning, Centre for AI Fundamentals, Manc ...

Manchester, Lancashire, United Kingdom
The International Society for Bayesian Analysis
Postdocs and Research Fellows: Probabilistic machine learning, Centre for AI Fundamentals, Manchester, UK (deadline July 27, 2025). Jul 14, 2025. I am hiring in my machine learning group in Manchester, UK, funded by a UKRI Turing AI World-Leading Researcher Fellowship, in the Manchester Centre for AI Fundamentals. We are particularly interested in developing new … machine learning for research, which involves AI4Science, and for health, sequential decision making and experimental design under uncertainty, and collaborative AI. Machine learning keywords include Bayesian inference, distribution shifts, generative modelling, human-in-the-loop learning, privacy-preserving learning, reinforcement learning, inverse reinforcement learning, computational rationality and user modelling, and simulation-based … inference. The positions come with excellent opportunities for collaboration with machine learning researchers in the ELLIS Unit Manchester and the rest of the ELLIS network, and with researchers in other fields for AI for Research. These are fixed-term positions for a year, but there will be opportunities for excellent researchers to continue in my team in Manchester or …
Employment Type: Permanent
Salary: GBP Annual

Power Platform - London, UK

London, South East, England, United Kingdom
Hybrid / WFH Options
Randstad Technologies
experience in an Investment Banking environment would be a plus. Spanish would be a plus. Mandatory Skills: Python, ServiceNow Orchestrator, Azure Cognitive Services, GenAI - LLMOps, RPA - Microsoft Power Automate, Machine Learning - AIOPS, Deep Learning - AIOPS, Reinforcement Learning - AIOPS. Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to …
Employment Type: Full-Time
Salary: £60,000 - £65,000 per annum

Senior Research Scientist: Data Science and Machine Learning AIP

Chelmsford, Essex, United Kingdom
Hybrid / WFH Options
NLP PEOPLE
of interest to you. The Data and Decision Support Capability has teams working across AI/ML areas such as RF, EW, radar, sonar, distributed sensing-processing, data fusion, reinforcement learning, autonomy, image analysis and computer vision, generative AI, NLP, knowledge graphs and more. You will work with these colleagues in multi-disciplinary teams. Typical Responsibilities: Lead technical … and/or statistical signal processing to sequential data and decision-making post PhD. Experience in software development for proof of concept in Python. Experience with machine and deep learning frameworks: TensorFlow, PyTorch, scikit-learn, etc. Domains of Particular Interest: RF communications and CEMA; Electronic or Electromagnetic Warfare (EW); Tracking and sensor data fusion; Radar signal processing; Acoustic data … and Project Management teams that design and implement defence solutions and digital transformation projects. Company: BAE Systems. Experience and Education: Senior (5+ years of experience). Tagged as: Industry, Machine Learning, NLP, United Kingdom
Employment Type: Permanent
Salary: GBP Annual

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

London, South East England, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning. About the Client: Our client is a pioneering robotics startup developing the world's most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01, is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What You'll Do: Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL). Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment. Hire, mentor, and lead a high-calibre team of research scientists and engineers. What We're Looking For: 6+ years' experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning. Proven track record applying RL …

Lead, Vision-Language-Action VLA, Behaviour Learning - Hybrid

West London, South East England, United Kingdom
Hybrid / WFH Options
Skillsbay Limited
Role: Lead, Vision-Language-Action (VLA)/Behaviour Learning. About the Client: Our client is a pioneering robotics startup developing the world's most advanced, reliable, and commercially scalable humanoid robots. Their mission is to create safe, next-generation robots that integrate seamlessly into daily life and amplify human capacity. Their first robot, HMND 01, is designed for industrial automation … understand, and act in complex real-world environments. The role combines cutting-edge AI research with practical deployment in robotics. What You'll Do: Define and drive strategy for representation learning, behaviour cloning, and reinforcement learning (RL). Lead large-scale training of multi-modal LLM/VLM/VLA systems integrating inputs such as vision, audio, proprioception … optimise models for real-time deployment. Hire, mentor, and lead a high-calibre team of research scientists and engineers. What We're Looking For: 6+ years' experience building deep learning systems, including 2+ years in technical team leadership. Hands-on expertise with LLM/VLM architecture design, billion-parameter training, and fine-tuning. Proven track record applying RL …

Python Developer

Glasgow, Scotland, United Kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc.
- Coaching and mentoring other team members, as appropriate.
Essential Skills:
- OCR, Object Detection and LLM analysis implementation
- Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer …
Desirable Skills:
- Custom model architecture design and implementation
- Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods
- Multi-modal AI systems combining text, image, and structured data
- Reinforcement Learning from Human Feedback (RLHF) for model alignment
- Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management
- Model versioning and experiment tracking (MLflow, Weights & Biases …

Python Developer

Milton, Central Scotland, United Kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc.
- Coaching and mentoring other team members, as appropriate.
Essential Skills:
- OCR, Object Detection and LLM analysis implementation
- Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer …
Desirable Skills:
- Custom model architecture design and implementation
- Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods
- Multi-modal AI systems combining text, image, and structured data
- Reinforcement Learning from Human Feedback (RLHF) for model alignment
- Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management
- Model versioning and experiment tracking (MLflow, Weights & Biases …

Python Developer

Paisley, Central Scotland, United Kingdom
Hybrid / WFH Options
Venesky Brown
programming, code reviews, system design and requirements analysis/refinement, etc.
- Coaching and mentoring other team members, as appropriate.
Essential Skills:
- OCR, Object Detection and LLM analysis implementation
- Machine Learning & AI Libraries including Transformers/Hugging Face for working with pre-trained LLMs, fine tuning, and inference, PyTorch for deep learning model development and training, OpenCV for computer …
Desirable Skills:
- Custom model architecture design and implementation
- Advanced fine-tuning techniques including LoRA, QLoRA, and parameter efficient methods
- Multi-modal AI systems combining text, image, and structured data
- Reinforcement Learning from Human Feedback (RLHF) for model alignment
- Apache Airflow/Dagster for ML workflow orchestration and ETL pipeline management
- Model versioning and experiment tracking (MLflow, Weights & Biases …

Applied Scientist, Generative AI Innovation Center

Cambridge, Cambridgeshire, United Kingdom
Amazon
Applied Scientist, Generative AI Innovation Center. Job ID: Amazon Web Services Singapore Private Limited. Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply Generative AI algorithms to solve real world problems with significant impact? The Generative AI Innovation Center helps AWS customers implement Generative AI solutions and realize transformational … our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship & Career Growth: We're continuously raising our …/ML/NLP conferences or journals. PREFERRED QUALIFICATIONS: 2+ years demonstrated experience with Large Language Model (LLM) and Foundational Model post-training, continual pre-training, fine-tuning, or reinforcement learning techniques. Demonstrated experience with building LLM-powered agentic workflows, orchestration, and agent customization. Track record of building and deploying ML models at scale. Experience with model optimization …
Employment Type: Permanent
Salary: GBP Annual

GenAI Prompt Engineer

South East London, London, United Kingdom
Certain Advantage
Face. Proficiency in Python and familiarity with tools like LangChain, PromptLayer, or similar. Excellent analytical, problem-solving, and communication skills. Preferred Qualifications: Experience with prompt tuning, fine-tuning, or reinforcement learning from human feedback (RLHF). Familiarity with multi-modal AI systems (e.g., text-to-image, speech-to-text). Background in UX writing, technical writing, or computational …
Employment Type: Contract

LLM Researcher

London, South East, England, United Kingdom
Hybrid / WFH Options
MicroTECH Global Ltd
and regulatory requirements in fintech (SOC2, PCI-DSS, GDPR). Ability to thrive in a fast-moving startup environment. Desirables: Background in fintech, payments, or treasury systems. Experience with reinforcement learning with human feedback (RLHF).
Employment Type: Full-Time
Salary: Salary negotiable

Reinforcement Learning, the UK excluding London
10th Percentile: £68,000
25th Percentile: £71,875
Median: £80,000
75th Percentile: £109,237
90th Percentile: £121,250