4 of 4 Remote/Hybrid Reinforcement Learning Jobs in London

Data Science Manager

Hiring Organisation
Aristocrat
Location
Greater London, United Kingdom
Employment Type
Full Time
Data Science, Games Tech, you will be a transformational leader, responsible for guiding and inspiring a dedicated team of data scientists and machine learning engineers. In this role, you’ll drive the creation of groundbreaking data solutions that enhance gameplay, improve user engagement, and optimize business outcomes. … encourage and influence stakeholders. Key Technical Responsibilities Data Science Best Practices: Drive best practices in A/B-testing, predictive modelling, user clustering and reinforcement learning, to continually set the standard on data science benefit. Engineering Best Practices: Be responsible for the implementation of the best software engineering ...

Machine Learning Engineer PyTorch LLM

Hiring Organisation
Client Server
Location
East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Machine Learning Engineer (PyTorch LLM) London onsite to £110k Do you have expertise with Machine Learning in production? You could be progressing your career at a London based tech start-up with £5 million in recent pre-seed funding, in an impactful role that you'll shape. … Holidays) Daily lunch, monthly breakfasts Dog friendly office Pension Monthly socials Impactful role that you can shape and influence Your role: As a Machine Learning Engineer you'll take open-source LLMs (code and general models) and turn them into high-performance software engineer agents using supervised fine tuning ...

Data Scientist

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £480/day
preprocessing to ensure high-quality inputs for ML models. Model Development: Select and train appropriate architectures (BERT, GPT, etc.) using supervised, unsupervised, or reinforcement learning strategies. Prompt Engineering: Design, test, and iterate on complex prompts to elicit high-quality responses from LLMs while mitigating unintended behaviors. Evaluation & Optimization … establish automated monitoring systems to track drift and performance. Technical Requirements Core AI/ML: Strong experience in ML algorithms, LLM architectures, and deep learning frameworks. Generative AI: Proven expertise in Prompt Engineering and fine-tuning pre-trained models. Engineering: Proficiency in Python and experience designing data pipelines ...

Head of Applied AI Research

Hiring Organisation
Higher - AI recruitment
Location
City of London, London, United Kingdom
publications role. It is a role for someone who wants to do serious science and see it matter. WHAT YOU BRING PhD in Machine Learning from a leading research institution 7+ years leading applied AI research teams in a commercial environment with access to large-scale, unique data Deep … expertise in at least two of: LLMs and foundation models (pretraining, fine-tuning, distillation), physics-informed neural networks, or reinforcement learning Experience with agent frameworks, tool-use planning and workflow orchestration Systems-level fluency across data pipelines, training infrastructure and inference (PyTorch, JAX, TensorFlow) Strong MLOps discipline ...