2 of 2 Remote/Hybrid Reinforcement Learning Jobs in London

Data Scientist

Hiring Organisation
Randstad Technologies Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£450 - £480/day
preprocessing to ensure high-quality inputs for ML models. Model Development: Select and train appropriate architectures (BERT, GPT, etc.) using supervised, unsupervised, or reinforcement learning strategies. Prompt Engineering: Design, test, and iterate on complex prompts to elicit high-quality responses from LLMs while mitigating unintended behaviors. Evaluation & Optimization … establish automated monitoring systems to track drift and performance. Technical Requirements Core AI/ML: Strong experience in ML algorithms, LLM architectures, and deep learning frameworks. Generative AI: Proven expertise in Prompt Engineering and fine-tuning pre-trained models. Engineering: Proficiency in Python and experience designing data pipelines ...

Head of Applied AI Research

Hiring Organisation
Higher - AI recruitment
Location
City of London, London, United Kingdom
publications role. It is a role for someone who wants to do serious science and see it matter. WHAT YOU BRING PhD in Machine Learning from a leading research institution 7+ years leading applied AI research teams in a commercial environment with access to large-scale, unique data Deep … expertise in at least two of: LLMs and foundation models (pretraining, fine-tuning, distillation), physics-informed neural networks, or reinforcement learning Experience with agent frameworks, tool-use planning and workflow orchestration Systems-level fluency across data pipelines, training infrastructure and inference (PyTorch, JAX, TensorFlow) Strong MLOps discipline ...