3 of 3 Remote/Hybrid Reinforcement Learning Jobs in London

Senior Data Scientist

Hiring Organisation
Anson Mccade
Location
London, United Kingdom
Employment Type
Permanent
Responsibilities End-to-End Delivery: Lead the technical execution of AI projects, from initial problem discovery and hypothesis testing to deploying production-grade machine learning models. Strategic Advisory: Act as a "translator" between technical complexity and business value. You will work closely with C-suite stakeholders to identify … solve their most pressing strategic challenges. Technical Leadership: Architect robust, scalable data pipelines and state-of-the-art models (including LLMs, Reinforcement Learning, or Bayesian Inference) tailored to specific client needs. Mentorship: Guide and upskill junior Data Scientists, fostering a culture of rigorous peer review, clean coding standards ...

Machine Learning Engineer PyTorch LLM

Hiring Organisation
Client Server
Location
East London, London, United Kingdom
Employment Type
Permanent, Work From Home
Machine Learning Engineer (PyTorch LLM) London onsite to £110k Do you have expertise with Machine Learning in production? You could be progressing your career at a London based tech start-up with £5 million in recent pre-seed funding, in an impactful role that you'll shape. … Holidays) Daily lunch, monthly breakfasts Dog friendly office Pension Monthly socials Impactful role that you can shape and influence Your role: As a Machine Learning Engineer you'll take open-source LLMs (code and general models) and turn them into high-performance software engineer agents using supervised fine tuning ...

Data Scientist - Inside IR35 - Hybrid

Hiring Organisation
Halian Technology Limited
Location
Croydon, Surrey, South East, United Kingdom
Employment Type
Contract
Role We are recruiting on behalf of a mobility technology business building intelligent fleet orchestration systems. This role suits an experienced Applied Machine Learning Engineer or Data Scientist comfortable working with messy real-world data, operational constraints, and production systems. Youll join a small, high-calibre team solving complex … years Geospatial data experience (H3, GeoPandas, PostGIS or similar) Optimisation/operations research exposure Logistics/mobility/marketplace domain experience Nice to Have Reinforcement learning Simulation modelling Experience deploying models into cloud environments Experimentation frameworks (A/B testing, model validation at scale) How to Apply ...