2 of 2 Permanent Reinforcement Learning Jobs in South London

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
South London, UK
Employment Type
Full-time
overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

Data Scientist

Hiring Organisation
Odysse Ltd
Location
Croydon, England, United Kingdom
data infrastructure that supports both today’s human-driven fleets and tomorrow’s autonomous mobility networks. This is a hands-on applied machine learning role focused on building and improving decision systems that directly influence live fleet operations and contribute to long-term autonomous fleet orchestration capabilities. You will … real-world data and optimise for practical impact rather than just model accuracy Has exposure to advanced modelling approaches (e.g. neural networks, optimisation, or reinforcement learning) Nice to Have Experience with time-series or geospatial datasets, experimentation or optimisation problems Experience in logistics, marketplaces, mobility systems, ride-hailing ...