6 of 6 Remote/Hybrid Reinforcement Learning Jobs in London

Software Engineer, RL Data

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
down to reading transcripts, supporting users, and wrangling vendors. The company's RL Data team builds the systems that produce high‐quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data … Effective use of AI tools in your own day‐to‐day work. Care about the societal impacts of your work. Preferred qualifications Experience with reinforcement learning on LLMs, particularly on the data side: creating evals, environments, rewards, graders, or training data. Experience helping organizations use AI more effectively ...

Research Engineer, RL Scaling Science

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
policy experts, and business leaders working together to build beneficial AI systems. About the role The company's RL Scaling Science team studies how reinforcement learning behaves as we scale it (across model size, compute, and task horizon) and turns that understanding into the training recipes behind … scale Partner closely with adjacent RL teams across research and engineering and advance our overall RL stack Minimum qualifications Strong empirical research skills in Reinforcement Learning, large-scale ML training, or a closely adjacent area Demonstrated ability to own large experiments end-to-end, from design through interpretation ...

Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … dedicated core Machine Learning research division of Thomson Reuters. We are focused on research and development, with a particular focus on advanced algorithms and training techniques for Large Language Models (LLMs). We are building a strong foundation of research capabilities across different areas and are looking for scientists ...

Manager, Lead Research Scientist, LLM Agents (Foundational Research)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research … sleeves and participate in designing, coding, conducting experiments, and translating findings into concrete deliverables. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning, reasoning & complex workflows (e.g., Reasoning Models, LLMs + Knowledge Graphs, Test ...

Senior Robotics Software Engineer

Hiring Organisation
Your Tech Future
Location
South West London, London, United Kingdom
Employment Type
Permanent
Salary
£90,000
Mapping) Navigation and autonomy systems Sensor fusion and state estimation Gazebo, Isaac Sim, MATLAB/Simulink or similar simulation platforms Real-time systems development Reinforcement learning Pybind11 What We're Looking For Demonstrable experience delivering robotics projects Strong software engineering principles and coding standards Ability to take ownership ...

Senior Robotics Software Engineer

Hiring Organisation
17918
Location
South Ruislip, West London, United Kingdom
Mapping) Navigation and autonomy systems Sensor fusion and state estimation Gazebo, Isaac Sim, MATLAB/Simulink or similar simulation platforms Real-time systems development Reinforcement learning Pybind11 What We're Looking For Demonstrable experience delivering robotics projects Strong software engineering principles and coding standards Ability to take ownership ...