Research Scientist

Research Scientist (LLM/RL)

Frontier AI | London or Paris

Compensation: Up to £250k + equity package.

About

This is a well-funded frontier AI startup building state-of-the-art agentic systems that automate complex, multi-step tasks normally done by humans.

The team combines deep research (in computer use and proprietary models) with forward-deployed implementation alongside enterprise clients.

The Models team builds the core LLMs and vision-language models behind these agentic systems. The focus is on training models that work well for agents in practice: strong instruction following, reliable tool use, and good decision-making at a given inference cost.

What you'll do

Research post-training methods for large multimodal language models, with a focus on RL and feedback-driven learning
Design reward models and large-scale reinforcement learning setups for instruction following and tool use
Build automated data collection pipelines using human and machine feedback
Develop evaluations that capture real capability gains (not just benchmark improvements)
Translate concrete product failures and use cases into new training signals

What you'll need

Strong research background combined with hands-on experience with LLM post-training, alignment, or reinforcement learning
Proficiency in Python and at least one major DL framework (PyTorch, JAX, or TensorFlow)
Experience training large models on distributed systems
Publications at top-tier conferences (NeurIPS, ICML, ICLR, ACL, CVPR, etc.)
Comfortable working in fast-moving, loosely specified research problems

Shortlisted candidates will be contacted within 48 hours.

Apply Now

Similar Jobs

Deep Learning Engineer

Hiring Organisation: Humanoid
Location: London Area, United Kingdom

Humanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient ...

Reinforcement Learning (RL) control Engineer

Hiring Organisation: Randstad Digital
Location: City of London, London, United Kingdom
Employment Type: Permanent

Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in ...

Reinforcement Learning RL control Engineer

Hiring Organisation: Randstad Technologies
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £120,000 per annum

Reinforcement Learning (RL) Engineer Manipulation London Based (5 days in office) Competitive salary A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in ...

Senior Deep Learning Engineer

Hiring Organisation: Randstad Digital
Location: City of London, London, United Kingdom
Employment Type: Permanent

We are seeking a Senior Deep Learning Engineer to build the future of Embodied AI . This is a role for those who work 'in the weeds' of model architecture and training loops not API wrappers or prompt engineers. The Mission ...

Senior Deep Learning Engineer

Hiring Organisation: Randstad Technologies
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £80,000 - £120,000 per annum

We are seeking a Senior Deep Learning Engineer to build the future of Embodied AI . This is a role for those who work "in the weeds" of model architecture and training loops not API wrappers or prompt engineers. The Mission ...

Machine Learning Engineer

Hiring Organisation: Wave Talent
Location: England, United Kingdom

An early-stage UK AI company is building software that removes friction from complex logistics workflows by automating document processing and decision-making. Their product is already live with customers, and the focus now is on moving faster, shipping weekly ...

Senior Machine Learning Engineer

Hiring Organisation: Jack & Jill
Location: City of London, London, United Kingdom

This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. She will pick the best candidates from Jack's network. The next step is to speak to Jack. Senior Machine Learning ...

Research Engineer (Agents)

Hiring Organisation: Native
Location: City of London, London, United Kingdom

About Native Native is a VC backed AI startup building the world’s most accurate AI for databases reasoning. This is hard . This is an unsolved problem that once solved will change how every enterprise operates. We recently published our ...

Machine Learning Engineer

Hiring Organisation: BioTalent
Location: City of London, London, United Kingdom

We are looking for an ML Engineer to work within a growing technology company developing advanced machine learning systems to enable large-scale, computation-driven workflows in a highly complex technical domain. The organisation focuses on translating cutting-edge research ...

Agentic AI Engineer

Hiring Organisation: Oscar Associates (UK) Limited
Location: United Kingdom
Employment Type: Contract
Salary: £300 - £375 per day

Agentic AI Engineer | Remote | OIR35 | Contract We're working with a fast-growing AI product consultancy building production-grade AI software for clients across multiple industries. They're looking for an Agentic AI Engineer to design and build intelligent, autonomous ...

Staff Machine Learning Engineer

Hiring Organisation: Harnham
Location: United Kingdom

Staff Machine Learning Engineer Remote (UK & EU) Up to £100,000 base + benefits We’re hiring for a global language technology company building production-grade machine learning systems used at enterprise scale across regulated industries. This is a Staff-level ...

Deep Learning Engineer

Hiring Organisation: Block MB
Location: City of London, London, United Kingdom

Deep Learning Engineer About the Company Join a fast-growing AI-driven technology company that’s modernising prediction and decision systems for complex, real-world industries. The organisation builds powerful AI platforms that help partners, particularly in travel and transportation ...

AI Researcher

Hiring Organisation: Lorien
Location: City of London, London, England, United Kingdom
Employment Type: Contractor
Salary: Salary negotiable

AI Researcher (Guardrails & Responsible AI) Hybrid Working - Edinburgh OR London - 2 days a week on site. Financial Services Lorien's leading banking client is looking for an AI Researcher -a curious, high-end thinker (ideal for a recent Master's ...

Chief Technology Officer

Hiring Organisation: Teqconnect
Location: United Kingdom

Teqconnect are on the lookout for a CTO to join our client, they are a one-stop shop for customers who want to enhance their business with proven, up-to-date solutions based on conversational AI and technologies beyond it ...

AI Engineer (Conversational AI)

Hiring Organisation: TalentCo
Location: United Kingdom

TalentCo are delighted to be partnering with an established PE-backed tech company looking to hire several high-calibre AI Engineer into their UK business, as they build a new Conversational AI platform from the ground up. These hires will ...

Research Scientist, Machine Learning

Hiring Organisation: SoCode Recruitment
Location: Cambridge, England, United Kingdom

Research Scientist – Machine Learning Cambridge / Hybrid (Flexible 1 day per week in the office) You MUST have a PhD to apply Open to recent graduates, through to experienced research Leaders (Salary will match your level) About the Company We build ...

ML Research Consultant

Hiring Organisation: Tec Partners
Location: Cambridge, Cambridgeshire, United Kingdom
Employment Type: Permanent
Salary: GBP Annual

Job: Machine Learning Research Consultant (Experienced) Location: Cambridge, UK (on-site) Contract: Permanent, Full-time We're recruiting on behalf of a UK-based technology consultancy delivering advanced AI and machine learning solutions to complex, real-world problems in defence ...

Founding Engineer

Hiring Organisation: Corecom Consulting
Location: Manchester, North West, United Kingdom
Employment Type: Permanent
Salary: £90,000

Senior Python Engineer (£70k-£90k) - AI-Driven Product Build This is a hands-on senior engineering role for someone who wants to work on real-world AI systems , not proofs of concept or hype projects. You'll be joining a ...

Principal Machine Learning Engineer

Hiring Organisation: Harnham - Data & Analytics Recruitment
Location: Manchester, Lancashire, England, United Kingdom
Employment Type: Full-Time
Salary: £85,000 - £100,000 per annum

Remote - Europe Staff/ Lead Machine Learning Engineer We are working with a global languages and translation company. They are focused on designing, developing, and deploying cutting-edge machine learning solutions across the company. Responsibilites: Leadership of technical projects Mentoring junior ...