3 of 3 Remote/Hybrid Experimental Design Jobs in Berkshire

Senior AI Engineer | GenAI | Information Retrieval | ML | NLP | Hybrid London | Up to £130k

Hiring Organisation
Maze
Location
Slough, Berkshire, UK
Employment Type
Full-time
Information Retrieval (Research-Focused) Drive applied research within a production-focused Information Retrieval team. You will: Research and improve retrieval and NLP models. Design experiments, run evaluations, and iterate on modelling approaches. Collaborate closely with engineering to transition research into production. You bring: Strong applied research experience ...

Engineering Manager, Machine Learning, Marketplace, Ecommerce, | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000

Hiring Organisation
Owen Thomas | Pending B CorpTM
Location
Slough, Berkshire, UK
Employment Type
Full-time
Partner with Product, Data, and Engineering leaders to shape and deliver an actionable ML strategy that drives engagement, conversion, and growth. Oversee the design, training, and deployment of search and recommendation models — from data strategy to monitoring and performance optimisation. Collaborate with platform and MLOps teams to ensure … rank, personalisation algorithms). Hands-on experience with modern ML toolchains — Python, Spark, and frameworks such as PyTorch or TensorFlow. Strong grounding in experimental design, A/B testing, and the use of offline/online metrics to guide product strategy. Excellent communication and stakeholder management skills ...

Artificial Intelligence Researcher

Hiring Organisation
microTECH Global LTD
Location
Slough, Berkshire, UK
Employment Type
Full-time
client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring they are safe, controllable, and capable of producing high-quality outputs … learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. Design experiments to align generative outputs with human and organisational preferences. Collaborate with researchers, engineers, and human feedback teams to build scalable alignment pipelines. Publish ...