City of London, London, United Kingdom Hybrid/Remote Options
microTECH Global LTD
Reinforcement Learning (policy optimisation, reward modelling, RLHF). Hands-on experience training/fine-tuning generative models (LLMs, diffusion, transformers, GANs). Strong knowledge of deep learning frameworks (PyTorch, JAX, TensorFlow). Proficiency in Python and standard ML libraries. Solid foundations in probability, optimisation, and statistics. Experience working with large-scale distributed training on GPUs/TPUs. If this …
Central London, London, England, United Kingdom Hybrid/Remote Options
Opus Recruitment Solutions Ltd
… are also on the lookout for candidates who have: deep familiarity with the Python data ecosystem; an understanding of Jupyter notebooks; exposure to machine learning libraries like PyTorch, XGBoost, and JAX; an understanding of crypto or traditional financial markets; strong API design and documentation skills. What do you get in return? Up to £250k base (depending on experience); 3 days in …
at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. … of high-quality research, evidenced by publications in reputable scientific journals or conferences. Expertise in software development and programming in Python. Experience with deep learning frameworks such as PyTorch, JAX, and/or TensorFlow. Excellent ability to communicate complex ideas clearly, both verbally and in writing, and to work effectively within a multidisciplinary team. Strong analytical and critical thinking …
AI solutions and are hiring exceptional researchers who can bridge the gap between cutting-edge generative AI research and real-world applications. A world-class team with alumni from Google Brain, Meta AI, DeepMind, Stanford, Oxford. Unlimited research freedom with strong industry impact. What You'll Do: Pioneer multimodal AI research combining language models with advanced generative techniques; Lead … record. Deep expertise in diffusion models and generative AI; proven experience with text-to-image generation, multimodal AI; large-scale model training experience (distributed systems, GPUs); proficiency in PyTorch, JAX, or similar frameworks; enterprise AI deployment experience; safety and alignment research background; multilingual/multicultural AI expertise. Highly Valued: a background at top-tier AI labs (DeepMind, OpenAI, Meta …