Permanent Reinforcement Learning Jobs in the Thames Valley

18 of 18 Permanent Reinforcement Learning Jobs in the Thames Valley

Founding Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Explore Group
Founding Software Engineer (React and Python) About the Opportunity You will be joining an elite founding engineering team that includes the world’s number one authority in Multi Agent Reinforcement Learning, seasoned Meta Product leaders, and one of the only people in the UK who has trained a trillion token language model from scratch. This role is based … a pivotal role in shaping and scaling the technology platform from the ground up. You will work alongside leading researchers and product strategists at the intersection of cutting edge reinforcement learning and large scale foundational models. You will enjoy a rare opportunity to influence direction, architecture and go to market execution, with both immediate impact and long term … product strategists to refine MVPs, prototype innovative workflows and accelerate iterations to production • Scale and optimise infrastructure to support high throughput training, inference and deployment of both multi agent reinforcement learning systems and enormous language models • Implement robust MLOps best practices including data pipelines, model versioning, monitoring, automated deployment and continuous integration • Drive end to end ownership from More ❯
Posted:

Artificial Intelligence Researcher

slough, south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
or London, UK This is a permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring … LLMs, and generative modelling, helping us build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. Design experiments to align … and human feedback teams to build scalable alignment pipelines. Publish findings in top-tier AI conferences and contribute to open-source frameworks. Key Requirements: PhD in Computer Science, Machine Learning, or related field. Publications at NeurIPS, ICML, ICLR, ACL, or related venues. Deep expertise in Reinforcement Learning (policy optimisation, reward modelling, RLHF). Hands-on experience training More ❯
Posted:

Machine Learning Engineer (PhD)

slough, south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
leader in advanced computing, with a dedicated research team focused on applying artificial intelligence to next-generation semiconductor design and optimization. Role Overview: We're seeking a motivated Machine Learning Researcher with a strong background in machine learning, AI, or related fields. You’ll contribute to innovative projects in areas such as large language models (LLMs), reinforcement learning, and optimization for chip design and AI system integration. Responsibilities: Conduct and publish cutting-edge AI/ML research Design algorithms for chip optimization and intelligent systems Collaborate with engineering teams to integrate AI into real-world tools Stay current on AI trends and contribute to open research Requirements: PhD or equivalent experience in ML, AI, CS … physics, or mathematics Strong publication record (NeurIPS, ICML, ICLR, etc.) Proficient in Python, C++, and deep learning frameworks (e.g., PyTorch, TensorFlow) Solid grasp of ML techniques; independent and team-oriented mindse Preferred: Experience in LLMs, reinforcement learning, or chip design Familiarity with JAX and optimization frameworks Why Join Us: Work on impactful AI research with real-world More ❯
Posted:

Artificial Intelligence Engineer

slough, south east england, united kingdom
Searchability®
IR35 To apply, email: jordanna.ramsey@searchability.com THE OPPORTUNITY A new contract opportunity for an AI Engineer to join a cutting-edge AI Sports company revolutionising how data and machine learning drive athletic performance, fan engagement, and predictive analytics in the sports industry. You’ll be part of a highly skilled R&D team building next-generation AI solutions for … world data into actionable insights. Optimise AI systems for real-time environments, integrating with live data feeds and cloud infrastructure. Research and prototype cutting-edge AI techniques (e.g., deep learning, reinforcement learning, generative models). Support continuous model improvement and scalable MLOps deployment pipelines. TECH STACK/REQUIREMENTS Core Skills: Python, TensorFlow/PyTorch, scikit-learn, OpenCV … wearable/sensor data, player tracking, or sports video analytics TO BE CONSIDERED... Please apply directly by emailing jordanna.ramsey@searchability.com with your CV and availability. KEYWORDS: AI Engineer, Machine Learning Engineer, Sports Analytics, Computer Vision, Deep Learning, Python, TensorFlow, PyTorch, MLOps, Data Science, Predictive Modelling, Sports Tech, AI in Sports More ❯
Posted:

Lead AI & Data Science

slough, south east england, united kingdom
Dar
at www.dar.com. Our Vision and Values: We aspire to be the chosen home of those with a gift for crafting solutions that empower people and an unwavering passion for learning and innovation. Our core values shape our culture and guide our decision-making. We are committed to: Excellence Responsibility Empowerment Connectivity Courage Role Overview We are seeking a seasoned … technical oversight for AI initiatives across domains: Generative AI & LLMs (fine-tuning, RAG pipelines, multi-agent systems). Predictive Analytics & Time-Series Modeling . Computer Vision & Multimodal AI . Reinforcement Learning & Optimization . Knowledge Engineering & Semantic Search . Edge AI & Real-Time AI Deployments . Act as the architect and reviewer of AI systems, ensuring scalability, robustness, and … class AI & Data Science team , including hiring, onboarding, and performance management . Mentor and coach team members to elevate technical depth and problem-solving skills. Create career development plans, learning paths, and certification opportunities for the team. Foster a culture of collaboration, experimentation, and continuous improvement . Collaboration & Representation Work closely with Product Managers, Solution Architects, and Engineering Leads More ❯
Posted:

Machine Learning Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Experis UK
Job Title: Machine Learning Engineer Location: London, UK (Hybrid – 2–3 days onsite per week) Contract Type: Contract Duration: 6–12 months (possibility of extension) Start Date: ASAP Overview We are seeking an experienced Machine Learning Engineer to join our data science and AI engineering team on a contract basis in London. The ideal candidate will be responsible … for designing, developing, and deploying machine learning models and scalable data pipelines that support advanced analytics and intelligent automation initiatives. This role offers a hybrid work arrangement , combining flexibility with collaboration, and is ideal for a contractor who thrives in fast-paced, data-driven environments. Key Responsibilities Design, build, and deploy machine learning models and AI-driven solutions … environments. Stay current with emerging trends in AI/ML technologies and contribute to innovation within the organisation. Required Skills & Experience Proven experience (3–5+ years) as a Machine Learning Engineer , Data Scientist , or similar role. Strong programming skills in Python (experience with libraries such as TensorFlow, PyTorch, scikit-learn, pandas, NumPy). Solid understanding of machine learning More ❯
Posted:

Machine Learning Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Higher - AI recruitment
We are partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers . This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded in 2023, the company has already secured significant early … OpenAI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid) : London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using AI technology to solve complex … product team, to identify and capitalise on high-impact opportunities Keep the company at the forefront of innovation by exploring and applying the latest research Requirements: MSc in Machine Learning, Computer Science, or a related field 5+ years of experience in machine learning, with an expertise in modelling, prototyping, and evaluation Strong software engineering skills with a focus More ❯
Posted:

A.I Research Internship / Graduate Initial 1 Year FTC

Slough, Berkshire, South East, United Kingdom
Zorba Consulting
O ffice Based 5 Days Monday Friday Key skills: C omputer Science or A.I qualification or similar, Ideally Python, Machine Learning Concepts, Communication and Evaluation Skills. My Client is the rapidly growing European subsidiary of a global Film company who specialise in lighting, rigging, generators, etc. The also own film studies as well as facilitate studios for other organisations. … solutions to support operations. You will have the opportunity to gain comprehensive, hands-on experience in multiple areas of a multilocation, commercial business. This role will offer an insightful learning experience in a dynamic and fast-paced work environment. with exposure to real-world applications requiring artificial intelligence and automation. You will work closely with the IT department and … AI deployments. Produce technical documentation, including usage guides and risk assessment templates. Present findings and recommendations to stakeholders through structured viability reports. Recognise personal development needs and proactively seek learning opportunities to support growth in AI and automation. You Will Need To: Currently pursuing or recently completed a qualification in Computer Science, Artificial Intelligence, or a related field Exposure More ❯
Employment Type: Permanent
Posted:

Full Stack Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Higher - AI recruitment
Open AI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world More ❯
Posted:

Lead ML Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to own the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Lead our ‘AI Powered Education’ meet-up in London, building a network of ML specialists Essential skills 🙏 Experience leading and mentoring other ML More ❯
Posted:

Senior ML Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … looking for a Senior ML Engineer with a proven track record of delivering ML models to production. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in production Experience working with More ❯
Posted:

Computer Vision Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science More ❯
Posted:

Senior NLP Engineer (London)

slough, south east england, united kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone . We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the students needs. We’re … track record of delivering ML models to production, to join the ML team in our growing company. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Build fully-automated pipelines for dictionary building; including span identification, word sense distribution, and sense granularity decision Work with a … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Strong expertise in NLP Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in More ❯
Posted:

Member of Technical Staff

slough, south east england, united kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems . Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering More ❯
Posted:

AI Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Omnis Partners
and build strong partnerships with major tech vendors. 🛠️ What We Need Proven experience designing and deploying agentic AI systems in production Strong understanding of AI/ML techniques, including reinforcement learning, LLM agents, and simulations Proficient in Python and Infrastructure as Code for cloud environments Familiarity with AI orchestration platforms like LangChain and LangGraph, AutoGen, CrewAI, or similar More ❯
Posted:

Deep Learning Engineer

slough, south east england, united kingdom
Brio Digital
Deep Learning Engineer – Manipulation (Brio Digital, on behalf of our client) Brio Digital is partnered with a pioneering robotics client who are building advanced, scalable, and safe AI-driven systems designed to transform how humans and machines interact in the real world. Their first-generation platform is tackling labour automation challenges, enabling efficiency and safety across industrial use cases … and logistics. This is a unique opportunity to join an organisation at the frontier of applied AI and robotics, where you’ll be solving cutting-edge challenges in deep learning and embodied intelligence. The Role We’re looking for multiple Deep Learning Engineers (Manipulation) to join the team. This is a deep learning-focused position where you … training policies, curating data, leveraging synthetic datasets, and deploying real-time inference models. Robotics experience is not strictly required, but you must bring a strong track record in deep learning and the ability to adapt quickly to new domains. What You’ll Be Doing Train manipulation policies using representation learning, behaviour cloning, and reinforcement learning. Partner with More ❯
Posted:

AI SME

slough, south east england, united kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services. More ❯
Posted:

AI Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments. More ❯
Posted: