Reinforcement Learning Jobs in the Thames Valley

19 of 19 Reinforcement Learning Jobs in the Thames Valley

Founding Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
Explore Group
Founding Software Engineer (React and Python) About the Opportunity You will be joining an elite founding engineering team that includes the world’s number one authority in Multi Agent Reinforcement Learning, seasoned Meta Product leaders, and one of the only people in the UK who has trained a trillion token language model from scratch. This role is based … a pivotal role in shaping and scaling the technology platform from the ground up. You will work alongside leading researchers and product strategists at the intersection of cutting edge reinforcement learning and large scale foundational models. You will enjoy a rare opportunity to influence direction, architecture and go to market execution, with both immediate impact and long term … product strategists to refine MVPs, prototype innovative workflows and accelerate iterations to production • Scale and optimise infrastructure to support high throughput training, inference and deployment of both multi agent reinforcement learning systems and enormous language models • Implement robust MLOps best practices including data pipelines, model versioning, monitoring, automated deployment and continuous integration • Drive end to end ownership from …

Artificial Intelligence Researcher

Slough, South East England, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
or London, UK This is a permanent position with candidates required to do hybrid working in either Cambridge or London. Our client are looking for AI Researchers specialising in Reinforcement Learning from Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring … LLMs, and generative modelling, helping us build the next generation of foundation models Responsibilities: Develop and refine RLHF algorithms for large language and generative models. Research and implement deep reinforcement learning methods (policy gradients, actor-critic, off-policy learning) for model alignment. Train, fine-tune, and evaluate LLMs and diffusion models at scale. Design experiments to align … and human feedback teams to build scalable alignment pipelines. Publish findings in top-tier AI conferences and contribute to open-source frameworks. Key Requirements: PhD in Computer Science, Machine Learning, or related field. Publications at NeurIPS, ICML, ICLR, ACL, or related venues. Deep expertise in Reinforcement Learning (policy optimisation, reward modelling, RLHF). Hands-on experience training …

Machine Learning Engineer (PhD)

Slough, South East England, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
leader in advanced computing, with a dedicated research team focused on applying artificial intelligence to next-generation semiconductor design and optimization. Role Overview: We're seeking a motivated Machine Learning Researcher with a strong background in machine learning, AI, or related fields. You’ll contribute to innovative projects in areas such as large language models (LLMs), reinforcement learning, and optimization for chip design and AI system integration. Responsibilities: Conduct and publish cutting-edge AI/ML research Design algorithms for chip optimization and intelligent systems Collaborate with engineering teams to integrate AI into real-world tools Stay current on AI trends and contribute to open research Requirements: PhD or equivalent experience in ML, AI, CS … physics, or mathematics Strong publication record (NeurIPS, ICML, ICLR, etc.) Proficient in Python, C++, and deep learning frameworks (e.g., PyTorch, TensorFlow) Solid grasp of ML techniques; independent and team-oriented mindset Preferred: Experience in LLMs, reinforcement learning, or chip design Familiarity with JAX and optimization frameworks Why Join Us: Work on impactful AI research with real-world …

Artificial Intelligence Engineer

Slough, South East England, United Kingdom
Searchability®
IR35 To apply, email: jordanna.ramsey@searchability.com THE OPPORTUNITY A new contract opportunity for an AI Engineer to join a cutting-edge AI Sports company revolutionising how data and machine learning drive athletic performance, fan engagement, and predictive analytics in the sports industry. You’ll be part of a highly skilled R&D team building next-generation AI solutions for … world data into actionable insights. Optimise AI systems for real-time environments, integrating with live data feeds and cloud infrastructure. Research and prototype cutting-edge AI techniques (e.g., deep learning, reinforcement learning, generative models). Support continuous model improvement and scalable MLOps deployment pipelines. TECH STACK/REQUIREMENTS Core Skills: Python, TensorFlow/PyTorch, scikit-learn, OpenCV … wearable/sensor data, player tracking, or sports video analytics TO BE CONSIDERED... Please apply directly by emailing jordanna.ramsey@searchability.com with your CV and availability. KEYWORDS: AI Engineer, Machine Learning Engineer, Sports Analytics, Computer Vision, Deep Learning, Python, TensorFlow, PyTorch, MLOps, Data Science, Predictive Modelling, Sports Tech, AI in Sports

Lead AI & Data Science

Slough, South East England, United Kingdom
Dar
at www.dar.com. Our Vision and Values: We aspire to be the chosen home of those with a gift for crafting solutions that empower people and an unwavering passion for learning and innovation. Our core values shape our culture and guide our decision-making. We are committed to: Excellence Responsibility Empowerment Connectivity Courage Role Overview We are seeking a seasoned … technical oversight for AI initiatives across domains: Generative AI & LLMs (fine-tuning, RAG pipelines, multi-agent systems). Predictive Analytics & Time-Series Modeling. Computer Vision & Multimodal AI. Reinforcement Learning & Optimization. Knowledge Engineering & Semantic Search. Edge AI & Real-Time AI Deployments. Act as the architect and reviewer of AI systems, ensuring scalability, robustness, and … class AI & Data Science team, including hiring, onboarding, and performance management. Mentor and coach team members to elevate technical depth and problem-solving skills. Create career development plans, learning paths, and certification opportunities for the team. Foster a culture of collaboration, experimentation, and continuous improvement. Collaboration & Representation Work closely with Product Managers, Solution Architects, and Engineering Leads …

Chief Data Scientist

Slough, South East England, United Kingdom
Xcede
hiring a Chief Data Scientist to take things to the next level. This person will lead technical direction, guide a growing team, and remain closely involved in delivering machine learning systems that power real-world impact across the platform. You’ll shape the roadmap, lead from the front, and play a key role in embedding intelligence into every aspect … solutions focused on forecasting, optimisation, and real-time performance enhancement Build the infrastructure needed to support experimentation, training, and deployment of production-grade models Evaluate new modelling approaches including reinforcement learning, multimodal architectures, and agentic methods Work closely with Product, Engineering, and Commercial teams to ensure AI delivers measurable results Act as the internal champion for intelligence, influencing … TensorFlow, or Hugging Face Good understanding of infrastructure and deployment, ideally in cloud environments such as AWS Exposure to areas such as time-series modelling, optimisation, computer vision, or reinforcement learning Confident working across technical and non-technical teams and communicating complex ideas clearly Motivated by impact, scalability, and helping an organisation make intelligence a core capability If …

Head of Data Science

Slough, South East England, United Kingdom
Xcede
of AI/Data Science to take things to the next level. This person will lead technical direction, guide a growing team, and remain closely involved in delivering machine learning systems that power real-world impact across the platform. You’ll shape the roadmap, lead from the front, and play a key role in embedding intelligence into every aspect … solutions focused on forecasting, optimisation, and real-time performance enhancement Build the infrastructure needed to support experimentation, training, and deployment of production-grade models Evaluate new modelling approaches including reinforcement learning, multimodal architectures, and agentic methods Work closely with Product, Engineering, and Commercial teams to ensure AI delivers measurable results Act as the internal champion for intelligence, influencing … TensorFlow, or Hugging Face Good understanding of infrastructure and deployment, ideally in cloud environments such as AWS Exposure to areas such as time-series modelling, optimisation, computer vision, or reinforcement learning Confident working across technical and non-technical teams and communicating complex ideas clearly Motivated by impact, scalability, and helping an organisation make intelligence a core capability If …

Machine Learning Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
Experis UK
Job Title: Machine Learning Engineer Location: London, UK (Hybrid – 2–3 days onsite per week) Contract Type: Contract Duration: 6–12 months (possibility of extension) Start Date: ASAP Overview We are seeking an experienced Machine Learning Engineer to join our data science and AI engineering team on a contract basis in London. The ideal candidate will be responsible … for designing, developing, and deploying machine learning models and scalable data pipelines that support advanced analytics and intelligent automation initiatives. This role offers a hybrid work arrangement, combining flexibility with collaboration, and is ideal for a contractor who thrives in fast-paced, data-driven environments. Key Responsibilities Design, build, and deploy machine learning models and AI-driven solutions … environments. Stay current with emerging trends in AI/ML technologies and contribute to innovation within the organisation. Required Skills & Experience Proven experience (3–5+ years) as a Machine Learning Engineer, Data Scientist, or similar role. Strong programming skills in Python (experience with libraries such as TensorFlow, PyTorch, scikit-learn, pandas, NumPy). Solid understanding of machine learning …

Machine Learning Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
We are partnering with an early-stage, mission-driven company at the intersection of AI and national defence to appoint exceptional Machine Learning Engineers. This fast-growing organisation is transforming mission-critical combat planning and operational decision-making by building next-generation AI software tools for Western forces. Founded in 2023, the company has already secured significant early … OpenAI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, GenAI, Computer Vision, and Reinforcement Learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Machine Learning Engineers who are passionate about using AI technology to solve complex … product team, to identify and capitalise on high-impact opportunities Keep the company at the forefront of innovation by exploring and applying the latest research Requirements: MSc in Machine Learning, Computer Science, or a related field 5+ years of experience in machine learning, with expertise in modelling, prototyping, and evaluation Strong software engineering skills with a focus …

A.I Research Internship / Graduate Initial 1 Year FTC

Slough, Berkshire, South East, United Kingdom
Zorba Consulting
Office Based 5 Days Monday to Friday Key skills: Computer Science or A.I qualification or similar, Ideally Python, Machine Learning Concepts, Communication and Evaluation Skills. My Client is the rapidly growing European subsidiary of a global Film company who specialise in lighting, rigging, generators, etc. They also own film studios as well as facilitate studios for other organisations. … solutions to support operations. You will have the opportunity to gain comprehensive, hands-on experience in multiple areas of a multilocation, commercial business. This role will offer an insightful learning experience in a dynamic and fast-paced work environment, with exposure to real-world applications requiring artificial intelligence and automation. You will work closely with the IT department and … AI deployments. Produce technical documentation, including usage guides and risk assessment templates. Present findings and recommendations to stakeholders through structured viability reports. Recognise personal development needs and proactively seek learning opportunities to support growth in AI and automation. You Will Need To: Currently pursuing or recently completed a qualification in Computer Science, Artificial Intelligence, or a related field Exposure …
Employment Type: Permanent

Lead AI Engineer

Slough, South East England, United Kingdom
Oho Group Ltd
Lead AI Engineer | Deep Tech Start-Up | London | Cutting-Edge Machine Learning & Autonomous Systems Join one of the UK’s most exciting deep tech start-ups, backed by leaders from Meta, Google, and OpenAI. We’re building frontier AI and autonomous systems that redefine intelligent defence. The Opportunity As Lead AI Engineer, you’ll shape architecture, research direction … and real-world deployment of cutting-edge machine learning systems. You’ll lead a world-class team, blending research excellence with practical engineering impact. About You MSc or PhD in AI, Machine Learning, Robotics, or related field Proven experience leading AI/ML teams and shipping production solutions Strong in Python, C++, or Rust, with ML frameworks (PyTorch … TensorFlow, JAX) Background in distributed systems, reinforcement learning, or autonomous agents Collaborative leader who thrives on complex technical challenges What You’ll Do Design and deploy scalable AI-driven systems Drive technical strategy, infrastructure, and innovation Collaborate with leading researchers from top global AI labs Benefits Competitive salary + meaningful equity Private health & dental cover Work with world …

Full Stack Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
Higher - AI recruitment
OpenAI leaders. They combine deep sector knowledge with cutting-edge design and software engineering expertise to deliver state-of-the-art SaaS capabilities leveraging NLP, Generative AI, and reinforcement learning technologies. Position location (hybrid): London (Shoreditch) or Paris (Le Marais) We are seeking Full Stack Engineers who are passionate about using technology to solve complex, real-world …

Senior ML Engineer (London)

Slough, South East England, United Kingdom
Glite Tech
a mobile application for teaching English to intermediate and advanced learners. We’re on the verge of solving one of the biggest challenges in education – making high-quality, personalised learning accessible to everyone. We are building a fundamental model for education - one that can accurately predict student knowledge and orchestrate lessons, adapting to the student’s needs. We’re … looking for a Senior ML Engineer with a proven track record of delivering ML models to production. What you will do 🚀 Build fundamental models for education - solving the ultimate learning task of predicting student knowledge and optimal ‘next task’ Work with a vast amount of unique data - we have data from over 1M language tests, including text and voice … managers Create new types of tests for language learners to gather more test results, analyse them, and build prediction models based on these results Optimise and fine-tune machine learning models for performance, scalability, and accuracy Essential skills 🙏 Complete end-to-end experience - from finding and cleaning data all the way to monitoring models in production Experience working with …

Computer Vision Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
microTECH Global LTD
Job Title: 3D Computer Vision Engineer Location: London, UK (Hybrid Working) Type: Permanent Key Responsibilities: Lead development of machine learning systems for digital human avatar generation using inverse rendering and Unity-based visualization. Design and implement mesh and texture warping algorithms (energy-based methods, Thin Plate Splines). Develop inverse rasterization systems and facial feature extraction algorithms using PyTorch. … Maps. Design Unity and Unreal Engine plugins for real-time physics, rendering, and visualization. Develop modules for 3D reconstruction, mesh processing, and camera calibration using C++ and Python. Implement reinforcement learning systems for animation synthesis and optimization-based cloth simulation. Build OpenGL/Compute shaders and GPU kernels for performance-critical 3D applications. Requirements: Degree in Computer Science …

Founding Software Engineer

Slough, South East England, United Kingdom
SEEKR
high volume and varied data structures without sacrificing reliability or customizability. Speed up client onboarding and knowledge by building frameworks that automate agent training and incorporate conversational elements using reinforcement learning to continually refine performance. Does this sound like you? Love tackling tough systems problems with deep comp sci fundamentals (distributed systems, data flows, event-driven architectures) Have …

Member of Technical Staff

Slough, South East England, United Kingdom
Cubiq Recruitment
founders on architecture, strategy, and product roadmap. Contribute to a high-performance, low-ego engineering culture focused on shipping. What We’re Looking For Deep experience in Applied Machine Learning and Agentic AI systems. Proficiency in modern ML stacks (Python, PyTorch, JAX, Ray, etc.) and production deployment. Proven ability to move fast, ship code, and bridge research with … plus. A “builder” mindset; you’re happiest when ideas turn into working systems. Key Experience: Agentic System Design LLM Engineering/Foundation Models Planning and Reasoning Scalable ML Infrastructure Reinforcement Learning (esp. RLHF/RLAIF) Simulation or feedback-driven adaptation Interview Process Initial Chat – Conversation with a Founder Technical Round 1 – Agentic System Design Technical Round 2 – Engineering …

New Trading Team's 1st C++ Quant Developer | HFT

Slough, South East England, United Kingdom
Augmentti
optimizing system performance for high-frequency, low-latency trading. Work with a Humble Leader: You’ll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who’s not only mastered complex trading strategies but is also focused on …

AI SME

Slough, South East England, United Kingdom
Lorien
non-technical stakeholders Experience Proven experience delivering AI solutions that improve operational efficiency, member engagement, or compliance Experience working with pension administration systems Strong understanding of supervised, unsupervised, and reinforcement learning. Good experience of dealing with DB, hybrid and DC occupational pension schemes Experience with AWS, Azure, or Google Cloud AI services.

AI Engineer

Slough, South East England, United Kingdom
Hybrid / WFH Options
Amber Labs
the ethical and safety considerations in deploying AI systems. Desirable: Experience with agentic AI frameworks (e.g. AutoGPT, CrewAI, LangGraph, ReAct). Background in conversational design, human-AI interaction, or reinforcement learning. Familiarity with RAG, vector databases, or simulation environments.