1 to 25 of 77 Remote Reinforcement Learning Jobs in London

Temporary Staff - AI/ML/Modeling & Software Engineers

London, England, United Kingdom
Hybrid / WFH Options

NATO

an environment that is positive, inclusive, and collaborative. 2. OVERVIEW OF THE ROLE We are seeking talented and versatile engineers with experience in software development, Artificial Intelligence (AI), Machine Learning (ML), and Large Language Models (LLMs) for short-term (3-month) renewable contracts. In this role, you will be instrumental in designing, developing, deploying, and maintaining both core software … artificial intelligence, with the flexibility of short-term engagements. Key Responsibilities: • Design, develop, test, deploy, and maintain robust and scalable software applications and infrastructure. • Research, implement, and evaluate machine learning models (including deep learning and potentially reinforcement learning) to address specific business challenges. • Work with Large Language Models (LLMs): fine-tuning, prompt engineering, developing Retrieval-Augmented … or similar. • Strong proficiency in Python and its scientific computing/ML ecosystem (e.g., NumPy, Pandas, TensorFlow, AI Foundry, OpenAI, Scikit-learn etc.). • Solid understanding of core machine learning concepts • Hands-on experience with at least one major deep learning framework (e.g., TensorFlow, PyTorch, Keras). • Proven understanding of software engineering principles, data structures, algorithms, and design More ❯

Posted: 2 days ago

Senior Data Analyst at The Hawker’s Club

London, England, United Kingdom
Hybrid / WFH Options

HipHopTune Media

Strong proficiency in programming languages such as Python or R. Experience with SQL and familiarity with database management practices to handle and query large datasets efficiently. Knowledge of machine learning frameworks like TensorFlow or PyTorch and libraries such as scikit-learn. Proficiency in using data analysis and visualization tools to interpret data and present insights effectively. Proven experience as … As the Ideal Candidate, you should have: Analytical Mindset, Curiosity, and Inquisitiveness. Problem-Solving Skills. Attention to Detail. Adaptability and Communication Skills. Patience and Persistence. Experience with advanced machine learning techniques such as deep learning, natural language processing (NLP), or reinforcement learning. Experience in using advanced features of cloud platforms (AWS, Azure, Google Cloud) such as machine … learning services and automated data pipeline tools. Familiarity with big data frameworks like Hadoop or Spark is beneficial. Skills in advanced data visualization tools and software beyond basic reporting—such as Tableau, Power BI, or even more sophisticated interactive web visualization frameworks like D3.js. Benefits 15 Days paid off + local/US Bank holidays. Up to 10% of More ❯

Posted: 2 days ago

Machine Learning/Generative AI Consultant

London, England, United Kingdom
Hybrid / WFH Options

Infosys Consulting

Infosys Consulting is a leading innovator in the tech industry, specializing in artificial intelligence and machine learning solutions. We work with a diverse range of clients to deliver cutting-edge AI technologies that drive business growth and innovation. We are looking for a talented Machine Learning/… and help us build the future of AI. Key Responsibilities: Collaborate with clients to understand their business needs and translate them into technical requirements Design, develop, and deploy machine learning and generative AI models, including but not limited to NLP models, computer vision models, and reinforcement learning systems Conduct research and stay updated on the latest advancements … in AI and machine learning technologies, particularly in generative models such as GANs, VAEs, and diffusion models Analyze large datasets to uncover patterns, insights, and opportunities for machine learning applications Optimize and fine-tune models for performance, scalability, and accuracy Build and maintain scalable data pipelines for model training and deployment Provide technical guidance and mentorship to junior More ❯

Posted: 2 days ago

Senior Director, Gen AI

London, England, United Kingdom
Hybrid / WFH Options

Paysafe

visionary with a passion for shaping the future of Generative AI? We're seeking a Senior Director, Gen AI to lead the development of advanced AI agents, leveraging LLMs, reinforcement learning, vector databases, and autonomous systems. As a key AI strategy leader, you'll define the next-generation consumer experiences, enabling real-time merchant insights, AI-powered … St Paul's, Bank, Cannon Street, City Thameslink, Liverpool Street, Farringdon, Mansion House. The impact you will have: AI Agent Development: Design and optimize AI agents capable of autonomous learning, decision-making, and task execution. LLM Fine-Tuning & Enhancement: Adapt foundational models like GPT, Llama, integrating retrieval-augmented generation (RAG), personalization, and continuous improvement loops. Multi-Agent Systems: Implement … to enhance AI contextual understanding and improve user experiences. End-to-End AI Automation: Build architectures to integrate AI with APIs, databases, and external applications for seamless automation. Machine Learning Pipeline Optimization: Define best practices for AI experimentation, model deployment, and cloud-scale AI processing. Gen AI for Consumer-Facing Products: Elevate chatbot capabilities with consumer insights and conversational More ❯

Posted: Yesterday

Experienced CRM Data Scientist CRM Data Science · UK · Hybrid Remote

London, England, United Kingdom
Hybrid / WFH Options

EveryMatrix Ltd

Develop predictive models to forecast customer activities and trends. Collaborate with cross-functional teams to implement data-driven strategies. Define and track key performance indicators (KPIs). Utilize machine learning techniques to optimize CRM strategies, especially related to automated campaign optimization. Communicate findings and recommendations to stakeholders in a clear and concise manner. Requirements: Master's or PhD degree …/B tests. Familiarity with Bayesian statistics, especially for hypothesis testing. Extra points for good knowledge of Bayesian programming (PyMC3, etc). Experience with application of optimization theory or reinforcement learning based on automated A/B testing. Projects you might expect to work on: Predicting future expected value and future potential value of individual customers. Real-time … work from home per quarter (during your first 3 months, you'll be in the office full-time). Benefits & Perks: Monthly meal allowance. Gym allowance. Access online learning platforms like Udemy for Business and LinkedIn Learning, and a budget for external training. Support for New Parents: - 52 weeks of maternity leave, with 21 weeks being fully paid. More ❯

Posted: 2 days ago

Experienced CRM Data Scientist

London, England, United Kingdom
Hybrid / WFH Options

JR United Kingdom

Develop predictive models to forecast customer activities and trends. • Collaborate with cross-functional teams to implement data-driven strategies. • Define and track key performance indicators (KPIs). • Utilize machine learning techniques to optimize CRM strategies, especially related to automated campaign optimization. • Communicate findings and recommendations to stakeholders in a clear and concise manner. Requirements: • Master's or PhD degree …/B tests. • Familiarity with Bayesian statistics, especially for hypothesis testing. Extra points for good knowledge of Bayesian programming (PyMC3, etc). • Experience with application of optimization theory or reinforcement learning based on automated A/B testing. Projects you might expect to work on: • Predicting future expected value and future potential value of individual customers. • Real-time … work from home per quarter (during your first 3 months, you'll be in the office full-time). Benefits & Perks: • Monthly meal allowance. • Gym allowance. • Access online learning platforms like Udemy for Business and LinkedIn Learning, and a budget for external training. • Support for New Parents: - 52 weeks of maternity leave, with 21 weeks being fully paid. More ❯

Posted: 2 days ago

Experienced CRM Data Scientist

City of London, England, United Kingdom
Hybrid / WFH Options

JR United Kingdom

Posted: 4 days ago

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options

InstaDeep Ltd

be a part of the AI revolution! About DeepPCB: DeepPCB is InstaDeep's AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a Machine Learning Engineer to … join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB design problems, working closely with researchers and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning … technical depth based on the audience. Participate in technical discussions, design reviews, and customer-facing activities when required. Requirements: B.Sc., M.Sc., or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related technical field. 2-5 years of professional experience in applied machine learning or engineering roles. Strong expertise in Machine Learning and Deep More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 16 days ago

Machine Learning Engineer (London)

London, UK
Hybrid / WFH Options

InstaDeep Ltd

be a part of the AI revolution! About DeepPCB: DeepPCB is InstaDeep’s AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a Machine Learning Engineer to … join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB design problems, working closely with researchers and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning … technical depth based on the audience. Participate in technical discussions, design reviews, and customer-facing activities when required. Requirements: B.Sc., M.Sc., or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related technical field. 2–5 years of professional experience in applied machine learning or engineering roles. Strong expertise in Machine Learning and Deep More ❯

Employment Type: Full-time

Posted: Yesterday

Senior Machine Learning Engineer

London, England, United Kingdom
Hybrid / WFH Options

InstaDeep Ltd

be a part of the AI revolution! About DeepPCB: DeepPCB is InstaDeep’s AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a Machine Learning Engineer to … join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB design problems, working closely with researchers and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning … technical depth based on the audience. Participate in technical discussions, design reviews, and customer-facing activities when required. Requirements: B.Sc., M.Sc., or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related technical field. 5 years of professional experience in applied machine learning or engineering roles. Strong expertise in Machine Learning and Deep Learning More ❯

Posted: 2 days ago

Data Scientist

London, England, United Kingdom
Hybrid / WFH Options

Haleon

Working Opportunity TELECOMMUTE Data Scientist About The Role This is an exciting opportunity to join our Global AI & Data Science team. AI, Data Science and Machine Learning are a critical part of our strategy, and we are investing to build best-in-class data science capabilities so our AI solutions deliver impact and meet our … concepts to both technical and non-technical audiences to facilitate feasibility assessment, prioritisation and adoption. Have a scientific mindset and an understanding of statistical modelling and the mathematical foundations of Machine Learning algorithms Proficient in Python Have a deep knowledge of a sufficiently broad area of technical specialism (e.g. Time Series, Combinatorial Optimisation, Reinforcement Learning, Bayesian Statistics, NLP etc. … science frameworks and toolkits (SciPy, Scikit-Learn, Keras, PyTorch, etc.) Experience with Git, Databricks and Microsoft Azure technical stack Prior exposure to Agile methodologies Experience building large scale machine learning systems Master's Degree in Computer Science, Data Science, Mathematics, Physics or equivalent Care to join us. Find out what life at Haleon is really like www.haleon.com/careers More ❯

Posted: 3 days ago

Senior Data Scientist

London, England, United Kingdom
Hybrid / WFH Options

Faculty

roundtables, or by contributing to large-scale open-source projects. You will also have the opportunity to teach on the fellowship about topics that range from basic statistics to reinforcement learning, and to mentor the fellows through their 6-week project. Thanks to Faculty platform, you will have access to powerful computational resources, and you will enjoy the … become a fluent Python programmer in a short timeframe An excellent command of the basic libraries for data science (e.g. NumPy, Pandas, Scikit-Learn) and familiarity with a deep-learning framework (e.g. TensorFlow, PyTorch, Caffe) A high level of mathematical competence and proficiency in statistics A solid grasp of essentially all of the standard data science techniques, for example … supervised/unsupervised machine learning, model cross validation, Bayesian inference, time-series analysis, simple NLP, effective SQL database querying, or using/writing simple APIs for models. We regard the ability to develop new algorithms when an innovative solution is needed as a fundamental skill A leadership mindset focussed on growing the technical capabilities of the team; a caring More ❯

Posted: 2 days ago

Senior Machine Learning Engineer

London, England, United Kingdom
Hybrid / WFH Options

Senior Machine Learning Engineer Department: Data Science & Intelligence Employment Type: Permanent - Full Time Location: London, UK Description Who we are Tripledot Studios is on track to become one of the largest independent mobile games companies in the world. We are a multi-award-winning organisation, and following our recent acquisition announcement, we’re preparing to grow into a global … games. Our ambition is to become the most successful games company in the world, and we’re just getting started. About the Role Role Overview As a Senior Machine Learning Engineer , you'll significantly advance our machine learning, directly impacting high-profile mobile games. You'll research, prototype, develop, and deploy ML models to optimize in-game experiences … and Kubernetes. Experience with cloud platforms (AWS/Google Cloud) for scalable model deployment. Ability to work independently and manage end-to-end ML projects. Experience with recommendation systems, reinforcement learning, or multi-armed bandits highly desirable. Comfortable collaborating to translate business needs into technical solutions. Strong problem-solving skills with a proactive and curious approach to data More ❯

Posted: 2 days ago

Research Scientist / Research Engineer, Pre-training

London, United Kingdom
Hybrid / WFH Options

Menlo Ventures

tooling to enhance team productivity Contribute to the entire stack, from low-level optimizations to high-level model design Qualifications: Advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field Strong software engineering skills with a proven track record of building complex systems Expertise in Python and experience with deep learning frameworks (PyTorch preferred … Familiarity with large-scale machine learning, particularly in the context of language models Ability to balance research goals with practical engineering constraints Strong problem-solving skills and a results-oriented mindset Excellent communication skills and ability to work in a collaborative environment Care about the societal impacts of your work Preferred Experience: Work on high-performance, large-scale ML … systems Familiarity with GPUs, Kubernetes, and OS internals Experience with language modeling using transformer architectures Knowledge of reinforcement learning techniques Background in large-scale ETL processes You'll thrive in this role if you: Have significant software engineering experience Are results-oriented with a bias towards flexibility and impact Willingly take on tasks outside your job description to More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 12 days ago

Research Engineer

London Area, United Kingdom
Hybrid / WFH Options

Morson Talent

Design (Full-Time | Hybrid | London) About DeepPCB DeepPCB is My Client’s cutting-edge AI-powered Place & Route design tool for Printed Circuit Boards (PCBs). By combining deep reinforcement learning with high-performance computing, DeepPCB automates and accelerates PCB layout workflows—helping hardware teams around the world design faster, smarter, and more efficiently. Learn more at deeppcb.ai … to join the DeepPCB team and push the boundaries of artificial intelligence in electronic design automation (EDA). In this role, you'll design, build, and scale advanced machine learning systems—solving complex real-world PCB design challenges and contributing to the future of intelligent hardware engineering. Key Responsibilities Research and implement scalable deep learning and reinforcement learning algorithms tailored for PCB place-and-route problems. Adapt ML algorithms for performance in distributed and GPU-accelerated environments. Build production-ready ML solutions and prototypes that directly integrate into the DeepPCB platform. Communicate research progress clearly to both technical and non-technical audiences. Collaborate across research, engineering, product, and business teams to drive innovation. Represent DeepPCB More ❯

Posted: Today

Research Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Morson Talent

Posted: Today

AI Research Residency

London Area, United Kingdom
Hybrid / WFH Options

MediaTek

pursuit. Program Highlights Mentorship: Collaborate with experienced researchers and engineers on innovative AI projects Research Focus: Engage in areas such as large language models (LLMs), optimization methods for deep learning, reinforcement learning (RL), and generative models Professional Development: Gain hands-on experience in AI research with real-world applications, contributing to both academic publications and product development … leading universities and contribute to the broader scientific community Work Arrangement: Benefit from a hybrid work model, combining remote and on-site collaboration Responsibilities Conduct innovative research in machine learning and artificial intelligence Develop and implement algorithms Collaborate with cross-functional teams to integrate AI solutions Publish research findings in top-tier conferences and journals Stay up to date … with the latest advancements in AI and related fields Qualifications Required: PhD in Machine Learning, Artificial Intelligence, Mathematics, Computer Science, Physics, or a related field Proficiency in programming languages such as Python, C++, or similar Strong problem-solving skills and the ability to work both independently and collaboratively Excellent communication skills, with the ability to present complex ideas clearly More ❯

Posted: 4 days ago

AI Research Residency

City of London, London, United Kingdom
Hybrid / WFH Options

MediaTek

Posted: 4 days ago

Applied Scientist

London, England, United Kingdom
Hybrid / WFH Options

Wayve

paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment … research expertise in AI applied to autonomous driving or similar robotics or decision making domain, inclusive, but not limited to the following specific areas: Model-free and model-based reinforcement learning Offline reinforcement learning Planning with learned models, model predictive control and tree search Imitation learning, inverse reinforcement learning and causal inference Learned … large, rich, and diverse sources of real-world driving data How to architect our models to best employ the latest advances in foundation models, transformers, world models, etc. Which learning algorithms to use (e.g. reinforcement learning, behavioural cloning) How to leverage simulation for controlled experimental insight, training data augmentation, and re-simulation How to scale models efficiently More ❯

Posted: Today

Quant Developer

City Of London, England, United Kingdom
Hybrid / WFH Options

Glocomms

tooling, and models that will drive alpha generation and risk management for years to come. As a Quant Developer, you will work at the intersection of quantitative research, machine learning, and software engineering. You'll collaborate with quants, data scientists, and portfolio managers to design and implement scalable systems for data ingestion, model training, and real-time signal deployment. … to architectural decisions and technology stack selection for the greenfield platform. Required Skills & Experience Strong programming skills in Python, with experience in production-grade systems. Solid understanding of machine learning workflows, including model training, validation, and deployment. Experience with quantitative finance, including time series analysis, alpha modelling, or risk analytics. Familiarity with cloud infrastructure (e.g., AWS, GCP) and containerisation … engineering tools Experience working in fast-paced, collaborative environments with agile methodologies. Nice to Have Prior experience in a hedge fund, prop trading firm, or investment bank. Exposure to reinforcement learning, deep learning, or LLMs in financial contexts. Knowledge of market microstructure and execution algorithms. Contract Details Duration: 12 months Start Date: July 2025 Location: London (Hybrid More ❯

Posted: 2 days ago

Senior Data Scientist

London, England, United Kingdom
Hybrid / WFH Options

Sprout.ai LTD

from you. Apply now and help shape the future of claims automation. Responsibilities Develop features for our state-of-the-art claims automation platform Research, build and deploy machine learning algorithms and models to production within product teams Provide technical guidance and input on the design and implementation of machine learning algorithms Support with customer PoVs and onboarding … product requirements and help translate these into technical solutions Execute and deliver full AI/ML solutions from sourcing training data, design and implementing state-of-the-art machine learning models, testing, benchmark and product-driven research for model performance improvement, to shipping stable, tested, performant code in an agile environment. Work closely with Product Managers to help shape … Computer Vision or LLM projects into production Strong understanding of software development fundamentals, in particular deploying models to production and how to set up pipelines. Demonstrate expertise in deep learning for computer vision, natural language processing, reinforcement learning etc. Displays in depth knowledge in machine learning best practices, scalable training and deployment, model introspection and evaluation More ❯

Posted: 2 days ago

Principal Data Scientist

London, England, United Kingdom
Hybrid / WFH Options

Sky Ireland Limited

Predictive Modelling, Optimisation, Clustering or NLP Experience building and deploying advanced analytics solutions in a large scale (preferably B2C) cloud environment Experience in deploying commercially viable applications using deep learning techniques combining structured and unstructured data is highly desirable Ability to quickly understand a business objective, problem solving to create an analytical solution and develop a solution alongside stakeholders … platforms (essential) Programming for modelling and/or data analysis e.g. Python, Tensorflow (essential) Database experience, preferably SQL (essential) Expertise in cutting-edge AI methodologies, including Generative AI and Reinforcement Learning Machine learning - Supervised/unsupervised learning, regression, decision trees, random forests, boosting, clustering (essential) The rewards There's one thing people can't stop talking More ❯

Posted: Today

AI Agent Engineer - Consultant/Snr Consultant level

London, England, United Kingdom
Hybrid / WFH Options

Infosys

Expiry Date: 13.08.2025 Job Description: Infosys Consulting is at the forefront of applied AI innovation, delivering real-world business value through the convergence of AI agents, machine learning, and modern enterprise architecture. As part of our growing Enterprise AI consulting practice, we are looking for technically hands-on professionals to design and deliver client-centric intelligent systems … communication and consulting skills, with the ability to present solutions to both technical and non-technical stakeholders. Preferred Skills · Hands-on exposure to cognitive architectures, planning-based agents, or reinforcement learning in real-world deployments. · Experience integrating AI agents into enterprise apps like Salesforce, ServiceNow, SAP, or custom apps via APIs. · Understanding of AI observability, performance monitoring, and More ❯

Posted: Yesterday

AI Agent Engineer - Consultant/Snr Consultant level

London, England, United Kingdom
Hybrid / WFH Options

Infosys Consulting - Europe

Infosys Consulting is at the forefront of applied AI innovation, delivering real-world business value through the convergence of AI agents, machine learning, and modern enterprise architecture. As part of our growing Enterprise AI consulting practice, we are looking for technically hands-on professionals to design and deliver client-centric intelligent systems and support business growth through strategic pre … communication and consulting skills, with the ability to present solutions to both technical and non-technical stakeholders. Preferred Skills • Hands-on exposure to cognitive architectures, planning-based agents, or reinforcement learning in real-world deployments. • Experience integrating AI agents into enterprise apps like Salesforce, ServiceNow, SAP, or custom apps via APIs. • Understanding of AI observability, performance monitoring, and More ❯

Posted: Yesterday

Agentic AI Architect

London, England, United Kingdom
Hybrid / WFH Options

Infosys

Infosys Consulting is at the forefront of applied AI innovation, delivering real-world business value through the convergence of AI agents, machine learning, and modern enterprise architecture. As part of our growing Enterprise AI consulting practice, we are looking for entrepreneurial and technically hands-on AI Agent Engineers and Agentic AI Architects … communication and consulting skills, with the ability to present solutions to both technical and non-technical stakeholders. Preferred Skills · Hands-on exposure to cognitive architectures, planning-based agents, or reinforcement learning in real-world deployments. · Experience integrating AI agents into enterprise apps like Salesforce, ServiceNow, SAP, or custom apps via APIs. · Understanding of AI observability, performance monitoring, and More ❯

Posted: 2 days ago

Salary Guide

Reinforcement Learning
London

10th Percentile: £92,500
25th Percentile: £125,000
Median: £150,000
75th Percentile: £175,000
