Remote Reinforcement Learning Jobs

1 to 25 of 38 Remote Reinforcement Learning Jobs

Deep Reinforcement Learning Specialist with Security Clearance

Colorado Springs, Colorado, United States
Hybrid / WFH Options
Aerospace Corporation
real-time operating systems, and visualization frameworks, with expertise in cyber security, software architecture, software engineering, process improvement, software development, data science, and machine learning to deliver responsive, resilient, high-performance software intensive systems to our IC, DoD, and civilian customers. The Data Science and Artificial Intelligence Department (DSAID … seeks a creative and enthusiastic Deep Reinforcement Learning Specialist to join a team of engineers, data scientists, and programmers with a passion for researching, prototyping, understanding, and building AI and data-enabled tools across the space enterprise. We are a growing, innovative, and collaborative department that makes meaningful … Security Space (AF, NRO, etc.), and Civil and Commercial customers (NASA, NOAA, MDA, DHS, etc.). The ideal candidate has deep expertise in Deep Reinforcement Learning, while contributing broadly to AI projects involving one or more of the following: autonomous systems, control and dynamics, robotics, and nonlinear optimization. More ❯
Employment Type: Permanent
Salary: USD 249,600 Annual
Posted:

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Locus Robotics
to sorting and pack-out-Locus Robotics empowers businesses to meet peak demands and adapt to ever-changing operational needs. Are you a Machine Learning Engineer with a passion for reinforcement learning, multi-agent systems, and simulation at scale? We want to hear from you! At Locus … tools and ML systems to optimize the behavior of large autonomous fleets in dynamic environments. In this role, you will work on cutting-edge reinforcement learning (RL) models, multi-agent systems, and faster-than-real-time simulations to drive innovation in logistics, robotics, and beyond. You'll collaborate … real-time modelling of 1,000+ autonomous agents for various use cases such as fleet optimization, logistics, or robotics. Develop, deploy, and maintain machine learning models, with a strong focus on reinforcement learning (RL) and multi-agent systems to optimize fleet behavior in dynamic environments. Implement and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Software Engineer

Surrey, Metro Vancouver, British Columbia
Hybrid / WFH Options
ECM Selection
experience will also be considered) - Full rights to work in the UK without limitation While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits … hybrid or remote work. Keywords: RSE, Scientific Modelling, R&D, Python, Julia, R, C, C++, C#, Fortran, Digital Twins, Agentic Systems, Probabilistic Programming, Machine Learning, Reinforcement Learning Another top job from ECM, the high-tech recruitment experts. Even if this job's not quite right, do contact More ❯
Employment Type: Permanent
Salary: £60,000 - £70,000
Posted:

Senior Data Scientist Data and Insights London Hybrid Remote

London, United Kingdom
Hybrid / WFH Options
loveholidays
Data Scientists, four Data Scientists, and the Head of Data Science. We specialise in various areas such as Recommender Systems, Time Series Forecasting, Deep Learning, and Reinforcement Learning, fostering a collaborative learning environment. Our focus is on modelling and problem-solving, leveraging advanced machine learning … planning/prioritisation to delivery including monitoring and alerting Designing experiments and modelling to generate actionable insights and enhance business performance Proficient in machine learning and statistical methods for predictive modelling and forecasting Experience deploying ML models to production at scale Solid understanding of SQL Proficiency in unit testing … CI/CD, model management and experiment tracking Desirable Experience with Deep Learning, Generative AI and Reinforcement Learning Experience with Time Series Forecasting and Recommender Systems Previous experience working in e-commerce, retail, or the travel industry. Conducted and analysed large scale A/B experiments Experience More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AI/ML Tech Lead

Manchester Area, United Kingdom
Hybrid / WFH Options
Airtime
engineers, product teams, and software engineers to integrate ML solutions into proprietary leading-edge fintech applications. Research and implement state-of-the-art deep learning, reinforcement learning, and transformer-based NLP models (e.g., BERT, GPT). Monitor model performance and security, retrain models as necessary, and optimise … adoption. Hire, mentor and support junior ML engineers, fostering engineering excellence, knowledge-sharing and best practices within the team. Requirements Strong background in Machine Learning, Deep Learning, and NLP Experience with transformer models (BERT, GPT, LLaMA, etc.) Proficiency in SQL, Python and ML libraries such as TensorFlow, PyTorch … accessible across the business Colleague Benefits Share options. 23 days annual leave, plus one for each year served (capped at 28). Birthday leave. Learning & development budget/time allocation Flexible start & finish hours 06:30 - 10:30 am Life assurance at 5x salary Health cash plan Virtual GP More ❯
Posted:

AI / ML Recruitment Consultant

United Kingdom
Hybrid / WFH Options
AT Talent - Rec2Rec
plug directly into the world's most impressive DeepTech businesses in the world to place the TOP 1% across Robotics, Computer Vision, GenAI, Machine Learning, Deep Learning, Reinforcement Learning + the works. I could keep going, so let me skip to the logistics of the role More ❯
Posted:

Artificial Intelligence/Machine Learning Engineer with Security Clearance

Colorado Springs, Colorado, United States
Hybrid / WFH Options
Lockheed Martin
This capability will show that technical barriers have been removed for automated coordination across defense and industry RF spectrum users. Artificial Intelligence/Machine Learning Engineer THE WORK The Lockheed Martin Realm AI team is looking for an AI/ML engineer who is passionate about cutting-edge AI … physics and defense. The Realm AI team is a research and development team located in Colorado Springs, CO that applies modern optimization and machine learning algorithms to some of our customers' most complex and challenging problems. Our team specializes in non-linear stochastic optimization, deep learning, reinforcement learning and high-performance computation applied to advanced missile defense. The selected candidate will primarily be working to extend and refine AI algorithms to new problems and domains. The selected candidate will be required to work onsite in Colorado Springs and must be able to obtain a security More ❯
Employment Type: Permanent
Salary: USD 225,975 Annual
Posted:

Principal Machine Learning Engineer, Agentic AI

Oregon, United States
Hybrid / WFH Options
Zillow
to deliver a delightful customer experience. We strongly believe in accountability, collaboration, and transparency. About the role Zillow is looking for a Principal Machine Learning Engineer on the Agentic AI team to develop innovative multimodal Agentic AI technologies. They will be hands-on in prototyping ideas, collaborating with product … partners, and evaluating, and deploying models to Zillow's highly scalable infrastructure platforms. They'll be developing perception and language understanding, deep reasoning, and reinforcement learning to handle AI agent decision-making. As a Principal Machine Learning Engineer on this team, you will: Leverage frameworks like AgengSDK … has: A master's degree or above Experience in Computer Science, Electrical Engineering, or a related field with emphasis on foundational LLM, agentic AI, reinforcement learning, AI planning, or natural language processing 7+ years of hands-on work building high-impact solutions-ideally building agent-based systems, multi More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

AI Engineer

London, United Kingdom
Hybrid / WFH Options
Birdie
up skill our broader engineering teams. Build and maintain AI infrastructure leveraging the latest technology approaches: Investigate cutting-edge approaches like generative AI or reinforcement learning to drive adoption and impact. Ensure seamless integration of AI into existing platform and customer-facing processes. We'd love to hear … from you if you have: At least 2 years of experience working in an AI engineering or machine learning role supporting customer-facing processes and infrastructure. Experience working in a cross functional product engineering squad with deep interest in understanding and solving end user problems. Proven expertise in building … scalable machine learning models and deploying them into production environments. Passionate about enabling others with tools, frameworks, and documentation-ensuring teams can independently leverage AI without creating bottlenecks. A technical leader who stays curious about emerging technologies (e.g., generative AI, reinforcement learning) and rapidly evaluates their pros More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Artificial Intelligence Engineer

United Kingdom
Hybrid / WFH Options
WNTD
autonomous AI agents that can reason, plan, and take actions independently in complex environments. The ideal candidate will have experience with LLM-based agents, Reinforcement Learning (RL), event-driven architectures, and multi-agent systems . You will work on next-generation AI solutions that enhance automation, decision-making … CrewAI, OpenAI Function Calling, etc.). Hands-on experience with LLMs ( GPT, LLaMA, Claude, Mistral , etc.), diffusion model or GANs Hands-on experience with Reinforcement Learning (RL), Evolutionary Algorithms, or Graph Neural Networks . Strong experience with AI/ML framework such as TensorFlow, PyTorch or JAX . More ❯
Posted:

Senior Data Scientist London, UK (hybrid)

London, United Kingdom
Hybrid / WFH Options
Rakuten Viber
with family and friends, through managing business relationships to pursuing their passions. Connecting people across the world is a complex problem with many machine-learning applications. The purpose of this role is to implement mathematical models and algorithms to solve complex business problems in recommendations and classification. Successful outcomes … globe. As a Senior Data Scientist, you will work in a highly collaborative environment with extensive amounts of data to research and develop deep learning models in the domains of dating, moderation and content segmentation and apply them to tasks such as recommendation systems and analytics at a high … management and partner teams to design and implement solutions in recommender systems for given objectives. Lead technical efforts to improve the performance of deep learning models and propose initiatives to impact company goals directly. Autonomously find solutions to complex problems in social network recommendations and understand the data generation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Specialist, Applied AI Engineer

London, United Kingdom
Hybrid / WFH Options
Industrial and Financial Systems
and iterate AI solutions in collaboration with customers and use case leads is a must. You will be responsible for applying generative AI techniques, reinforcement learning, and multi-agent systems to solve real-world industrial challenges. This role demands a high level of creativity in developing new AI … and optimization of AI models to ensure reliability, safety, and regulatory compliance in industrial settings. Stay ahead of the latest advancements in generative AI, reinforcement learning, and multi-agent systems to drive innovation within the company. Qualifications Masters degree in Data Science. Thought leader and expert in applied More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Software Engineer

Guildford, Surrey, United Kingdom
Hybrid / WFH Options
Ecm Selection
experience will also be considered) Full rights to work in the UK without limitation While not required, any additional experience with digital twins, machine learning, reinforcement learning or agentic systems within a scientific context would be beneficial. On offer is a very competitive salary and attractive benefits More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer, Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify AB
environment. What You'll Do: Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are: 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineer - Content and Catalog Management

London, United Kingdom
Hybrid / WFH Options
Spotify
environment. What You'll Do Drive the full lifecycle of ML solutions for CoCaM services, including research, design, development, evaluation, and deployment. Manage Machine Learning projects ranging from Supervised Learning, to Reinforcement Learning, to LLMs. Optimize and monitor deployed ML model performance, implementing improvements based on … applications. Work closely with engineering teams to integrate ML models into existing systems and workflows. Be an active participant of a group of machine learning engineers, staying updated with the latest advancements, participating in code reviews, and contributing to knowledge sharing across the team. Who You Are 2+ years … of hands-on experience in developing and deploying machine learning models in a production environment. Practical experience in implementing ML systems using languages like Python or Scala and are familiar with relevant ML libraries and frameworks (e.g., TensorFlow or PyTorch). Solid understanding of various machine learning algorithms More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning (ML) Engineer II

Uxbridge, Middlesex, United Kingdom
Hybrid / WFH Options
NLP PEOPLE
office is open planned, and you will be working in an innovative and collaborative environment with your international peers and colleagues. As the Machine Learning (ML) Engineer II, you will be responsible for developing AI Labs' machine learning platform and creating production-ready AI-based solutions for key … guidance to other parts of the business. Key Accountabilities: ML and Natural Language Processing (NLP) Technology: • Leverage proprietary technology stack to build custom machine learning models • Design, implement, and document new ML/NLP modeling techniques and strategies • Develop Back-end/server-side software to support AI solution … development and serving • Build internal frameworks, libraries, and infrastructure to improve machine learning and NLP capabilities to allow for rapid prototyping and new product delivery • Review and adapt recent research in ML and NLP into prototypes and production solutions • Review and improve the code of other engineers to enhance More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Manager, Research Science (Foundational Research, Machine Learning)

London, United Kingdom
Hybrid / WFH Options
Thomas Reuters
Are you a curious and open-minded individual with an interest in leading teams conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientist Managers with a passion for leading teams and solving challenging machine learning problems in a data-rich, complex … academic environment driven by real-world problems. Foundational Research is the dedicated core Machine Learning research division of Thomson Reuters. We are focused on research and development, with a particular focus on advanced algorithms and training techniques for Large Language Models (LLMs). We are building a strong foundation … sleeves and participate in designing, coding, conducting experiments, and translating findings into concrete deliverables. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning & reasoning (e.g. LLMs & Knowledge Graphs, Advanced Prompting, CoT pipelines, RAG, Tool More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist

Manchester Area, United Kingdom
Hybrid / WFH Options
55 Exec Search
learning. Analyse behavioural datasets to uncover insights and patterns. Design and train predictive models based on user interactions, decision flows, and sequences. Apply machine learning and deep learning techniques to simulate and anticipate behaviour. Collaborate with the wider AI/ML engineers and team Develop and implement scalable … data pipelines and experimentation frameworks. Translate business and product questions into data-driven solutions. Stay on top of emerging trends in behavioural modelling, reinforcement learning, and cognitive AI. What we are looking for: 2+ years of experience in data science or machine learning roles. Strong knowledge of More ❯
Posted:

Data Scientist

Birmingham, Staffordshire, United Kingdom
Hybrid / WFH Options
Investigo
Scientist Salary: £50,000 - £57,000 Location: Birmingham - Hybrid working 1-2 days a month on site About You: Technical Skills: Expertise in machine learning (supervised, unsupervised, reinforcement learning). Strong proficiency in Databricks, Azure ML, Azure Data Factory, and SQL. Hands-on experience in Python, R … and experience with deep learning frameworks such as TensorFlow, PyTorch, or Scikit-learn. Expert-level experience in Power BI for advanced visualisations, ML model interpretation, and KPI tracking. Deep knowledge of feature engineering, model deployment, and MLOps best practices. Experience with big data processing (Spark, Hadoop) and cloud-based More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Data Scientist

Birmingham, West Midlands, West Midlands (County), United Kingdom
Hybrid / WFH Options
Investigo
Scientist Salary: £50,000 - £57,000 Location: Birmingham - Hybrid working 1-2 days a month on site About You: Technical Skills: Expertise in machine learning (supervised, unsupervised, reinforcement learning). Strong proficiency in Databricks, Azure ML, Azure Data Factory, and SQL. Hands-on experience in Python, R … and experience with deep learning frameworks such as TensorFlow, PyTorch, or Scikit-learn. Expert-level experience in Power BI for advanced visualisations, ML model interpretation, and KPI tracking. Deep knowledge of feature engineering, model deployment, and MLOps best practices. Experience with big data processing (Spark, Hadoop) and cloud-based More ❯
Employment Type: Permanent
Salary: £50000 - £57000/annum
Posted:

Full Stack Developer with Security Clearance

Lexington, Massachusetts, United States
Hybrid / WFH Options
Equiliem
to understand front-end source code written in React or similar frameworks. Provide guidance to less experienced front-end engineers. • General knowledge of machine learning and reinforcement learning concepts, frameworks, and environments, such as Pandas, TensorFlow, and Jupyter Notebook. • Broad knowledge of the general features, capabilities, and More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Applied Scientist, EU STEP Science and Tech

London, United Kingdom
Hybrid / WFH Options
Amazon
options in Barcelona, Berlin, or London, designed to maximize team interaction. Remote work options are also considered. Basic Qualifications PhD in Operations Research, Machine Learning, Statistics, Applied Mathematics, Computer Science, or related fields, or equivalent experience. Excellent written and verbal communication skills. Experience programming in Java, Python, C++, or … in a fast-paced applied research environment. Ability to handle ambiguity. Top-tier publications in relevant fields. Key Responsibilities Solve complex optimization and machine learning problems using scalable algorithms. Design and develop prototypes addressing real-world logistics challenges. Lead analyses to support decision-making and communicate results to leadership. … optimizing Amazon's logistics through advanced mathematical algorithms and data-driven techniques. We work closely with academic experts, employ cutting-edge methods like deep learning and reinforcement learning, and prioritize correct modeling and production-ready prototypes. Our goal is to support strategic investments and solve real-world More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Machine Learning Developer

United Kingdom
Hybrid / WFH Options
Capgemini
LLMs and GenAI. • Perform feature engineering and selection to optimize model performance. • Select and implement appropriate AI/ML algorithms, including supervised, unsupervised, and reinforcement learning models. • Train, evaluate, and optimize models using machine learning techniques. • Deploy models to production environments, ensuring robustness and scalability. • Monitor model … in AI/ML development. • Strong understanding of AI/ML algorithms and techniques, including LLMs, GenAI, and automated AI systems. • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch). • Proficiency in Python, R, or other relevant programming languages. • Experience with data analysis and visualization tools (e.g., Matplotlib, Seaborn … invested in wellbeing apps such as Thrive and Peppy. You will be empowered to explore, innovate, and progress. You will benefit from Capgemini's 'learning for life' mindset, meaning you will have countless training and development opportunities from think tanks to hackathons, and access to 250,000 courses with More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Algorithm Developer

Stevenage, Hertfordshire, United Kingdom
Hybrid / WFH Options
MBDA Miissle System
skills Relevant experience (Post-Doctoral or industrial) in robotics, data fusion, tracking/estimation, pattern discovery & recognition, statistical inference, optimisation and machine/deep learning algorithms along with real-time implementation, and/or validation & verification is a strong advantage Experience in Matlab, Simulink and/or Stateflow, Python … filters, grid-based estimation methods, Multi-Object-Multi-Sensor Fusion, data-association, random finite sets, Bayesian belief networks, Dempster-Shafer theory of evidence Machine Learning for regression and pattern recognition/discovery problems e.g. Gaussian processes, latent variable methods, support vector machines, probabilistic/statistical models, neural networks, Bayesian … inference, random-forests, novelty detection, clustering Deep Learning e.g. Deep reinforcement learning, Monte-Carlo tree search, deep regression/classification, deep embeddings, recurrent Networks, natural language processing Computer Vision algorithms e.g. Structure from motion, image Based navigation, SLAM, pose estimation/recovery To be considered for the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Research Scientist: Knowledge and Semantics

United Kingdom
Hybrid / WFH Options
BAE Systems (New)
recruit a team lead for our Knowledge and Semantics team. The Data and Decision Support Capability has a diverse range of teams working in: reinforcement learning, NLP, knowledge graphs, applications of LLMs, computer vision, AI for RF and EW, sonar and acoustics. You will have the opportunity to … You will have experience of leading a team of researchers working in NLP, LLM applications, text-mining, knowledge graphs and/or graph machine learning and with a vision on how to develop solutions for practical applications of ML in these domains. You will have a proven record of … successful proposal writing and delivery to funding bodies relevant to the Defence sector. You should have existing skills in Machine Learning (ML), will need to be a proficient programmer in Python, with extensive experience in the use of libraries and toolboxes to support efficient development. Candidates will also have More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Reinforcement Learning
Work from Home
10th Percentile
£90,801
25th Percentile
£105,000
Median
£150,000
75th Percentile
£175,000