20 of 20 Reinforcement Learning Jobs in the South East

Physicist

Hiring Organisation
Huberta
Location
Southampton, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Slough, Berkshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Portsmouth, Hampshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Guildford, Surrey, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
High Wycombe, Buckinghamshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Woking, Surrey, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Dartford, Kent, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Oxford, Oxfordshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Reading, Berkshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Maidstone, Kent, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Crawley, West Sussex, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Brighton, East Sussex, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Newport, Isle of Wight, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

AI Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £85,000 per annum
currently working on an AI Engineer role with a global language and translation company. You will be designing, developing, and deploying cutting-edge machine learning solutions across the company.If you enjoy end-to-end ownership (from experimentation to deployment), working with AWS, Docker, MLflow, TensorFlow/PyTorch, and contributing … code in Python Experience with TensorFlow, PyTorch and Scikit-learn Experience with NLPs and LLMs Speech, Text or Audio data Strong knowledge of machine learning techniques and algorithms, including supervised and unsupervised learning, deep learning, and reinforcement learning

Machine Learning Quant Engineer

Hiring Organisation
Michael Page Technology
Location
London, South East, England, United Kingdom
Employment Type
Temporary
Salary
£1,000 - £1,200 per day
role requires an ML Quant Engineer with expertise within an Investment Bank. The position is based in London and involves developing and implementing machine learning models to support financial decision-making. Client Details The hiring organisation is a large entity within the financial services industry. Description Design and implement … machine learning models for financial applications, with a focus on derivatives pricing, risk analytics, and market forecasting. Build scalable ML pipelines to process large volumes of financial data efficiently. Develop deep learning architectures for time series prediction, anomaly detection, and pattern recognition in market data. Optimise model performance

Machine Learning Quant Engineer - Investment banking/ XVA

Hiring Organisation
Harvey Nash
Location
London, South East, England, United Kingdom
Employment Type
Contractor
Contract Rate
£1,000 - £1,200 per day
Senior Quant Machine Learning Engineer sought by leading investment bank based in the city of London. **Inside IR35, 4 days a week on site** The role:To lead the design and deployment of ML-driven models across our trading and investment platforms. This is a high-impact, front-office … production deployment Mentor junior quants and engineers; contribute to knowledge-sharing and model governance processes Stay current with cutting-edge ML research (e.g., deep learning, generative models, reinforcement learning) and assess applicability to financial markets Collaborate closely with cross-functional teams, including traders, data engineers, and software

Machine Learning Engineer (0–3 Years Experience)

Hiring Organisation
IT Graduate Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£45,000 - £75,000 per annum, OTE
Machine Learning Engineer (LLM/AI Systems) London/Hybrid | 0–3 Years Experience | Competitive Salary Are you obsessed with AI and large language models? We’re an early-stage startup building real-world products powered by LLMs — from intelligent copilots to adaptive automation tools — and we’re looking … research — we give you time and resources to explore, learn, and publish. What We’re Looking For 0–3 years of experience in Machine Learning, Data Science, or NLP/LLM. Strong Python skills; exposure to PyTorch/TensorFlow/Hugging Face. (Bonus) understand fundamentals of deep learning

New Trading Team's 1st C++ Quant Developer | HFT

Hiring Organisation
Augmentti
Location
Slough, Berkshire, UK
Employment Type
Full-time
frequency, low-latency trading. Work with a Humble Leader: You'll work closely with a brilliant PM who has a strong technical background (from reinforcement learning strategies to low-latency C++ coding) and a pragmatic, collaborative approach. This is someone who's not only mastered complex trading strategies