1 to 25 of 74 Permanent Reinforcement Learning Jobs in the UK excluding London

AI Engineer

Hiring Organisation
DXC Technology
Location
Bishopton, Renfrewshire, Scotland, United Kingdom
Employment Type
Permanent
advanced prompt engineering strategies. Leverage Retrieval-Augmented Generation (RAG) for enhanced contextual performance. Build intelligent agents using frameworks like LangChain, LlamaIndex, CrewAI, AutoGen. Apply reinforcement learning techniques including Q-learning, policy gradients, and RLlib. Collaborate with cross-functional teams to integrate AI solutions into scalable products. Ensure … background in fine-tuning and prompt engineering. Hands-on experience with RAG pipelines. Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen). Solid understanding of reinforcement learning concepts and tools (Q-learning, policy gradients, RLlib). Azure AI Engineer Associate certification (or willingness to obtain). Bachelor's degree

Physicist

Hiring Organisation
Huberta
Location
Aberdeen, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Swindon, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Coventry, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Belfast, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Southampton, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Cardiff, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Leicester, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Sheffield, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Nottingham, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Liverpool, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Midlands, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Edinburgh, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Glasgow, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Leeds, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Birmingham, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Bradford, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Bristol, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Manchester, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Luton, Bedfordshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Bournemouth, Dorset, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Basildon, Essex, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Cheltenham, Gloucestershire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Basingstoke, Hampshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory

Physicist

Hiring Organisation
Huberta
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical … structures, such as tensor networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory