Permanent Reinforcement Learning Jobs in the UK excluding London

1 to 25 of 34 Permanent Reinforcement Learning Jobs in the UK excluding London

Research Scientist Intern, Reinforcement Learning

london, south east england, united kingdom
Wayve
paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment … we back each other to deliver impact. Make Wayve the experience that defines your career The role We're looking for a curious and motivated Reinforcement Learning Intern to help advance the next generation of decision-making systems for autonomous driving. In this role, you'll work embedded in a research team to develop scalable RL algorithms that … enable vehicles to learn complex behaviors directly from experience — both in simulation and the real world. The ideal candidate has experience in some combination of reinforcement learning, imitation learning, offline RL, or world modelling, and is motivated to apply cutting-edge research ideas to real-world embodied AI challenges. We're particularly interested in temporal credit assignment More ❯
Posted:

Data Science Faculty

norfolk, east anglia, united kingdom
ODU
any area of data science. Working with our partners at JLab and NASA LaRC, areas of particular interest include: big data analytics, data mining, data visualization, GIS , scientific machine learning, reinforcement learning, federated learning, foundational models for science, generative models, causality discovery, data privacy and security, and quantum computing. We also seek data science faculty that More ❯
Posted:

AI Engineer

Bishopton, Renfrewshire, Scotland, United Kingdom
Hybrid/Remote Options
DXC Technology
for domain-specific applications Implement advanced prompt engineering strategies Leverage Retrieval-Augmented Generation (RAG) for enhanced contextual performance Build intelligent agents using frameworks like LangChain, LlamaIndex, CrewAI, AutoGen Apply reinforcement learning techniques including Q-learning , policy gradients , and RLlib Collaborate with cross-functional teams to integrate AI solutions into scalable products Ensure best practices in data engineering … GPT, LLaMA, Mistral, Claude) Strong background in fine-tuning and prompt engineering Hands-on experience with RAG pipelines Familiarity with Agent Frameworks (LangChain, LlamaIndex, CrewAI, AutoGen) Solid understanding of reinforcement learning concepts and tools (Q-learning, policy gradients, RLlib) Azure AI Engineer Associate certification (or willingness to obtain) Bachelor's degree in a relevant field or equivalent … with industry knowledge and technology Why Join Us? Work on impactful AI projects with real-world applications Be part of a collaborative and forward-thinking team Access to continuous learning and development opportunities Flexible working arrangements and a supportive work culture Ready to shape the future of AI? Apply now and bring your expertise to a team that values More ❯
Employment Type: Permanent
Posted:

Physicist

Midlands, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Edinburgh, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Glasgow, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Leeds, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Birmingham, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Aberdeen, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Bristol, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Manchester, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Worcester, Worcestershire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Chester, Cheshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Chelmsford, Essex, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Slough, Berkshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Stevenage, Hertfordshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Gloucester, Gloucestershire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Lincoln, Lincolnshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Warrington, Cheshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Newport, Isle of Wight, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Bolton, Greater Manchester, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

York, North Yorkshire, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Physicist

Brighton, East Sussex, UK
Hybrid/Remote Options
Huberta
ownership. As we scale, opportunities to build and lead teams. We're hiring immediately. What You'll Actually Do Derive mathematical formulations of abstract concepts. Design and implement novel learning approaches informed by physics and mathematics. Take concepts from statistical mechanics, information theory, and dynamic systems to construct first principles algorithms. Optimise using advanced mathematical structures, such as tensor … networks, and techniques like reinforcement learning - make these approaches work in a production system. Debug why your theoretically sound approach breaks at scale. Fix it. Ship it. Daily reality includes mathematical derivations and performance optimisation. You'll need to be comfortable moving between theory and systems-level engineering within the same afternoon. Required Qualifications Education & Experience: PhD in … physics and statistical physics Dynamic systems (energy landscapes, emergent behaviours) Modelling and simulation Mathematics: Advanced linear algebra, optimisation, numerical methods Information theory Probability, statistics Graph theory Highly Valued: Machine Learning Experience: Specifically in parameter-free learning approaches, Bayesian methods and belief networks, reinforcement learning and graph neural networks, computational optimisation at scale, algorithms and data structures. More ❯
Employment Type: Full-time
Posted:

Research Engineer

london, south east england, united kingdom
Hybrid/Remote Options
Anthropic
of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You want to build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems. You care about making AI helpful, honest, and harmless, and are interested in the ways that … Testing the robustness of our safety techniques by training language models to subvert our safety techniques, and seeing how effective they are at subverting our interventions. Run multi-agent reinforcement learning experiments to test out techniques like AI Debate. Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks. Write scripts and prompts to efficiently produce … efforts Pick up slack, even if it goes outside your job description Care about the impacts of AI Strong candidates may also: Have experience authoring research papers in machine learning, NLP, or AI safety Have experience with LLMs Have experience with reinforcement learning Have experience with Kubernetes clusters and complex shared codebases Candidates need not have More ❯
Posted:

Lead AI Research Engineer

Chelmsford, Essex, UK
Hybrid/Remote Options
Edison Smart
AI research team and are looking for a Lead AI Technologist to help lead the team, researching and developing novel AI/ML technologies. You'll lead projects spanning reinforcement learning, NLP, computer vision, and signal processing, while mentoring teams and shaping partnerships with academic and industry leaders. Responsibilities: Lead delivery of large-scale AI/ML projects … in engineering assurance and technical review Proposal writing experience Strong communication and mentoring skills Beneficial: Expertise in one or more of: AI/ML for imagery and remote sensing Reinforcement learning Natural Language Processing and Large Language Models Knowledge graphs and graph neural networks AI for RF, EW, radar, sonar, or acoustics Autonomy applications If you are interested More ❯
Employment Type: Full-time
Posted:
Reinforcement Learning
the UK excluding London
10th Percentile
£69,600
25th Percentile
£70,000
Median
£80,000
75th Percentile
£97,500
90th Percentile
£121,250