Remote Reinforcement Learning Jobs in London

3 of 3 Remote Reinforcement Learning Jobs in London

2026 Machine Learning Center of Excellence (NLP) - Summer Associate

london, south east england, united kingdom
Hybrid/Remote Options
JPMorganChase
Description The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions for our most challenging business problems. This includes ensuring the quality, integrity, and … generate insights and drive decision-making. The CDAO is also responsible for developing and implementing solutions that support the firm's commercial goals by harnessing artificial intelligence and machine learning technologies to develop new products, improve productivity, and enhance risk management effectively and responsibly. As a Summer Associate within the MLCOE, you will apply sophisticated machine learning methods … to a diverse range of complex domains, including natural language processing, large language models, speech recognition and understanding, reinforcement learning, and recommendation systems. You will collaborate closely with MLCOE mentors, business experts, and technologists, conducting independent research and deploying solutions into production. A strong passion for machine learning, solid expertise in deep learning with hands-on More ❯
Posted:

Research Engineer

london, south east england, united kingdom
Hybrid/Remote Options
Anthropic
of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You want to build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems. You care about making AI helpful, honest, and harmless, and are interested in the ways that … Testing the robustness of our safety techniques by training language models to subvert our safety techniques, and seeing how effective they are at subverting our interventions. Run multi-agent reinforcement learning experiments to test out techniques like AI Debate. Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks. Write scripts and prompts to efficiently produce … efforts Pick up slack, even if it goes outside your job description Care about the impacts of AI Strong candidates may also: Have experience authoring research papers in machine learning, NLP, or AI safety Have experience with LLMs Have experience with reinforcement learning Have experience with Kubernetes clusters and complex shared codebases Candidates need not have More ❯
Posted:

Machine Learning Engineer

City of London, London, United Kingdom
Hybrid/Remote Options
JLA Resourcing Ltd
Role - Machine Learning Engineer Location Hybrid, 1 day on site in London (with flex) Salary - £70,000 to £75,000 The Opportunity Were partnering with a client in the financial services sector who are looking to bring on a Machine Learning Engineer to their growing Intelligent Automation Team of 50. Following a period of significant transformation and with … a strong benefits package. The position is hybrid, typically one day per week on site in London (with flexibility around which day). The Role Reporting to the Machine Learning Lead, youll be a hands-on Machine Learning Engineer with a strong track record of building and deploying ML solutions at scaleparticularly in NLP and GenAI. You will … ARM, Terraform). A test-driven development mindset and commitment to engineering quality. Broad understanding of ML approaches, with the ability to explain methods clearly: Regression, clustering, decision trees, reinforcement learning Gradient boosting, CNNs, RNNs, LSTMs Attention models, encoder/decoder architectures, transformers, vector semantics Demonstrable experience developing GenAI applications in real-world settings. The confidence to communicate More ❯
Employment Type: Permanent, Work From Home
Salary: £70,000
Posted:
Reinforcement Learning
London
10th Percentile
£65,975
25th Percentile
£83,563
Median
£95,000
75th Percentile
£98,125
90th Percentile
£103,000