Reinforcement Learning Jobs in the UK

1 to 25 of 62 Reinforcement Learning Jobs in the UK

NLP / LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence

London, United Kingdom
NLP PEOPLE
NLP/LLM Scientist - Applied AI ML Lead - Machine Learning Centre of Excellence The Machine Learning Center of Excellence invites the successful candidate to apply sophisticated machine learning methods to a wide variety of complex tasks including natural language processing, speech analytics, time series, reinforcement learning and recommendation systems. The candidate must excel in working … in a highly collaborative environment together with the business, technologists and control partners to deploy solutions into production. The candidate must also have a strong passion for machine learning and invest independent time towards learning, researching and experimenting with new innovations in the field. The candidate must have solid expertise in Deep Learning with hands-on implementation … experience and possess strong analytical thinking, a deep desire to learn and be highly motivated. Job Responsibilities • Research and explore new machine learning methods through independent study, attending industry-leading conferences, experimentation and participating in our knowledge sharing community • Develop state-of-the-art machine learning models to solve real-world problems and apply them to tasks such More ❯
Employment Type: Permanent
Salary: GBP Annual

Research Engineer: Graph Machine Learning (London)

London, UK
Atmanlabs
Research Engineer: Graph Machine Learning Atman Labs, London About Atman Labs At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. As an applied research and commercialization company we are deploying our products in a number of … value of our approach from proactive shopping assistance, to personal teachers to healthcare concierges and with this commercial focus advance our unique research that lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors. The Next Frontier of Machine Reasoning: Web-scale Knowledge Graph Exploration using Reinforcement Learning … and explore structured mental models in their heads to solve open-ended problems across different domains. Our research seeks to emulate this process through a novel combination of using reinforcement learning agents to perform exploration through a knowledge graph. Knowledge graphs allow us to represent structured information and the logical relations that govern it, unlocking the ability to More ❯
Employment Type: Full-time

Research Engineer, Machine Learning (Horizons) (London)

London, UK
Hybrid / WFH Options
Anthropic
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI … systems. About the role: As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models. Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks. Design and run experiments to enhance models' reasoning capabilities, particularly in code generation More ❯
Employment Type: Full-time

Research Engineer, Machine Learning (Horizons) London, UK (London)

London, UK
Hybrid / WFH Options
Alcides Fonseca
Research Engineer, Machine Learning (Horizons) London, UK About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI … systems. About the role: As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic/open-ended tasks. Representative projects: Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models. Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks. Design and run experiments to enhance models' reasoning capabilities, particularly in code generation More ❯
Employment Type: Full-time

Applied AI ML Director - NLP / LLM and Graphs

London, United Kingdom
NLP PEOPLE
generate insights and drive decision-making. The CDAO is also responsible for developing and implementing solutions that support the firm's commercial goals by harnessing artificial intelligence and machine learning technologies to develop new products, improve productivity, and enhance risk management effectively and responsibly. As an Applied AI ML Director - NLP/LLM and Graphs within the Chief Data & Analytics … Office, Machine Learning Centre of Excellence, you will have the opportunity to apply sophisticated machine learning methods to complex tasks including natural language processing, graph analytics, speech analytics, time series, reinforcement learning and recommendation systems. You will collaborate with various teams and actively participate in our knowledge sharing community. We are looking for someone who excels … in a highly collaborative environment, working together with our business, technologists and control partners to deploy solutions into production. If you have a strong passion for machine learning and enjoy investing time towards learning, researching and experimenting with new innovations in the field, this role is for you. We value solid expertise in Deep Learning with hands More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
InstaDeep Ltd
be a part of the AI revolution! About DeepPCB: DeepPCB is InstaDeep's AI-powered Place & Route PCB (Printed Circuit Board) design tool. We use a combination of deep reinforcement learning and high-performance computing to automate and scale PCB place-and-route workflows, accelerating hardware innovation globally. We are looking for a Machine Learning Engineer to … join the DeepPCB team and help push the boundaries of AI for electronic design automation (EDA). You will develop, optimize, and deploy cutting-edge machine learning and reinforcement learning models focused on automating complex PCB design problems, working closely with researchers and engineers to bring ideas to life. Responsibilities: Develop scalable and efficient machine learning … technical depth based on the audience. Participate in technical discussions, design reviews, and customer-facing activities when required. Requirements: B.Sc., M.Sc., or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related technical field. 5 years of professional experience in applied machine learning or engineering roles. Strong expertise in Machine Learning and Deep Learning More ❯
Employment Type: Permanent
Salary: GBP Annual

Research positions available in machine learning at all levels: Research Fellow, Postdoc, PhD s ...

Manchester, Lancashire, United Kingdom
The International Society for Bayesian Analysis
Research positions available in machine learning at all levels: Research Fellow, Postdoc, PhD student. Turing AI Fellowship, Univ Manchester, UK Still some positions available in my new research group funded by the Turing AI World-Leading Researcher Fellowship: Human-AI Research Teams: Steering AI in Experimental Design and Decision-Making. Positions are available at all stages; we seek to … but leave some for future years as well: - Research Fellow - Postdoc - PhD Student The work involves probabilistic modelling in exciting new settings, and developing new methods for probabilistic machine learning and inference. Applicants with outstandingly strong expertise in one of the following topics are welcome, or strong expertise in one and keen interest in working with expert colleagues on the … others: automatic experimental design, Bayesian inference, human-in-the-loop learning, advanced user modelling, machine teaching, privacy-preserving learning, reinforcement learning, inverse reinforcement learning, simulator-based inference, likelihood-free inference. There will be particularly good opportunities to join new work on collaborative modelling and decision-making with AI. And applications in drug design, synthetic More ❯
Employment Type: Permanent
Salary: GBP Annual

Principal AI Engineer - Entourage

London, United Kingdom
Web3 Foundation
protocols at the intersection of Web3 and AI, disrupting traditional industries, empowering individuals, and fostering a more equitable digital landscape. Recent ventures include: - Kluster.ai - Anti-hallucination platform for Machine Learning models - Moonbeam - EVM-compatible L1 blockchain optimised for cross-chain use cases - Tanssi - Decentralised AppChain infrastructure secured via Restaking What you'll do: Lead the end-to-end design … and development of core AI systems that enable collective learning and shared memory among autonomous agents. Architect and implement scalable distributed infrastructure for capturing, validating, and surfacing agent experiences across complex networks at scale. Drive innovation in protocol-level mechanisms for memory curation, knowledge consolidation, and token-incentivized participation across mutually distrusting agents. Build frameworks and tooling that allow … agents to transform episodic experiences and action trajectories into reusable, network-wide intelligence. Collaborate closely with the CTO to operationalize cutting-edge work in reinforcement learning, LLMs, and multi-agent coordination into production-grade systems. Define and uphold technical standards for code quality, security, reliability and scalability across the AI and protocol layers. What you'll bring More ❯
Employment Type: Permanent
Salary: GBP Annual

University Assistant Professor in Machine Learning

Cambridge, Cambridgeshire, United Kingdom
NLP PEOPLE
Applications are invited for a University Assistant Professorship in the broad area of Machine Learning. The successful candidate will join the Computational and Biological Learning Lab (CBL) cbl.eng.cam.ac.uk in the Information Engineering Division. CBL combines expertise in machine learning with computational neuroscience. The candidate will lead a research programme in one or more of the following areas: machine … learning, decision making, and theory and practice of deep learning. We encourage applicants who will strengthen our current research activities in probabilistic machine learning, reinforcement learning, supervised and unsupervised learning, active learning, and all aspects of machine intelligence. We welcome applicants with an interest in applications of machine learning to engineering and the … We will give priority to candidates who are well placed to do this. The candidate will contribute to the Division's teaching activities including those associated with the Machine Learning and Machine Intelligence MPhil programme and the teaching of Information Engineering to undergraduate students. We particularly welcome applications from women and/or candidates from a BME background for More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Data Scientist Data Science Team London, UK

London, United Kingdom
Applied Data Science Partners
t wait to hear from you! KEY RESPONSIBILITIES: Actively contribute to the design, development, testing, and evaluation of data science solutions Train and deploy state-of-the-art machine learning and reinforcement learning models Build AI systems using Large Language Models Build processes for extracting, cleaning and transforming data (SQL/Python) Ad-hoc data mining for … of presenting technical concepts to stakeholders Experience of proactively contributing to the design, development, testing, and deployment of data science and AI solutions Experience and understanding of applied machine learning techniques in Python (e.g., xgboost, regression, decision trees) Experience with physics modelling highly desirable Practical knowledge and experience of developing AI solutions using advanced machine learning techniques (e.g. … reinforcement learning, deep learning) Experience of working collaboratively as part of a data science team, using tools like Git to adhere to established data science and AI best practices Experience of using different analysis techniques to draw insight from data, using tools such as Python and SQL Excellent Python skills, including relevant libraries for data analysis and More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Data Scientist

London, United Kingdom
Applied Data Science Partners
t wait to hear from you! KEY RESPONSIBILITIES: Actively contribute to the design, development, testing, and evaluation of data science solutions Train and deploy state-of-the-art machine learning and reinforcement learning models Build AI systems using Large Language Models Build processes for extracting, cleaning and transforming data (SQL/Python) Ad-hoc data mining for … of presenting technical concepts to stakeholders Experience of proactively contributing to the design, development, testing, and deployment of data science and AI solutions Experience and understanding of applied machine learning techniques in Python (e.g. xgboost, regression, decision trees) Experience with computational simulation/modelling highly desirable Practical knowledge and experience of developing AI solutions using advanced machine learning techniques (e.g. reinforcement learning, deep learning) Experience of working collaboratively as part of a data science team, using tools like Git to adhere to established data science and AI best practices Experience of using different analysis techniques to draw insight from data, using tools such as Python and SQL Excellent Python, including relevant libraries for data More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Machine Learning Engineer

United Kingdom
Igbaffiliate
Senior Machine Learning Engineer Our Company: QiH is a fast-growing, innovative, and progressive scale-up business headquartered in London with a collective of brilliant brains in Skopje. We are at the start of an exciting journey as we build out our internal engineering capability, spearheading our tech transformation, building best in class products and tackling exciting and complex … of our success! At QiH, we have created an energetic and target-driven culture and continuously invest in each individual. The Role We are seeking an experienced Senior Machine Learning Engineer to join our dynamic team. You'll be at the forefront of designing, developing, and deploying ML models that power personalised advertising, customer journey analytics, audience segmentation, and … by Google Cloud Platform (GCP), and we pride ourselves on applying cutting-edge techniques to real-world marketing and advertising challenges. Key Responsibilities: Design, build, and deploy scalable machine learning models for ad targeting, user segmentation, conversion prediction, and content personalization. Develop production-grade ML pipelines leveraging GCP services like Vertex AI, BigQuery, Dataflow, and Pub/Sub. Collaborate More ❯
Employment Type: Permanent
Salary: GBP Annual

Senior Machine Learning Engineer

London Area, United Kingdom
Hybrid / WFH Options
QiH Group
of our success! At QiH, we have created an energetic and target-driven culture and continuously invest in each individual. The Role We are seeking an experienced Senior Machine Learning Engineer to join our dynamic team. You’ll be at the forefront of designing, developing, and deploying ML models that power personalised advertising, customer journey analytics, audience segmentation, and … by Google Cloud Platform (GCP), and we pride ourselves on applying cutting-edge techniques to real-world marketing and advertising challenges. Key Responsibilities: Design, build, and deploy scalable machine learning models for ad targeting, user segmentation, conversion prediction, and content personalization. Develop production-grade ML pipelines leveraging GCP services like Vertex AI, BigQuery, Dataflow, and Pub/Sub. Collaborate … cross-functionally with Data Science, Data Engineering, and Product teams to translate business objectives into ML solutions. Research and implement state-of-the-art techniques in deep learning, reinforcement learning, and large-scale optimization relevant to Ad Tech and Mar Tech. Monitor, troubleshoot, and continuously improve model performance in production environments. Mentor junior engineers and promote best More ❯

Senior Machine Learning Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
QiH Group
of our success! At QiH, we have created an energetic and target-driven culture and continuously invest in each individual. The Role We are seeking an experienced Senior Machine Learning Engineer to join our dynamic team. You’ll be at the forefront of designing, developing, and deploying ML models that power personalised advertising, customer journey analytics, audience segmentation, and … by Google Cloud Platform (GCP), and we pride ourselves on applying cutting-edge techniques to real-world marketing and advertising challenges. Key Responsibilities: Design, build, and deploy scalable machine learning models for ad targeting, user segmentation, conversion prediction, and content personalization. Develop production-grade ML pipelines leveraging GCP services like Vertex AI, BigQuery, Dataflow, and Pub/Sub. Collaborate … cross-functionally with Data Science, Data Engineering, and Product teams to translate business objectives into ML solutions. Research and implement state-of-the-art techniques in deep learning, reinforcement learning, and large-scale optimization relevant to Ad Tech and Mar Tech. Monitor, troubleshoot, and continuously improve model performance in production environments. Mentor junior engineers and promote best More ❯

Head of AI (London)

London, UK
Scrumconnect Consulting
define product strategy. In this role, you will: Lead AI strategy and execution in a high-ambiguity environment. Build, train, and deploy state-of-the-art models (e.g., deep learning, NLP, computer vision, reinforcement learning, or relevant domain-specific architectures). Design infrastructure for data ingestion, annotation, experimentation, model versioning, and monitoring. Collaborate closely with product, design … quarter AI roadmap aligned with product milestones and fundraising milestones. Identify and evaluate opportunities for AI-driven competitive advantages (e.g., proprietary data, unique model architectures, transfer/few-shot learning). Collaborate with business stakeholders to translate big problems into technically feasible AI solutions. Data & Infrastructure Oversee the creation and maintenance of scalable data pipelines (ETL/ELT) and More ❯
Employment Type: Full-time

Solution Architect - AI (Remote)

Cambourne, Cambridgeshire, United Kingdom
Hybrid / WFH Options
Remotestar
COE will focus on nurturing AI talent, fostering knowledge sharing, and continuously evolving AI practices across the organization Location: Chennai, Onsite Experience: • 11+ years of experience in AI, machine learning, or data science, with a proven track record of delivering AI solutions. • 7+ years of experience in a leadership or architecture role, ideally with some experience in leading a … and cloud platforms like AWS, Azure, or Google Cloud. • Experience in multiple industries is advantageous (e.g., healthcare, finance, retail). Skills: AI/ML Expertise: Strong understanding of machine learning algorithms, deep learning, natural language processing, computer vision, and data-driven problem-solving techniques. Architecture Skills: Proven ability to design and architect scalable, reliable, and high-performance AI … ensuring high standards in solution design, integration, and deployment. Consulting and Advisory: Work closely with stakeholders to identify business requirements and translate them into AI-powered solutions, including machine learning models, data pipelines, and AI-driven processes. Platform Selection and Integration: Evaluate and select appropriate AI tools, platforms, and technologies to meet business goals. Oversee integration with existing systems More ❯
Employment Type: Permanent
Salary: GBP Annual

Principal Data Scientist (London)

London, UK
Microsoft
Principal Scientist in MSAI, you will work in an exciting and fast-paced, collaborative environment. You'll bring deep expertise in large language models (LLMs), information retrieval, and machine learning to improve the quality and scalability of M365 Copilot. You'll partner closely with engineering and product teams to innovate, design and evaluate end-to-end AI solutions that serve … impact, technically ambitious projects that directly shape the future of Microsoft 365 Copilot. Examples include: Advancing deep reasoning in Microsoft 365 Copilot by applying next-generation LLM fine-tuning and reinforcement learning techniques. Improving Copilot Chat and Researcher response quality through state-of-the-art grounding data selection strategies. Enhancing Copilot Search by developing novel content representation models. Building … Physics, Statistics, OR related areas is highly preferred. Candidates with master's degree with proven industry experience or a strong publication record in the areas of Information Retrieval, Machine Learning, Natural Language Processing, and Deep Learning are considered as well. Extensive hands-on experience building and deploying products using Machine Learning. Specifically, we are looking for expertise in More ❯
Employment Type: Full-time

Principal Data Scientist - Microsoft 365 Copilot

London, United Kingdom
Microsoft
impact, technically ambitious projects that directly shape the future of Microsoft 365 Copilot. Examples include: - Advancing deep reasoning in Microsoft 365 Copilot by applying next-generation LLM fine-tuning and reinforcement learning techniques. - Improving Copilot Chat and Researcher response quality through state-of-the-art grounding data selection strategies. - Enhancing Copilot Search by developing novel content representation models. - Building … Physics, Statistics, OR related areas is highly preferred. Candidates with master's degree with proven industry experience or a strong publication record in the areas of Information Retrieval, Machine Learning, Natural Language Processing, and Deep Learning are considered as well. - Extensive hands-on experience building and deploying products using Machine Learning. Specifically, we are looking for expertise in … Language Processing, Large Language Models, Information Retrieval, and Recommendation Systems with a good understanding of techniques like Differential Privacy, Responsible AI and related areas. - High proficiency in deploying machine learning applications at scale in real production environments and proven track record of successfully shipping applied research to production is a must - Excellent problem solving and data analysis skills and More ❯
Employment Type: Permanent
Salary: GBP Annual

Principal Data Scientist

London, United Kingdom
Microsoft
impact, technically ambitious projects that directly shape the future of Microsoft 365 Copilot. Examples include: - Advancing deep reasoning in Microsoft 365 Copilot by applying next-generation LLM fine-tuning and reinforcement learning techniques. - Improving Copilot Chat and Researcher response quality through state-of-the-art grounding data selection strategies. - Enhancing Copilot Search by developing novel content representation models. - Building … Physics, Statistics, OR related areas is highly preferred. Candidates with master's degree with proven industry experience or a strong publication record in the areas of Information Retrieval, Machine Learning, Natural Language Processing, and Deep Learning are considered as well. - Extensive hands-on experience building and deploying products using Machine Learning. Specifically, we are looking for expertise in … Language Processing, Large Language Models, Information Retrieval, and Recommendation Systems with a good understanding of techniques like Differential Privacy, Responsible AI and related areas. - High proficiency in deploying machine learning applications at scale in real production environments and proven track record of successfully shipping applied research to production is a must - Excellent problem solving and data analysis skills and More ❯
Employment Type: Permanent
Salary: GBP Annual

Inkfish Research Scientist (Medical) in Large Language Models

London, United Kingdom
EURAXESS Czech Republic
deadlines Understanding of digital health platforms Ability or potential to contribute to the development of funding proposals in order to generate external funding to support research projects Knowledge of reinforcement learning techniques, particularly Reinforcement Learning with Human Feedback (RLHF), as applied to LLMs Awareness of regulatory frameworks in healthcare research, such as GDPR, HIPAA, or MHRA More ❯
Employment Type: Permanent
Salary: GBP Annual

Machine Learning Researcher

London, United Kingdom
NLP PEOPLE
and test your ideas with real-world data in an academic environment. Who are we looking for? The ideal candidate will have: • Either a post-graduate degree in machine learning or a related discipline, or commercial experience developing novel machine learning algorithms. We will also consider exceptional candidates with a proven record of success in online data science … competitions, such as Kaggle • Experience in one or more of deep learning, reinforcement learning, non-convex optimisation, Bayesian non-parametrics, NLP or approximate inference • Excellent reasoning skills and mathematical ability are crucial: off-the-shelf methods don't always work on our data so you will need to understand how to develop your own models • Strong programming … work scheme • Monthly company events. Company: G-Research Qualifications: Language requirements: Specific requirements: Educational level: Level of experience (years): Senior (5+ years of experience) Tagged as: Industry, Machine Learning, NLP, United Kingdom More ❯
Employment Type: Permanent
Salary: GBP Annual

Member of Technical Staff

London, United Kingdom
Microsoft
design for AI, prompt engineering methodologies, and AI systems design. Demonstrated experience in one or more of the following areas: prompt engineering, experimental design, language model evaluations, fine tuning, reinforcement learning/direct preference optimization, data curation, and classic machine learning principles. Required/Minimum Qualifications Bachelor's Degree in Computer Science, or related technical discipline AND … open source contributions, and/or on-the-job work experience. Deeper expertise in one or more parts of the AI stack, including prompt engineering, pre-training, fine-tuning, reinforcement learning and direct preference optimization, data curation, LLM inference, orchestration, evaluation pipelines, and deployment. Additional or Preferred Qualifications Bachelor's/Master's Degree in Computer Science or … AI and its deployment. Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers. Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI. Proven ability to collaborate and contribute to a positive, inclusive work environment More ❯
Employment Type: Permanent
Salary: GBP Annual

Lead Technologist - Artificial Intelligence & ML

Chelmsford, Essex, South East, United Kingdom
Hybrid / WFH Options
Anson Mccade
Lead Technologist - Artificial Intelligence & ML £85,000 GBP Hybrid WORKING Location: Chelmsford, East of England - United Kingdom Type: Permanent Lead AI & Machine Learning Technologist Location: Great Baddow | Hybrid (2 days onsite) | Salary up to £85,000 + 10% bonus | SC Clearable A leading defence technology organisation is seeking a Lead AI & Machine Learning Technologist to join their expanding … Data and Decision Support capability. This role offers the opportunity to lead complex AI research projects spanning reinforcement learning, NLP and large language models, knowledge graphs, and AI applications in radar, sonar, and remote sensing. About the Opportunity The successful candidate will work within a 35-strong AI/ML team comprising approximately 25 scientists and 10 product … novel research opportunities, fostering collaborations with academic and industry partners Contribute to high-profile research proposals (e.g., Dstl, EPSRC, NS) About You PhD in a relevant field (AI, Machine Learning, Computer Science, Mathematics or related discipline) Around 10+ years' experience in AI/ML research and development in industry or academia, preferably within the defence sector Proven leadership experience More ❯
Employment Type: Permanent, Work From Home

Research Engineer / Scientist, Alignment Science, London

London, United Kingdom
Hybrid / WFH Options
Menlo Ventures
of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You want to build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems. You care about making AI helpful, honest, and harmless, and are interested in the ways that … Testing the robustness of our safety techniques by training language models to subvert our safety techniques, and seeing how effective they are at subverting our interventions. Run multi-agent reinforcement learning experiments to test out techniques like AI Debate. Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks. Write scripts and prompts to efficiently produce … efforts Pick up slack, even if it goes outside your job description Care about the impacts of AI Strong candidates may also: Have experience authoring research papers in machine learning, NLP, or AI safety Have experience with LLMs Have experience with reinforcement learning Have experience with Kubernetes clusters and complex shared codebases Candidates need not have More ❯
Employment Type: Permanent
Salary: GBP Annual

Research Scientist, LLM Agents (Foundational Research)

London, United Kingdom
Hybrid / WFH Options
Thomson Reuters
Are you a curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex agent-based AI systems in a data-rich, complex academic environment driven by real-world problems. Foundational Research is the dedicated core Machine Learning research … scientists who participate in designing, coding, conducting experiments, translating findings into concrete deliverables and engaging with the academic community. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning Alignment, Distributed Training, Efficient ML techniques) Post-training techniques for planning, reasoning & complex workflows (e.g., Reasoning Models, LLMs + Knowledge Graphs, Test time compute, CoT pipelines, Tool … use & API calling, etc.) Data-centric Machine Learning (Synthetic Data, Curriculum Learning, Learned data mixtures, etc.) Evaluation (Benchmarks, Human-in-the-loop, red teaming/Adversarial Testing, Hallucination detection) We work collaboratively both with TR Labs (TR's applied research division), academic partners at world-leading research institutions and subject matter experts with decades of experience. We experiment, prototype More ❯
Employment Type: Permanent
Salary: GBP Annual

Reinforcement Learning
10th Percentile: £75,000
25th Percentile: £86,250
Median: £100,000
75th Percentile: £175,000