4 of 4 Permanent Reinforcement Learning Jobs in the West Midlands

Applied Data Scientist

Hiring Organisation
Change-IT Consulting Ltd
Location
Birmingham, West Midlands, United Kingdom
Employment Type
Permanent, Work From Home
faster and that genuinely meet user needs at national scale. You'll play a key role in exploring complex datasets, building production-ready machine learning and generative AI solutions, and working closely with multidisciplinary teams to translate real user problems into impactful AI capabilities. Key responsibilities include: Exploring, analysing … interpreting large, complex and diverse datasets to uncover insights and opportunities for AI-driven improvement. Designing, building, evaluating and optimising machine learning, deep learning and generative AI models for real-world service applications. Collaborating with engineers, product managers, designers and policy stakeholders to translate user needs into scalable ...

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
Telford, Shropshire, UK
Employment Type
Full-time
overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
Stoke-on-Trent, Staffordshire, UK
Employment Type
Full-time
overall performance metrics Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies Optional - Demonstrated ability to do independent research in Academic ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
Telford, Shropshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...