4 of 4 Reinforcement Learning Jobs in Yorkshire

Software Engineer - Large Language Models

Hiring Organisation
Fastino Labs
Location
Wakefield, West Yorkshire, UK
Employment Type
Full-time
overall performance metrics. Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories. Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards. Build robust … Required - Great velocity for building and shipping agents/AI products. Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies. Optional - Demonstrated ability to do independent research in Academic ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
Bradford, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
Doncaster, South Yorkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...

Lead AI Engineer

Hiring Organisation
Akixi
Location
Hull, East Yorkshire, UK
Employment Type
Full-time
similar conversational-AI platforms. Deep understanding of prompt engineering and fine-tuning of large language models. Strong grounding in ML concepts — supervised, unsupervised, and reinforcement learning. Familiarity with cloud AI/ML services (e.g. Azure Cognitive Services, AWS SageMaker, and/or GCP Vertex AI). Experience deploying ...