2 of 2 Remote Reinforcement Learning Jobs in Scotland

Data Annotation Specialist (Remote)

Hiring Organisation
Work Vista
Location
Aberdeen, UK
… clients, a leading global player in the AI space. This remote, part-time contract role is essential for training machine learning models. You will be responsible for analyzing, labeling, and improving text data to help the AI understand human language more accurately. Your work will directly influence … annotations). Key Responsibilities: Evaluate pairs of AI-generated responses and rank them based on specific criteria such as helpfulness, honesty, and harmlessness (RLHF: Reinforcement Learning from Human Feedback). Annotate text data by tagging it with relevant categories (e.g., sentiment analysis, topic classification, safety violations). Write …

AI Content Reviewer (Remote)

Hiring Organisation
Work Vista
Location
Edinburgh, UK
… final polish to ensure the AI sounds human and helpful. Rate and rank multiple model outputs to provide the "ground truth" data needed for Reinforcement Learning from Human Feedback (RLHF). Qualifications: Proven experience in content moderation, editing, or quality assurance (QA). Sharp analytical skills with the ability to detect …