metacognition, greater autonomy, deeper understanding, improved transfer, and more durable skills. This role sits at the intersection of learning science, cognitive science, experimental design, LLM evaluation, and applied product research. You will help develop cognitive outcome measures, design and manage RCTs and field studies, build classifiers and graders, guide external … outputs, reason about classifier and grader performance, and collaborate effectively with data scientists, engineers, and research teams. Understand the practical strengths and limitations of LLM‐based evaluation methods, including model‐as‐judge systems, rubric design, validation, calibration, inter‐rater reliability, and precision/recall tradeoffs. Help design, launch, and manage ...