Reinforcement Learning to Train Large Language Models to Explain Human Decisions

a year ago

Explores using reinforcement learning to train large language models (LLMs) to explain human decisions.
Aims to develop dual-purpose cognitive models for both prediction and interpretable explanation.
Uses outcome-based rewards to generate explicit reasoning traces for human risky choices.
Demonstrates high-quality explanations and strong quantitative predictions of human decisions.

Hasty Briefsbeta