Reinforcement Learning to Train Large Language Models to Explain Human Decisions
a year ago
- #Cognitive Modeling
- #Reinforcement Learning
- #Artificial Intelligence
- Explores using reinforcement learning to train large language models (LLMs) to explain human decisions.
- Aims to develop dual-purpose cognitive models for both prediction and interpretable explanation.
- Uses outcome-based rewards to generate explicit reasoning traces for human risky choices.
- Demonstrates high-quality explanations and strong quantitative predictions of human decisions.