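Neural Thermodynamic Laws for Large Language Model Training
a year ago
#Machine Learning #Thermodynamics #Large Language Models
https://arxiv.org/abs/2505.10559

Introduces Neural Thermodynamic Laws (NTL) as a framework for understanding the training dynamics of large language models.
Shows theoretically that thermodynamic quantities and principles emerge under the river-valley loss landscape assumption.
Derives practical guidelines for learning rate scheduling from a thermodynamic perspective.
Mentions the arXivLabs framework, a community-driven system for developing arXiv features.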