2026/02/10

An Analysis of Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper title: On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language ...