10
2026/02
大语言模型强化微调中的熵动力学分析
论文标题:On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language
...