26

2026/03

微软新研究：Self-Distillation 会降低大模型的推理能力

让每一项优秀工作，被更多人看见：点击进入投稿通道论文标题：Why Does Self-Distillation (Sometimes) Degrad ...

9 小时前

6 0