14
2025/11
NeurIPS 2025 perfect-score paper: the ceiling of LLM reinforcement learning is already locked in by the base model
Paper title: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LL
...
Xiaohongshu releases RedOne 2.0: a practical guide to post-training LLMs for the SNS domain
Paper title: RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Netw
...