15
2025/11
Meta AI Introduces RIFL: Rubric-Based Reinforcement Learning to Improve LLM Instruction Following
Paper title: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM
...