15

2025/11

Meta AI Introduces RIFL: Rubric-Based Reinforcement Learning to Improve LLM Instruction Following

Paper title: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM ...

5 months ago
