2026/01/22

JudgeRLVR: Judge First, Then Generate: Breaking the Efficiency Paradox of "Long Chain-of-Thought" in Reasoning Models

Paper title: JudgeRLVR: Judge First, Generate Second for Efficient Reasoning. Paper link: ...

2 months ago
