22
2026/01
JudgeRLVR:先判断后生成——打破推理模型“长思维链”的效率悖论
论文标题:JudgeRLVR: Judge First, Generate Second for Efficient Reasoning
论文链接:
...