Articles related to "multi-turn reasoning"
MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL
cs.AI updates on arXiv.org
2025-10-30T04:13:15.000000Z
The First Multi-Turn LLM Router Arrives: Router-R1 Teaches Large Models to "Think, Route, Aggregate"
机器之心
2025-10-15T16:11:54.000000Z
TriMediQ: A Triplet-Structured Approach for Interactive Medical Question Answering
cs.AI updates on arXiv.org
2025-10-07T04:15:05.000000Z
Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
cs.AI updates on arXiv.org
2025-09-17T04:46:33.000000Z
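Beyond Sample-Level RL! Renmin University × Kuaishou Propose ARPO: Entropy-Driven Agent Exploration Sends Multi-Turn Reasoning Performance Soaring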
PaperWeekly
2025-08-11T14:41:47.000000Z
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
cs.AI updates on arXiv.org
2025-07-22T04:44:27.000000Z
Think Before You Act: Is "THINK TWICE" the Secret to Stronger Large-Model Reasoning?
机器之心
2025-04-05T07:57:03.000000Z