热点
关于我们
xx
xx
"
元推理
" 相关文章
LLMs as Strategic Agents: Beliefs, Best Response Behavior, and Emergent Heuristics
cs.AI updates on arXiv.org
2025-10-14T04:09:34.000000Z
LLMs as Strategic Agents: Beliefs, Best Response Behavior, and Emergent Heuristics
cs.AI updates on arXiv.org
2025-10-14T04:09:34.000000Z
Searching Meta Reasoning Skeleton to Guide LLM Reasoning
cs.AI updates on arXiv.org
2025-10-07T04:07:50.000000Z
LLMs cannot spot math errors, even when allowed to peek into the solution
cs.AI updates on arXiv.org
2025-09-03T04:17:29.000000Z
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
cs.AI updates on arXiv.org
2025-07-31T04:48:18.000000Z
'Meta', 'mesa', and mountains
少点错误
2024-10-31T17:38:00.000000Z
微软研究院MRP:大模型动态选择最佳解题策略的元推理提示,比CoT、ToT更有效
PaperAgent
2024-07-02T03:35:11.000000Z