Meta-Reasoning_Fishai

Articles related to "meta-reasoning"

LLMs as Strategic Agents: Beliefs, Best Response Behavior, and Emergent Heuristics

cs.AI updates on arXiv.org 2025-10-14T04:09:34.000000Z

Searching Meta Reasoning Skeleton to Guide LLM Reasoning

cs.AI updates on arXiv.org 2025-10-07T04:07:50.000000Z

LLMs cannot spot math errors, even when allowed to peek into the solution

cs.AI updates on arXiv.org 2025-09-03T04:17:29.000000Z

RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

cs.AI updates on arXiv.org 2025-07-31T04:48:18.000000Z

'Meta', 'mesa', and mountains

LessWrong 2024-10-31T17:38:00.000000Z

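Microsoft Research's MRP: Meta-Reasoning Prompting lets LLMs dynamically select the best problem-solving strategy, and is more effective than CoT and ToT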

PaperAgent 2024-07-02T03:35:11.000000Z

Copyright © 2019 FISHAI. All Rights Reserved