热点
"探索效率" 相关文章
Value of Information-Enhanced Exploration in Bootstrapped DQN
cs.AI updates on arXiv.org 2025-11-06T05:09:32.000000Z
清华、快手提出AttnRL:让大模型用「注意力」探索
机器之心 2025-10-21T14:51:01.000000Z
Experience-Driven Exploration for Efficient API-Free AI Agents
cs.AI updates on arXiv.org 2025-10-20T04:08:25.000000Z
Focused Skill Discovery: Learning to Control Specific State Variables while Minimizing Side Effects
cs.AI updates on arXiv.org 2025-10-07T04:17:48.000000Z
Focused Skill Discovery: Learning to Control Specific State Variables while Minimizing Side Effects
cs.AI updates on arXiv.org 2025-10-07T04:17:48.000000Z
EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration
cs.AI updates on arXiv.org 2025-07-15T04:24:29.000000Z
用动作分块突破RL极限,伯克利引入模仿学习,超越离线/在线SOTA
机器之心 2025-07-14T09:24:42.000000Z