热点
关于我们
xx
xx
"
复杂推理
" 相关文章
思而不学则殆:通义实验室×北大联合提出RL-PLUS,突破大模型推理边界
PaperWeekly
2025-10-27T12:24:00.000000Z
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees
cs.AI updates on arXiv.org
2025-10-27T06:31:17.000000Z
智能体系统如何「边做边学」?斯坦福团队探索在线优化的新范式
机器之心
2025-10-24T15:12:27.000000Z
智能体系统如何「边做边学」?斯坦福团队探索在线优化的新范式
机器之心
2025-10-24T15:12:27.000000Z
软件所提出基于信息论的大模型强化学习微调框架
oschina.net
2025-10-23T10:16:28.000000Z
Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning
cs.AI updates on arXiv.org
2025-10-23T04:23:13.000000Z
SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration
cs.AI updates on arXiv.org
2025-10-23T04:22:51.000000Z
Gemini 3.0 匿名上线
夕小瑶科技说
2025-10-21T14:53:53.000000Z
Budget-aware Test-time Scaling via Discriminative Verification
cs.AI updates on arXiv.org
2025-10-17T04:10:58.000000Z
IMAGINE: Integrating Multi-Agent System into One Model for Complex Reasoning and Planning
cs.AI updates on arXiv.org
2025-10-17T04:09:22.000000Z
MULTI: Multimodal Understanding Leaderboard with Text and Images
cs.AI updates on arXiv.org
2025-10-16T04:31:53.000000Z
What Makes Looped Transformers Perform Better Than Non-Recursive Ones (Provably)
cs.AI updates on arXiv.org
2025-10-14T04:17:45.000000Z
What Makes Looped Transformers Perform Better Than Non-Recursive Ones (Provably)
cs.AI updates on arXiv.org
2025-10-14T04:17:45.000000Z
蚂蚁开源万亿参数大模型Ling-1T:多项能力全球领先
掘金 人工智能
2025-10-09T23:51:30.000000Z
Samsung’s tiny AI model beats giant reasoning LLMs
AI News
2025-10-09T01:29:44.000000Z
Samsung’s tiny AI model beats giant reasoning LLMs
AI News
2025-10-09T01:29:44.000000Z
Samsung’s tiny AI model beats giant reasoning LLMs
AI News
2025-10-09T01:29:44.000000Z
What Can You Do When You Have Zero Rewards During RL?
cs.AI updates on arXiv.org
2025-10-07T04:16:03.000000Z
What Can You Do When You Have Zero Rewards During RL?
cs.AI updates on arXiv.org
2025-10-07T04:16:03.000000Z
Can an LLM Induce a Graph? Investigating Memory Drift and Context Length
cs.AI updates on arXiv.org
2025-10-07T04:15:19.000000Z