热点
"复合任务" 相关文章
TPS-Bench: Evaluating AI Agents' Tool Planning \& Scheduling Abilities in Compounding Tasks
cs.AI updates on arXiv.org 2025-11-05T05:15:42.000000Z
How Do Language Models Compose Functions?
cs.AI updates on arXiv.org 2025-10-03T04:17:12.000000Z
Evaluating Multimodal Large Language Models with Daily Composite Tasks in Home Environments
cs.AI updates on arXiv.org 2025-09-23T05:23:20.000000Z
手机AGI助手还有多远?移动智能体复合长程任务测试基准与调度系统发布
机器之心 2025-07-26T17:28:56.000000Z