Trending
Articles related to "multimodal information extraction"
NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
cs.AI updates on arXiv.org 2025-08-21T04:04:32.000000Z