热点
"视频问答" 相关文章
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
cs.AI updates on arXiv.org 2025-10-09T04:14:51.000000Z
Video Panels for Long Video Understanding
cs.AI updates on arXiv.org 2025-09-30T04:05:17.000000Z
MovieCORE: COgnitive REasoning in Movies
cs.AI updates on arXiv.org 2025-09-19T05:04:05.000000Z
Bridging Vision Language Models and Symbolic Grounding for Video Question Answering
cs.AI updates on arXiv.org 2025-09-16T05:43:35.000000Z
ICML25 视频问答中以语言为中心的结构化推理
哔哩哔哩技术 2025-09-12T15:26:10.000000Z
EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering
cs.AI updates on arXiv.org 2025-08-15T04:18:40.000000Z
VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding
cs.AI updates on arXiv.org 2025-08-12T04:39:06.000000Z
ICML25 视频问答中以语言为中心的结构化推理
哔哩哔哩技术 2025-08-11T08:59:23.000000Z
ICML25 视频问答中以语言为中心的结构化推理
掘金 人工智能 2025-08-08T04:35:40.000000Z
LeAdQA: LLM-Driven Context-Aware Temporal Grounding for Video Question Answering
cs.AI updates on arXiv.org 2025-07-22T04:44:42.000000Z
Team of One: Cracking Complex Video QA with Model Synergy
cs.AI updates on arXiv.org 2025-07-21T04:06:51.000000Z
FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering
cs.AI updates on arXiv.org 2025-07-18T04:14:02.000000Z
如今的智能体,已经像人一样「浏览」视频了,国内就有
机器之心 2024-11-22T06:10:07.000000Z