Trending
Articles related to "Multimodal Question Answering"
ICCV25 Highlight | DeepGlint's RICE model tops the leaderboards, letting AI "understand" every detail of an image
机器之心 (Synced) 2025-10-29T11:32:20.000000Z
RAG-Anything × Milvus: The era of RAG that needed 20 integrated tools to read a PDF is over!
Zilliz 2025-10-09T14:29:19.000000Z
EverydayMMQA: A Multilingual and Multimodal Framework for Culturally Grounded Spoken Visual QA
cs.AI updates on arXiv.org 2025-10-09T04:06:50.000000Z
Q-Mirror: Unlocking the Multi-Modal Potential of Scientific Text-Only QA Pairs
cs.AI updates on arXiv.org 2025-09-30T04:06:33.000000Z
MMAPG: A Training-Free Framework for Multimodal Multi-hop Question Answering via Adaptive Planning Graphs
cs.AI updates on arXiv.org 2025-09-22T04:53:28.000000Z
DB3 Team's Solution For Meta KDD Cup' 25
cs.AI updates on arXiv.org 2025-09-15T08:12:50.000000Z
QuesGenie: Intelligent Multimodal Question Generation
cs.AI updates on arXiv.org 2025-09-05T04:45:35.000000Z
A Curriculum Learning Approach to Reinforcement Learning: Leveraging RAG for Multimodal Question Answering
cs.AI updates on arXiv.org 2025-08-15T04:18:15.000000Z
ExpliCIT-QA: Explainable Code-Based Image Table Question Answering
cs.AI updates on arXiv.org 2025-07-17T04:14:27.000000Z
Cultivating Multimodal Intelligence: Interpretive Reasoning and Agentic RAG Approaches to Dermatological Diagnosis
cs.AI updates on arXiv.org 2025-07-09T04:01:25.000000Z