Trending
Articles related to "cross-modal interaction"
CMI-MTL: Cross-Mamba interaction based multi-task learning for medical visual question answering
cs.AI updates on arXiv.org 2025-11-05T05:30:27.000000Z
Designing multisensory experiences: Accessibility through the senses
UX Collective 🇧🇷 - Medium 2025-10-14T05:15:33.000000Z
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
cs.AI updates on arXiv.org 2025-10-06T04:21:09.000000Z
MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech
cs.AI updates on arXiv.org 2025-09-30T04:07:52.000000Z
Explaining multimodal LLMs via intra-modal token interactions
cs.AI updates on arXiv.org 2025-09-29T04:16:20.000000Z
UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration
cs.AI updates on arXiv.org 2025-09-29T04:10:32.000000Z
CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ Segmentation
cs.AI updates on arXiv.org 2025-07-08T06:58:46.000000Z
SenseTime Chairman and CEO Xu Li: The interaction revolution brought by multimodal models is emerging | Voices of AI Leaders 2025
深度财经头条 (In-Depth Finance Headlines) 2025-01-15T11:35:20.000000Z