热点
"图像-文本对" 相关文章
Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning
cs.AI updates on arXiv.org 2025-10-14T04:15:20.000000Z