Trending
Articles related to "Visual Grounding"
An Efficient Training Pipeline for Reasoning Graphical User Interface Agents
cs.AI updates on arXiv.org 2025-11-12T05:09:20.000000Z
Towards Understanding Visual Grounding in Visual Language Models
cs.AI updates on arXiv.org 2025-09-15T08:34:30.000000Z
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
cs.AI updates on arXiv.org 2025-09-08T04:52:07.000000Z
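Breaking the High-Resolution Image Reasoning Bottleneck: Fudan University and Nanyang Technological University Propose MGPO, a Multi-Turn Reinforcement Learning Framework Based on Visual Grounding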
机器之心 2025-07-24T09:01:21.000000Z
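Breaking the High-Resolution Image Reasoning Bottleneck: Fudan University and Nanyang Technological University Propose MGPO, a Multi-Turn Reinforcement Learning Framework Based on Visual Grounding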
机器之心 2025-07-21T10:33:53.000000Z
This AI Paper Introduces GRIT: A Method for Teaching MLLMs to Reason with Images by Interleaving Text and Visual Grounding
MarkTechPost@AI 2025-05-25T06:15:58.000000Z