Articles related to "Chart Understanding"
How can large models read charts accurately? Microsoft Research Asia teaches them to "see, act, and reason"
智源社区 2025-11-06T07:56:31.000000Z
From Charts to Code: A Hierarchical Benchmark for Multimodal Models
cs.AI updates on arXiv.org 2025-10-22T04:18:00.000000Z
EncQA: Benchmarking Vision-Language Models on Visual Encodings for Charts
machinelearning apple 2025-10-13T22:38:40.000000Z
ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering
cs.AI updates on arXiv.org 2025-10-07T04:09:11.000000Z
Five dimensions, four modalities: a panorama of LLM/Agent data analysis techniques
PaperAgent 2025-10-02T11:51:44.000000Z
ChartHal: A Fine-grained Framework Evaluating Hallucination of Large Vision Language Models in Chart Understanding
cs.AI updates on arXiv.org 2025-09-23T06:05:47.000000Z
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
Hugging Face 2025-09-11T19:36:58.000000Z
ICCV 2025 | ECD: a high-quality synthetic chart dataset that improves chart understanding in open-source MLLMs
机器之心 2025-08-22T07:31:28.000000Z
Chart-CoCa: Self-Improving Chart Understanding of Vision LMs via Code-Driven Synthesis and Candidate-Conditioned Answering
cs.AI updates on arXiv.org 2025-08-19T04:01:25.000000Z
In-Depth and In-Breadth: Pre-training Multimodal Language Models Customized for Comprehensive Chart Understanding
cs.AI updates on arXiv.org 2025-07-22T04:44:28.000000Z
On Pre-training of Multimodal Language Models Customized for Chart Understanding
cs.AI updates on arXiv.org 2025-07-21T04:06:48.000000Z
POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering
cs.AI updates on arXiv.org 2025-07-17T04:14:38.000000Z
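ICLR 2025 Oral | IDEA, together with Tsinghua and Peking University, proposes ChartMoE: exploring the representations and knowledge of diversely aligned MoE in downstream tasks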
机器之心 2025-04-01T08:05:59.000000Z
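Multimodal LLMs' visual reasoning is worrying: a Zhejiang University-led team builds a multimodal benchmark from GPT-4-synthesized data
36kr Tech 2024-08-08T07:51:35.000000Z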
From Diagrams to Solutions: MAVIS’s Three-Stage Framework for Mathematical AI
MarkTechPost@AI 2024-07-19T18:03:45.000000Z
ChartGemma: A Multimodal Model Instruction-Tuned on Data Generated Directly from a Diverse Range of Real-World Chart Images
MarkTechPost@AI 2024-07-16T16:31:23.000000Z
Decrypting Prompt Series 33. Chart Understanding Tasks for LLMs: Multimodal Edition
掘金 (Juejin) AI 2024-07-06T02:31:27.000000Z
CharXiv: A Comprehensive Evaluation Suite Advancing Multimodal Large Language Models Through Realistic Chart Understanding Benchmarks
MarkTechPost@AI 2024-06-29T04:01:35.000000Z