热点
"多模态数据集" 相关文章
Aeolus: A Multi-structural Flight Delay Dataset
cs.AI updates on arXiv.org 2025-10-31T04:08:50.000000Z
RadDiagSeg-M: A Vision Language Model for Joint Diagnosis and Multi-Target Segmentation in Radiology
cs.AI updates on arXiv.org 2025-10-22T04:20:34.000000Z
Valeo Near-Field: a novel dataset for pedestrian intent detection
cs.AI updates on arXiv.org 2025-10-20T04:14:26.000000Z
World's largest open-source multimodal dataset delivers 17x training efficiency, unlocking enterprise AI that connects documents, audio and video
VentureBeat 2025-10-17T13:25:56.000000Z
GLOFNet -- A Multimodal Dataset for GLOF Monitoring and Prediction
cs.AI updates on arXiv.org 2025-10-14T04:18:49.000000Z
全球首个真实世界具身多模态数据集,它石智航交卷,比特斯拉还早6个月
量子位 2025-10-11T08:29:41.000000Z
全球首个真实世界具身多模态数据集,它石智航交卷,比特斯拉还早6个月
量子位 2025-10-11T08:29:41.000000Z
FinMR: A Knowledge-Intensive Multimodal Benchmark for Advanced Financial Reasoning
cs.AI updates on arXiv.org 2025-10-10T04:07:13.000000Z
PHORECAST: Enabling AI Understanding of Public Health Outreach Across Populations
cs.AI updates on arXiv.org 2025-10-06T04:27:09.000000Z
Benchmarking Foundation Models with Retrieval-Augmented Generation in Olympic-Level Physics Problem Solving
cs.AI updates on arXiv.org 2025-10-02T04:18:39.000000Z
Liaohe-CobotMagic-PnP: an Imitation Learning Dataset of Intelligent Robot for Industrial Applications
cs.AI updates on arXiv.org 2025-09-30T04:04:05.000000Z
The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs
cs.AI updates on arXiv.org 2025-09-18T04:25:35.000000Z
PianoVAM: A Multimodal Piano Performance Dataset
cs.AI updates on arXiv.org 2025-09-11T15:51:45.000000Z
Hugging Face Open-Sourced FineVision: A New Multimodal Dataset with 24 Million Samples for Training Vision-Language Models (VLMs)
MarkTechPost@AI 2025-09-06T08:10:41.000000Z
CLARE: Cognitive Load Assessment in REaltime with Multimodal Data
cs.AI updates on arXiv.org 2025-09-03T04:18:02.000000Z
MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance
cs.AI updates on arXiv.org 2025-08-15T04:18:18.000000Z
ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools
cs.AI updates on arXiv.org 2025-08-06T04:01:58.000000Z
Hydra-Bench: A Benchmark for Multi-Modal Leaf Wetness Sensing
cs.AI updates on arXiv.org 2025-07-31T04:48:16.000000Z
GAITEX: Human motion dataset from impaired gait and rehabilitation exercises of inertial and optical sensor data
cs.AI updates on arXiv.org 2025-07-30T04:12:09.000000Z
JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1
cs.AI updates on arXiv.org 2025-07-29T04:21:30.000000Z