多模态数据集_Fishai

热点

"多模态数据集" 相关文章

Aeolus: A Multi-structural Flight Delay Dataset

cs.AI updates on arXiv.org 2025-10-31T04:08:50.000000Z

RadDiagSeg-M: A Vision Language Model for Joint Diagnosis and Multi-Target Segmentation in Radiology

cs.AI updates on arXiv.org 2025-10-22T04:20:34.000000Z

Valeo Near-Field: a novel dataset for pedestrian intent detection

cs.AI updates on arXiv.org 2025-10-20T04:14:26.000000Z

World's largest open-source multimodal dataset delivers 17x training efficiency, unlocking enterprise AI that connects documents, audio and video

VentureBeat 2025-10-17T13:25:56.000000Z

GLOFNet -- A Multimodal Dataset for GLOF Monitoring and Prediction

cs.AI updates on arXiv.org 2025-10-14T04:18:49.000000Z

全球首个真实世界具身多模态数据集，它石智航交卷，比特斯拉还早6个月

量子位 2025-10-11T08:29:41.000000Z

全球首个真实世界具身多模态数据集，它石智航交卷，比特斯拉还早6个月

量子位 2025-10-11T08:29:41.000000Z

FinMR: A Knowledge-Intensive Multimodal Benchmark for Advanced Financial Reasoning

cs.AI updates on arXiv.org 2025-10-10T04:07:13.000000Z

PHORECAST: Enabling AI Understanding of Public Health Outreach Across Populations

cs.AI updates on arXiv.org 2025-10-06T04:27:09.000000Z

Benchmarking Foundation Models with Retrieval-Augmented Generation in Olympic-Level Physics Problem Solving

cs.AI updates on arXiv.org 2025-10-02T04:18:39.000000Z

Liaohe-CobotMagic-PnP: an Imitation Learning Dataset of Intelligent Robot for Industrial Applications

cs.AI updates on arXiv.org 2025-09-30T04:04:05.000000Z

The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs

cs.AI updates on arXiv.org 2025-09-18T04:25:35.000000Z

PianoVAM: A Multimodal Piano Performance Dataset

cs.AI updates on arXiv.org 2025-09-11T15:51:45.000000Z

Hugging Face Open-Sourced FineVision: A New Multimodal Dataset with 24 Million Samples for Training Vision-Language Models (VLMs)

MarkTechPost@AI 2025-09-06T08:10:41.000000Z

CLARE: Cognitive Load Assessment in REaltime with Multimodal Data

cs.AI updates on arXiv.org 2025-09-03T04:18:02.000000Z

MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance

cs.AI updates on arXiv.org 2025-08-15T04:18:18.000000Z

ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools

cs.AI updates on arXiv.org 2025-08-06T04:01:58.000000Z

Hydra-Bench: A Benchmark for Multi-Modal Leaf Wetness Sensing

cs.AI updates on arXiv.org 2025-07-31T04:48:16.000000Z

GAITEX: Human motion dataset from impaired gait and rehabilitation exercises of inertial and optical sensor data

cs.AI updates on arXiv.org 2025-07-30T04:12:09.000000Z

JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1

cs.AI updates on arXiv.org 2025-07-29T04:21:30.000000Z

Copyright © 2019 FISHAI.All Rights Reserved