跨模态学习_Fishai

热点

"跨模态学习" 相关文章

Multi-modal Co-learning for Earth Observation: Enhancing single-modality models via modality collaboration

cs.AI updates on arXiv.org 2025-10-23T04:21:32.000000Z

MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention

cs.AI updates on arXiv.org 2025-10-21T04:29:48.000000Z

XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

cs.AI updates on arXiv.org 2025-10-20T04:12:11.000000Z

SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation

cs.AI updates on arXiv.org 2025-10-14T04:08:31.000000Z

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes

cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes

cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes

cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z

Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation

cs.AI updates on arXiv.org 2025-10-08T04:15:02.000000Z

Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation

cs.AI updates on arXiv.org 2025-10-08T04:15:02.000000Z

SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP

cs.AI updates on arXiv.org 2025-10-01T06:01:23.000000Z

Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios

cs.AI updates on arXiv.org 2025-09-30T04:05:45.000000Z

TS-P$^2$CL: Plug-and-Play Dual Contrastive Learning for Vision-Guided Medical Time Series Classification

cs.AI updates on arXiv.org 2025-09-23T06:08:57.000000Z

Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration

cs.AI updates on arXiv.org 2025-09-22T04:43:53.000000Z

Abn-BLIP: Abnormality-aligned Bootstrapping Language-Image Pre-training for Pulmonary Embolism Diagnosis and Report Generation from CTPA

cs.AI updates on arXiv.org 2025-09-16T05:47:06.000000Z

D-CAT: Decoupled Cross-Attention Transfer between Sensor Modalities for Unimodal Inference

cs.AI updates on arXiv.org 2025-09-15T08:22:24.000000Z

Nature子刊多篇文章速览: 大模型赋能的科学发现

集智俱乐部 2025-09-12T02:36:07.000000Z

The Transparent Earth: A Multimodal Foundation Model for the Earth's Subsurface

cs.AI updates on arXiv.org 2025-09-04T05:58:59.000000Z

Continual Learning for Multimodal Data Fusion of a Soft Gripper

cs.AI updates on arXiv.org 2025-08-22T04:02:24.000000Z

SPANER: Shared Prompt Aligner for Multimodal Semantic Representation

cs.AI updates on arXiv.org 2025-08-20T04:16:53.000000Z

A Cross-Modal Rumor Detection Scheme via Contrastive Learning by Exploring Text and Image internal Correlations

cs.AI updates on arXiv.org 2025-08-18T04:21:33.000000Z

Copyright © 2019 FISHAI.All Rights Reserved