热点
"跨模态学习" 相关文章
Multi-modal Co-learning for Earth Observation: Enhancing single-modality models via modality collaboration
cs.AI updates on arXiv.org 2025-10-23T04:21:32.000000Z
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
cs.AI updates on arXiv.org 2025-10-21T04:29:48.000000Z
XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models
cs.AI updates on arXiv.org 2025-10-20T04:12:11.000000Z
SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation
cs.AI updates on arXiv.org 2025-10-14T04:08:31.000000Z
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
cs.AI updates on arXiv.org 2025-10-13T04:11:41.000000Z
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
cs.AI updates on arXiv.org 2025-10-08T04:15:02.000000Z
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
cs.AI updates on arXiv.org 2025-10-08T04:15:02.000000Z
SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP
cs.AI updates on arXiv.org 2025-10-01T06:01:23.000000Z
Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios
cs.AI updates on arXiv.org 2025-09-30T04:05:45.000000Z
TS-P$^2$CL: Plug-and-Play Dual Contrastive Learning for Vision-Guided Medical Time Series Classification
cs.AI updates on arXiv.org 2025-09-23T06:08:57.000000Z
Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration
cs.AI updates on arXiv.org 2025-09-22T04:43:53.000000Z
Abn-BLIP: Abnormality-aligned Bootstrapping Language-Image Pre-training for Pulmonary Embolism Diagnosis and Report Generation from CTPA
cs.AI updates on arXiv.org 2025-09-16T05:47:06.000000Z
D-CAT: Decoupled Cross-Attention Transfer between Sensor Modalities for Unimodal Inference
cs.AI updates on arXiv.org 2025-09-15T08:22:24.000000Z
Nature子刊多篇文章速览: 大模型赋能的科学发现
集智俱乐部 2025-09-12T02:36:07.000000Z
The Transparent Earth: A Multimodal Foundation Model for the Earth's Subsurface
cs.AI updates on arXiv.org 2025-09-04T05:58:59.000000Z
Continual Learning for Multimodal Data Fusion of a Soft Gripper
cs.AI updates on arXiv.org 2025-08-22T04:02:24.000000Z
SPANER: Shared Prompt Aligner for Multimodal Semantic Representation
cs.AI updates on arXiv.org 2025-08-20T04:16:53.000000Z
A Cross-Modal Rumor Detection Scheme via Contrastive Learning by Exploring Text and Image internal Correlations
cs.AI updates on arXiv.org 2025-08-18T04:21:33.000000Z