Trending
Articles related to "Multimodal Large Language Models"
GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding
cs.AI updates on arXiv.org 2025-11-05T05:27:17.000000Z
FOCUS: Efficient Keyframe Selection for Long Video Understanding
cs.AI updates on arXiv.org 2025-11-03T05:19:29.000000Z
MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models
cs.AI updates on arXiv.org 2025-11-03T05:19:18.000000Z
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
cs.AI updates on arXiv.org 2025-11-03T05:18:20.000000Z
Perception, Understanding and Reasoning, A Multimodal Benchmark for Video Fake News Detection
cs.AI updates on arXiv.org 2025-10-30T04:15:42.000000Z
Surpassing NVIDIA's Describe Anything! Chinese Academy of Sciences and ByteDance jointly propose "GAR", a building block for DeepSeek-OCR
量子位 2025-10-28T08:21:24.000000Z
Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
cs.AI updates on arXiv.org 2025-10-28T04:06:32.000000Z
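"Pooling Wisdom for Security, Insight into the Future": Highlights from the Discipline Week Forum of Fudan University's College of Computing and Intelligence Innovation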
复旦白泽战队 2025-10-27T13:59:14.000000Z
REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring
cs.AI updates on arXiv.org 2025-10-27T06:26:41.000000Z
Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
cs.AI updates on arXiv.org 2025-10-24T04:54:20.000000Z
ByteDance Seed team launches Seed3D 1.0, a large model for 3D generation
界面快报 2025-10-23T07:31:37.000000Z
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
cs.AI updates on arXiv.org 2025-10-22T04:26:09.000000Z
ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
cs.AI updates on arXiv.org 2025-10-21T04:14:50.000000Z
IDEA proposes Rex-Omni: recasting object detection as "next point prediction", with zero-shot performance surpassing DINO
我爱计算机视觉 2025-10-20T14:55:11.000000Z
NeurIPS 2025 | Shanghai Jiao Tong University proposes MM-UPT: an "unsupervised post-training" paradigm for multimodal large models
PaperWeekly 2025-10-19T08:34:28.000000Z
NeurIPS 2025 | Breaking closed-source multimodal large models: a novel adversarial attack method based on optimal feature alignment
机器之心 2025-10-17T13:34:39.000000Z
Breaking closed-source multimodal large models: a novel adversarial attack method based on optimal feature alignment
36氪 - AI-related articles 2025-10-17T09:42:53.000000Z