"LVLMs" Related Articles
MemEIC: A Step Toward Continual and Compositional Knowledge Editing
cs.AI updates on arXiv.org
2025-10-31T04:04:10.000000Z
Process Reward Models for Sentence-Level Verification of LVLM Radiology Reports
cs.AI updates on arXiv.org
2025-10-28T04:14:35.000000Z
CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning
cs.AI updates on arXiv.org
2025-10-28T04:14:09.000000Z
Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
cs.AI updates on arXiv.org
2025-10-24T04:24:33.000000Z
Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models
cs.AI updates on arXiv.org
2025-10-20T04:13:59.000000Z
PatentVision: A multimodal method for drafting patent applications
cs.AI updates on arXiv.org
2025-10-14T04:14:42.000000Z
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
cs.AI updates on arXiv.org
2025-10-10T04:12:55.000000Z
ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations
cs.AI updates on arXiv.org
2025-10-09T04:06:24.000000Z
Multimodal Carotid Risk Stratification with Large Vision-Language Models: Benchmarking, Fine-Tuning, and Clinical Insights
cs.AI updates on arXiv.org
2025-10-06T04:27:55.000000Z
INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance
cs.AI updates on arXiv.org
2025-08-11T04:08:29.000000Z
MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing
cs.AI updates on arXiv.org
2025-08-05T17:08:29.000000Z
Self-Aware Safety Augmentation: Leveraging Internal Semantic Understanding to Enhance Safety in Vision-Language Models
cs.AI updates on arXiv.org
2025-07-30T04:11:58.000000Z
Zidong Taichu Open-Sources a Visual Neural Enhancement Method, a Plug-and-Play Fix for Multimodal Hallucinations | ACL 2025
智源社区
2025-06-28T14:02:55.000000Z
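Accepted to ICML 2025! Harvard Medical School and Others Introduce the World's First Clinical Graph-of-Thought Model for HIE, Improving Performance on Neurocognitive Outcome Prediction by 15%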
掘金 人工智能
2025-06-23T05:48:53.000000Z
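Accepted to ICML 2025! Harvard Medical School and Others Introduce the World's First Clinical Graph-of-Thought Model for HIE, Improving Performance on Neurocognitive Outcome Prediction by 15%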
智源社区
2025-06-23T04:17:45.000000Z
Letting Vision-Language Models Search and Write Code Hands-On Like o3! Visual ARFT Delivers Multimodal Agent Capabilities
机器之心
2025-05-27T07:20:30.000000Z
Teaching AI to Give Better Video Critiques
Unite.AI
2025-04-01T14:17:19.000000Z
The DeepSeek-R1 Wave Reaches Multimodal: Visual-RFT Released, Visual Task Performance Jumps 20%
PaperAgent
2025-03-13T12:01:47.000000Z
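Fine-Grained Alignment No Longer Needs Careful Annotation! Taotian Proposes Visual-Anchored Rewards, Achieving Multimodal Alignment Through Self-Calibration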
机器之心
2025-01-19T07:26:47.000000Z