VisuoAlign：多模态安全对齐框架提升视觉语言模型安全

cs.AI updates on arXiv.org 10月21日 12:08

VisuoAlign：多模态安全对齐框架提升视觉语言模型安全

本文提出VisuoAlign，通过提示引导的树搜索实现多模态安全对齐，提升视觉语言模型安全性，有效对抗跨模态攻击。

arXiv:2510.15948v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal perception and generation, yet their safety alignment remains a critical challenge.Existing defenses and vulnerable to multimodal jailbreaks, as visual inputs introduce new attack surfaces, reasoning chains lack safety supervision, and alignment often degrades under modality fusion.To overcome these limitation, we propose VisuoAlign, a framework for multi-modal safety alignment via prompt-guided tree search.VisuoAlign embeds safety constrains into the reasoning process through visual-textual interactive prompts, employs Monte Carlo Tree Search(MCTS) to systematically construct diverse safety-critical prompt trajectories, and introduces prompt-based scaling to ensure real-time risk detection and compliant responses.Extensive experiments demonstrate that VisuoAlign proactively exposes risks, enables comprehensive dataset generation, and significantly improves the robustness of LVLMs against complex cross-modal threats.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

视觉语言模型多模态安全对齐安全防御跨模态攻击

相关文章

Top Important Computer Vision Papers for the Week from 29/04 to 05/05

THRONE: Advancing the Evaluation of Hallucinations in Vision-Language Models

Greg 录制了新的ChatGPT实时语音和多模态的演示。最后ChatGPT还即兴创作了一首短歌,歌词涵盖了房间的装饰风格、人物的穿着特点、期间发生的趣味插曲等。真的这...

和@歸藏一起视频会议看完 OpenAI 的发布，讨论了一会，背脊发凉… 1️⃣ 没想到卷推理卷到了这种程度? 现实交流场景下300ms 左右的体验奇点真没想到就这样被...

OpenAI 很鸡贼，提前一天开发布会，让 Google I/O 的气势弱了很多。再加上 Ilya 的官宣离职又分走了不少流量。果然今早一早起来，媒体的报道和用户的关注相比昨...

This AI newsletter is all you need #99

中信建投：OpenAI发布GPT-4o，AGI向前一步

XGen-MM: A Series of Large Multimodal Models (LMMS) Developed by Salesforce Al Research

周鸿祎：留给谷歌的时间不多了，建议把所有产品都开源

Google AI Introduces PaliGemma: A New Family of Vision Language Models