MAFR: A Multimodal Fusion Framework for Industrial Anomaly Detection

cs.AI updates on arXiv.org, October 28, 12:09

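This paper proposes MAFR, a multimodal fusion framework for industrial anomaly detection. By fusing RGB images and point cloud data, it localises anomalies precisely and achieves strong results on the MVTec 3D-AD and Eyecandies benchmarks.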

arXiv:2510.21793v1 (announce type: cross)

Abstract: Industrial anomaly detection (IAD) increasingly benefits from integrating 2D and 3D data, but robust cross-modal fusion remains challenging. We propose a novel unsupervised framework, Multi-Modal Attention-Driven Fusion Restoration (MAFR), which synthesises a unified latent space from RGB images and point clouds using a shared fusion encoder, followed by attention-guided, modality-specific decoders. Anomalies are localised by measuring reconstruction errors between input features and their restored counterparts. Evaluations on the MVTec 3D-AD and Eyecandies benchmarks demonstrate that MAFR achieves state-of-the-art results, with a mean I-AUROC of 0.972 and 0.901, respectively. The framework also exhibits strong performance in few-shot learning settings, and ablation studies confirm the critical roles of the fusion architecture and composite loss. MAFR offers a principled approach for fusing visual and geometric information, advancing the robustness and accuracy of industrial anomaly detection. Code is available at https://github.com/adabrh/MAFR
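The abstract says anomalies are localised by measuring reconstruction errors between input features and their restored counterparts. The following is a minimal sketch of that general idea, not the authors' implementation. The per-location Euclidean distance, the weighted sum across the two modalities, the max-based image-level score, and all function and parameter names (`location_errors`, `anomaly_map`, `image_score`, `w_rgb`, `w_pc`) are assumptions made for illustration; the actual encoder, decoders, and scoring rule are defined in the paper and its repository.

```python
import math


def location_errors(features, restored):
    """Euclidean distance between each input feature vector and its restored counterpart."""
    return [math.dist(f, r) for f, r in zip(features, restored)]


def anomaly_map(rgb_in, rgb_out, pc_in, pc_out, w_rgb=1.0, w_pc=1.0):
    """Combine per-location reconstruction errors from both modalities.

    Assumption: a simple weighted sum; the paper may combine errors differently.
    """
    rgb_err = location_errors(rgb_in, rgb_out)
    pc_err = location_errors(pc_in, pc_out)
    return [w_rgb * a + w_pc * b for a, b in zip(rgb_err, pc_err)]


def image_score(amap):
    """Assumption: the image-level score is the largest per-location error."""
    return max(amap)


# Toy data: three spatial locations, with made-up feature vectors.
rgb_in = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
rgb_out = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]  # third location poorly restored
pc_in = [[0.5], [0.5], [0.5]]
pc_out = [[0.5], [0.5], [2.5]]  # third location poorly restored

amap = anomaly_map(rgb_in, rgb_out, pc_in, pc_out)
score = image_score(amap)
print(amap, score)
```

In this toy input only the third location is restored badly in both modalities, so it receives the largest error and determines the image-level score. Because the model is trained without anomalous samples, the intuition is that defective regions are harder to restore and therefore stand out in the error map.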


Tags: industrial anomaly detection, multimodal fusion, MAFR framework, RGB images, point cloud data
