arXiv:2510.09712v1 Announce Type: cross Abstract: The spread of fake news online distorts public judgment and erodes trust in social media platforms. Although recent fake news detection (FND) models perform well in standard settings, they remain vulnerable to adversarial comments, authored by real users or by large language models (LLMs), that subtly shift model decisions. In view of this, we first present a comprehensive evaluation of comment attacks on existing fake news detectors and then introduce a group-adaptive adversarial training strategy to improve the robustness of FND models. Specifically, our approach comprises three steps: (1) dividing adversarial comments into three psychologically grounded categories: perceptual, cognitive, and societal; (2) generating diverse, category-specific attacks via LLMs to enhance adversarial training; and (3) applying a Dirichlet-based adaptive sampling mechanism (InfoDirichlet Adjusting Mechanism) that dynamically adjusts the learning focus across different comment categories during training. Experiments on benchmark datasets show that our method maintains strong detection accuracy while substantially increasing robustness to a wide range of adversarial comment perturbations.
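To make the third step concrete, the sketch below illustrates one plausible form of Dirichlet-based adaptive sampling over the three comment categories. The class name, category labels, and the loss-driven concentration update are assumptions for illustration; the abstract does not specify the internals of the InfoDirichlet Adjusting Mechanism.

```python
# Hypothetical sketch of Dirichlet-based adaptive category sampling.
# Category names and the loss-driven update rule are assumptions, not the
# paper's actual InfoDirichlet Adjusting Mechanism.
import numpy as np

CATEGORIES = ["perceptual", "cognitive", "societal"]  # assumed ordering

class DirichletCategorySampler:
    def __init__(self, n_categories=3, base_concentration=1.0, lr=0.1):
        # Concentration parameters of a Dirichlet distribution over categories.
        self.alpha = np.full(n_categories, base_concentration, dtype=float)
        self.lr = lr

    def sample_proportions(self, rng=None):
        # Draw the category mixing proportions for the next adversarial batch.
        rng = rng or np.random.default_rng()
        return rng.dirichlet(self.alpha)

    def update(self, per_category_loss):
        # Shift concentration mass toward categories with higher loss, so
        # harder comment types receive more training focus (assumed rule).
        loss = np.asarray(per_category_loss, dtype=float)
        weights = loss / (loss.sum() + 1e-8)
        self.alpha = np.maximum(self.alpha + self.lr * weights, 1e-3)

# Usage: sample proportions per step, assemble the adversarial batch from the
# three comment categories accordingly, then feed back per-category losses.
sampler = DirichletCategorySampler()
props = sampler.sample_proportions()              # e.g. [0.2, 0.5, 0.3]
sampler.update(per_category_loss=[0.8, 1.4, 0.6])
```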
