风格字体攻击：社交媒体时代NLP模型的潜在威胁

cs.AI updates on arXiv.org 10月23日 12:21

风格字体攻击：社交媒体时代NLP模型的潜在威胁

本文探讨了社交媒体时代，用户使用风格字体和类似表情符号表达个性的现象，分析了其对自然语言处理模型带来的潜在威胁，并提出了基于风格的攻击方法SAD，实验表明SAD在情感分类和机器翻译任务中具有显著攻击效果。

arXiv:2510.19641v1 Announce Type: cross Abstract: With social media growth, users employ stylistic fonts and font-like emoji to express individuality, creating visually appealing text that remains human-readable. However, these fonts introduce hidden vulnerabilities in NLP models: while humans easily read stylistic text, models process these characters as distinct tokens, causing interference. We identify this human-model perception gap and propose a style-based attack, Style Attack Disguise (SAD). We design two sizes: light for query efficiency and strong for superior attack performance. Experiments on sentiment classification and machine translation across traditional models, LLMs, and commercial services demonstrate SAD's strong attack performance. We also show SAD's potential threats to multimodal tasks including text-to-image and text-to-speech generation.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

社交媒体 NLP模型风格字体攻击

相关文章

How popular is ChatGPT? Part 2: slower growth than Pokémon GO

当你不能欣赏你女朋友分享的东西或者照片时，你女朋友就会给别人分享

Innovating Neural Machine Translation with Arul Menezes - #458

沿着套路，深挖爱情：从《泪之女王》看韩剧变迁 | 编辑部聊天室

Growth Hacking Sports w/ Machine Learning with Noah Gift - TWiML Talk #158

腾讯控股：一季度视频号总用户使用时长同比增长超80%

New tool empowers users to fight online misinformation

药明生物：既没有人类基因组学业务，亦未在其任何业务中收集人类基因组数据

多邻国：做题家们的游戏乐园？| 编辑部聊天室