热点
"防御框架" 相关文章
NTU等联合提出A-MemGuard:为AI记忆上锁,投毒攻击成功率暴降95%
36kr-科技 2025-10-16T05:37:57.000000Z
Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks
cs.AI updates on arXiv.org 2025-10-03T04:13:52.000000Z
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
cs.AI updates on arXiv.org 2025-09-30T04:06:32.000000Z
Activation Steering Meets Preference Optimization: Defense Against Jailbreaks in Vision Language Models
cs.AI updates on arXiv.org 2025-09-03T04:17:03.000000Z
A Vision-Language Pre-training Model-Guided Approach for Mitigating Backdoor Attacks in Federated Learning
cs.AI updates on arXiv.org 2025-08-15T04:18:53.000000Z
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
cs.AI updates on arXiv.org 2025-08-06T04:01:55.000000Z
Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments
cs.AI updates on arXiv.org 2025-07-25T04:28:54.000000Z
Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning
cs.AI updates on arXiv.org 2025-07-08T06:58:20.000000Z
RobustRAG: A Unique Defense Framework Developed for Opposing Retrieval Corruption Attacks in Retrieval-Augmented Generation (RAG) Systems
MarkTechPost@AI 2024-06-01T18:31:00.000000Z