Defense Framework_Fishai

Articles related to "Defense Framework"

NTU and Collaborators Propose A-MemGuard: Putting a Lock on AI Memory, Poisoning Attack Success Rate Plunges 95%

36kr-Tech 2025-10-16T05:37:57.000000Z

Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks

cs.AI updates on arXiv.org 2025-10-03T04:13:52.000000Z

DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

cs.AI updates on arXiv.org 2025-09-30T04:06:32.000000Z

Activation Steering Meets Preference Optimization: Defense Against Jailbreaks in Vision Language Models

cs.AI updates on arXiv.org 2025-09-03T04:17:03.000000Z

A Vision-Language Pre-training Model-Guided Approach for Mitigating Backdoor Attacks in Federated Learning

cs.AI updates on arXiv.org 2025-08-15T04:18:53.000000Z

Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning

cs.AI updates on arXiv.org 2025-08-06T04:01:55.000000Z

Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments

cs.AI updates on arXiv.org 2025-07-25T04:28:54.000000Z

Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning

cs.AI updates on arXiv.org 2025-07-08T06:58:20.000000Z

RobustRAG: A Unique Defense Framework Developed for Opposing Retrieval Corruption Attacks in Retrieval-Augmented Generation (RAG) Systems

MarkTechPost@AI 2024-06-01T18:31:00.000000Z

Copyright © 2019 FISHAI. All Rights Reserved