热点
"Inoculation Prompting" 相关文章
AI Safety at the Frontier: Paper Highlights of October 2025
少点错误 2025-11-05T13:49:15.000000Z
Inoculation prompting: Instructing models to misbehave at train-time can improve run-time behavior
少点错误 2025-10-08T22:19:53.000000Z
Inoculation prompting: Instructing models to misbehave at train-time can improve run-time behavior
少点错误 2025-10-08T22:19:53.000000Z