热点
"沙袋策略" 相关文章
Sandbagging in a Simple Survival Bandit Problem
cs.AI updates on arXiv.org 2025-10-01T06:01:39.000000Z
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
cs.AI updates on arXiv.org 2025-08-05T11:28:48.000000Z