热点
关于我们
xx
xx
"
沙袋策略
" 相关文章
Sandbagging in a Simple Survival Bandit Problem
cs.AI updates on arXiv.org
2025-10-01T06:01:39.000000Z
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
cs.AI updates on arXiv.org
2025-08-05T11:28:48.000000Z