热点
关于我们
xx
xx
"
难度评估
" 相关文章
Searching for Difficult-to-Translate Test Examples at Scale
cs.AI updates on arXiv.org
2025-10-01T06:02:06.000000Z
The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations
cs.AI updates on arXiv.org
2025-09-17T05:18:40.000000Z
MAB Optimizer for Estimating Math Question Difficulty via Inverse CV without NLP
cs.AI updates on arXiv.org
2025-09-03T04:17:59.000000Z
Project Patti: Why can You Solve Diabolical Puzzles on one Sudoku Website but not Easy Puzzles on another Sudoku Website?
cs.AI updates on arXiv.org
2025-07-30T04:11:47.000000Z
Closed-ended questions aren't as hard as you think
少点错误
2025-02-19T04:01:57.000000Z