热点
"元评估" 相关文章
Feeding Two Birds or Favoring One? Adequacy-Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation
cs.AI updates on arXiv.org 2025-09-25T06:02:26.000000Z
LaajMeter: A Framework for LaaJ Evaluation
cs.AI updates on arXiv.org 2025-08-15T04:18:46.000000Z