Trending
Articles related to "text evaluation"
Interpreting LLM-as-a-Judge Policies via Verifiable Global Explanations
cs.AI updates on arXiv.org 2025-10-10T04:16:02.000000Z
CRACQ: A Multi-Dimensional Approach To Automated Document Assessment
cs.AI updates on arXiv.org 2025-10-06T04:22:52.000000Z
IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian
cs.AI updates on arXiv.org 2025-07-31T04:48:00.000000Z
Letting an LLM Be the Judge | Basic Concepts
BAAI Community (智源社区) 2025-01-12T03:53:11.000000Z