热点
"自动化评分" 相关文章
AI-generated Essays: Characteristics and Implications on Automated Scoring and Academic Integrity
cs.AI updates on arXiv.org 2025-10-17T04:19:40.000000Z
Webinar recap: Eval best practices
Braintrust Blog 2025-10-02T12:52:58.000000Z
谁是最强“打工AI”?OpenAI亲自测试,结果第一不是自己
36kr-科技 2025-09-26T12:03:46.000000Z
OpenAI Introduces GDPval: A New Evaluation Suite that Measures AI on Real-World Economically Valuable Tasks
MarkTechPost@AI 2025-09-25T20:44:44.000000Z
Using LLMs to identify features of personal and professional skills in an open-response situational judgment test
cs.AI updates on arXiv.org 2025-07-21T04:06:52.000000Z