热点
"大规模评估" 相关文章
A Comprehensive Evaluation of Cognitive Biases in LLMs
cs.AI updates on arXiv.org 2025-11-05T05:31:45.000000Z
Can ChatGPT Code Communication Data Fairly?: Empirical Evidence from Multiple Collaborative Tasks
cs.AI updates on arXiv.org 2025-10-24T04:29:41.000000Z
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
cs.AI updates on arXiv.org 2025-09-30T04:04:43.000000Z
This AI Paper Introduces a Comprehensive Study on Large-Scale Model Merging Techniques
MarkTechPost@AI 2024-10-13T12:17:40.000000Z