热点
关于我们
xx
xx
"
质量评估
" 相关文章
Can Small and Reasoning Large Language Models Score Journal Articles for Research Quality and Do Averaging and Few-shot Help?
cs.AI updates on arXiv.org
2025-10-28T04:14:32.000000Z
B站游戏大模型翻译实践 —— 我们如何用LLM撑起全年百万字本地化翻译任务
哔哩哔哩技术
2025-10-27T16:14:26.000000Z
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
cs.AI updates on arXiv.org
2025-10-24T04:51:15.000000Z
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
cs.AI updates on arXiv.org
2025-10-24T04:51:15.000000Z
CreativityPrism: A Holistic Benchmark for Large Language Model Creativity
cs.AI updates on arXiv.org
2025-10-24T04:22:33.000000Z
北大团队提出数据质量评估新标准,破解无线感知领域合成数据质量难题
MIT 科技评论 - 本周热榜
2025-10-23T17:31:23.000000Z
北大团队提出数据质量评估新标准,破解无线感知领域合成数据质量难题
MIT 科技评论 - 本周热榜
2025-10-23T17:31:23.000000Z
DP$^2$O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution
cs.AI updates on arXiv.org
2025-10-22T04:25:48.000000Z
DP$^2$O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution
cs.AI updates on arXiv.org
2025-10-22T04:25:48.000000Z
Readers Prefer Outputs of AI Trained on Copyrighted Books over Expert Human Writers
cs.AI updates on arXiv.org
2025-10-21T04:12:55.000000Z
AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment
cs.AI updates on arXiv.org
2025-10-10T04:08:14.000000Z
Learning to Generate Unit Test via Adversarial Reinforcement Learning
cs.AI updates on arXiv.org
2025-10-01T06:03:00.000000Z
A Data-Centric Perspective on the Influence of Image Data Quality in Machine Learning Models
cs.AI updates on arXiv.org
2025-09-30T04:06:50.000000Z
Q-Mirror: Unlocking the Multi-Modal Potential of Scientific Text-Only QA Pairs
cs.AI updates on arXiv.org
2025-09-30T04:06:33.000000Z
MetaGPT 用户智能体发布,开启端到端自主软件测试新范式!
特工宇宙
2025-09-25T10:02:32.000000Z
Evaluating LLM-Generated Versus Human-Authored Responses in Role-Play Dialogues
cs.AI updates on arXiv.org
2025-09-23T06:08:31.000000Z
DecMetrics: Structured Claim Decomposition Scoring for Factually Consistent LLM Outputs
cs.AI updates on arXiv.org
2025-09-08T04:51:39.000000Z
上海交大发布 AI 生成 3D 人脸质量评估数据集 Gen3DHF
oschina.net
2025-08-15T07:31:51.000000Z
From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms
cs.AI updates on arXiv.org
2025-08-15T04:18:12.000000Z
QAMRO: Quality-aware Adaptive Margin Ranking Optimization for Human-aligned Assessment of Audio Generation Systems
cs.AI updates on arXiv.org
2025-08-13T04:14:59.000000Z