热点
关于我们
xx
xx
"
多维度奖励模型
" 相关文章
Improving Video Generation with Human Feedback
cs.AI updates on arXiv.org
2025-10-28T04:14:38.000000Z
Cultivating Helpful, Personalized, and Creative AI Tutors: A Framework for Pedagogical Alignment using Reinforcement Learning
cs.AI updates on arXiv.org
2025-07-29T04:22:21.000000Z