热点
"重加权" 相关文章
VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision
cs.AI updates on arXiv.org 2025-11-03T05:19:42.000000Z
Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning
cs.AI updates on arXiv.org 2025-10-17T04:18:22.000000Z
Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective
cs.AI updates on arXiv.org 2025-10-14T04:17:54.000000Z