Articles related to "post-training methods"
Human-in-the-loop Online Rejection Sampling for Robotic Manipulation
cs.AI updates on arXiv.org
2025-10-31T04:07:52.000000Z
Inference efficiency soars 60x: DiDi-Instruct lets diffusion large language models beat a thousand-step GPT in 16 steps
机器之心
2025-10-27T07:15:35.000000Z
Reconstruction Alignment Improves Unified Multimodal Models
cs.AI updates on arXiv.org
2025-09-19T05:06:02.000000Z
A post-training approach to AI regulation with Model Specs
Interconnects
2024-10-22T06:07:43.000000Z
How important is post-training? An AI2 researcher's long-form guide to post-training recipes for frontier models
智源社区
2024-08-20T06:07:37.000000Z