Trending
Articles related to "preference matching"
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
cs.AI updates on arXiv.org 2025-10-27T06:27:08.000000Z
Fine-tuning GPT-2 from human preferences
OpenAI blog 2025-09-06T09:45:28.000000Z