Trending
Articles related to "alignment framework"
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare
cs.AI updates on arXiv.org 2025-10-13T04:09:06.000000Z
The Anatomy of Alignment: Decomposing Preference Optimization by Steering Sparse Features
cs.AI updates on arXiv.org 2025-09-17T04:52:49.000000Z
Inversion-DPO: Precise and Efficient Post-Training for Diffusion Models
cs.AI updates on arXiv.org 2025-07-17T04:14:17.000000Z