Articles related to "Alignment Techniques"
Wake Up, LLMs Have No Personality at All! Chinese Researchers at Caltech Uncover the Truth Behind the AI Personality Illusion
新智元
2025-09-21T09:29:51.000000Z
Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M
cs.AI updates on arXiv.org
2025-09-12T04:19:06.000000Z
A Comprehensive Evaluation framework of Alignment Techniques for LLMs
cs.AI updates on arXiv.org
2025-08-14T04:18:39.000000Z
Alignment and Safety in Large Language Models: Safety Mechanisms, Training Paradigms, and Emerging Challenges
cs.AI updates on arXiv.org
2025-07-29T04:21:31.000000Z
OpenAI GPT-4.5 System Card
宝玉的分享
2025-02-27T22:39:19.000000Z
Alignment can be the ‘clean energy’ of AI
少点错误
2025-02-22T00:19:17.000000Z
Is weak-to-strong generalization an alignment technique?
少点错误
2025-01-31T07:17:53.000000Z
Combining the Strengths of RLHF, DPO, and KTO: The Unified Alignment Framework UNA Arrives
机器之心
2024-10-10T06:11:59.000000Z