对齐技术_Fishai

热点

"对齐技术" 相关文章

醒醒，LLM根本没有性格！加州理工华人揭开AI人格幻觉真相

新智元 2025-09-21T09:29:51.000000Z

Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M

cs.AI updates on arXiv.org 2025-09-12T04:19:06.000000Z

A Comprehensive Evaluation framework of Alignment Techniques for LLMs

cs.AI updates on arXiv.org 2025-08-14T04:18:39.000000Z

Alignment and Safety in Large Language Models: Safety Mechanisms, Training Paradigms, and Emerging Challenges

cs.AI updates on arXiv.org 2025-07-29T04:21:31.000000Z

OpenAI GPT-4.5 系统卡

宝玉的分享 2025-02-27T22:39:19.000000Z

Alignment can be the ‘clean energy’ of AI

少点错误 2025-02-22T00:19:17.000000Z

Is weak-to-strong generalization an alignment technique?

少点错误 2025-01-31T07:17:53.000000Z

综合RLHF、DPO、KTO优势，统一对齐框架UNA来了

机器之心 2024-10-10T06:11:59.000000Z

Copyright © 2019 FISHAI.All Rights Reserved