热点
"道德对齐" 相关文章
EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models
cs.AI updates on arXiv.org 2025-10-08T04:14:45.000000Z
Alignment: "Do what I would have wanted you to do"
少点错误 2024-07-12T16:50:08.000000Z