Articles related to "Moral Reasoning"
AI "Split Personality" Confirmed: 300,000 Trick Questions Strip the "Fig Leaf" from OpenAI and Google
36kr Tech
2025-10-27T01:03:57.000000Z
MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
cs.AI updates on arXiv.org
2025-10-21T04:24:01.000000Z
Deliberative Dynamics and Value Alignment in LLM Debates
cs.AI updates on arXiv.org
2025-10-14T04:08:22.000000Z
One Model, Many Morals: Uncovering Cross-Linguistic Misalignments in Computational Moral Reasoning
cs.AI updates on arXiv.org
2025-09-29T04:13:47.000000Z
Ethics-Based Refusals Without Ethics-Based Refusal Training
LessWrong
2025-09-23T17:15:27.000000Z
MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
cs.AI updates on arXiv.org
2025-09-17T05:04:42.000000Z
One more reason for AI capable of independent moral reasoning: alignment itself and cause prioritisation
LessWrong
2025-08-22T15:55:53.000000Z
Emergent morality in AI weakens the Orthogonality Thesis
LessWrong
2025-08-21T18:12:59.000000Z
Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants
cs.AI updates on arXiv.org
2025-08-19T04:01:35.000000Z
Normative Moral Pluralism for AI: A Framework for Deliberation in Complex Moral Contexts
cs.AI updates on arXiv.org
2025-08-13T04:15:24.000000Z
"Pull or Not to Pull?": Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas
cs.AI updates on arXiv.org
2025-08-12T04:39:42.000000Z
"Just a strange pic": Evaluating 'safety' in GenAI Image safety annotation tasks from diverse annotators' perspectives
cs.AI updates on arXiv.org
2025-07-23T04:03:16.000000Z
MoralBench: Moral Evaluation of LLMs
cs.AI updates on arXiv.org
2025-07-08T05:53:45.000000Z
Token and Taboo
LessWrong
2025-04-24T22:22:37.000000Z
Token and Taboo
LessWrong
2025-04-24T20:22:24.000000Z
GPT-4o Turns Out to Be a "Moral Expert"? It Answered 50 Dilemmas and Was Preferred Over an NYU Professor
36kr
2024-07-05T07:18:55.000000Z
Study Finds OpenAI's GPT-4o Outperforms Human Experts at Moral Reasoning
TechNode
2024-06-24T06:01:46.000000Z