道德推理_Fishai

热点

"道德推理" 相关文章

AI人格分裂实锤，30万道送命题，撕开OpenAI、谷歌「遮羞布」

36kr-科技 2025-10-27T01:03:57.000000Z

MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes

cs.AI updates on arXiv.org 2025-10-21T04:24:01.000000Z

Deliberative Dynamics and Value Alignment in LLM Debates

cs.AI updates on arXiv.org 2025-10-14T04:08:22.000000Z

Deliberative Dynamics and Value Alignment in LLM Debates

cs.AI updates on arXiv.org 2025-10-14T04:08:22.000000Z

One Model, Many Morals: Uncovering Cross-Linguistic Misalignments in Computational Moral Reasoning

cs.AI updates on arXiv.org 2025-09-29T04:13:47.000000Z

Ethics-Based Refusals Without Ethics-Based Refusal Training

少点错误 2025-09-23T17:15:27.000000Z

MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables

cs.AI updates on arXiv.org 2025-09-17T05:04:42.000000Z

One more reason for AI capable of independent moral reasoning: alignment itself and cause prioritisation

少点错误 2025-08-22T15:55:53.000000Z

Emergent morality in AI weakens the Orthogonality Thesis

少点错误 2025-08-21T18:12:59.000000Z

Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants

cs.AI updates on arXiv.org 2025-08-19T04:01:35.000000Z

Normative Moral Pluralism for AI: A Framework for Deliberation in Complex Moral Contexts

cs.AI updates on arXiv.org 2025-08-13T04:15:24.000000Z

"Pull or Not to Pull?'': Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas

cs.AI updates on arXiv.org 2025-08-12T04:39:42.000000Z

"Just a strange pic": Evaluating 'safety' in GenAI Image safety annotation tasks from diverse annotators' perspectives

cs.AI updates on arXiv.org 2025-07-23T04:03:16.000000Z

MoralBench: Moral Evaluation of LLMs

cs.AI updates on arXiv.org 2025-07-08T05:53:45.000000Z

Token and Taboo

少点错误 2025-04-24T22:22:37.000000Z

Token and Taboo

少点错误 2025-04-24T20:22:24.000000Z

GPT-4o竟是「道德专家」？解答50道难题，比纽约大学教授更受欢迎

36kr 2024-07-05T07:18:55.000000Z

研究发现 OpenAI 的 GPT-4o 道德推理能力胜过人类专家

动点科技 2024-06-24T06:01:46.000000Z

Copyright © 2019 FISHAI.All Rights Reserved