XRL算法调试评估方法研究

cs.AI updates on arXiv.org 10月21日 12:11

XRL算法调试评估方法研究

本文提出了一种新的XRL算法调试评估方法，通过Atari的Ms. Pacman环境和四种XRL算法进行测试，发现只有一种算法在测试目标上达到了随机准确性以上，且用户在选择时普遍过于自信，且用户对解释的易识别和理解程度与其准确性无相关性。

arXiv:2510.16956v1 Announce Type: new Abstract: Debugging is a core application of explainable reinforcement learning (XRL) algorithms; however, limited comparative evaluations have been conducted to understand their relative performance. We propose a novel evaluation methodology to test whether users can identify an agent's goal from an explanation of its decision-making. Utilising the Atari's Ms. Pacman environment and four XRL algorithms, we find that only one achieved greater than random accuracy for the tested goals and that users were generally overconfident in their selections. Further, we find that users' self-reported ease of identification and understanding for every explanation did not correlate with their accuracy.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签