A Geometric Analysis of Internal Emotion Representations in LLMs

cs.AI updates on arXiv.org, October 28, 12:12

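This paper studies how large language models (LLMs) internally represent emotion by analyzing the geometric structure of their hidden-state space. It identifies a low-dimensional emotional manifold and shows that emotional representations are distributed across layers and aligned with interpretable dimensions. These structures remain stable across depth and generalize to eight real-world emotion datasets spanning five languages.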

arXiv:2510.22042v1 (announce type: cross)

Abstract: This work investigates how large language models (LLMs) internally represent emotion by analyzing the geometry of their hidden-state space. The paper identifies a low-dimensional emotional manifold and shows that emotional representations are directionally encoded, distributed across layers, and aligned with interpretable dimensions. These structures are stable across depth and generalize to eight real-world emotion datasets spanning five languages. Cross-domain alignment yields low error and strong linear probe performance, indicating a universal emotional subspace. Within this space, internal emotion perception can be steered while preserving semantics using a learned intervention module, with especially strong control for basic emotions across languages. These findings reveal a consistent and manipulable affective geometry in LLMs and offer insight into how they internalize and process emotion.
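The abstract mentions a low-dimensional emotional subspace and linear probe performance without giving details. The sketch below illustrates what those two terms generally mean, using synthetic data only. It is not the paper's method: the vectors are random stand-ins for LLM hidden states, the four "emotions" are arbitrary class labels, and every function name and parameter (`make_synthetic_hidden_states`, `noise`, the 64-dimensional state size) is a hypothetical choice made for illustration.

```python
import numpy as np


def make_synthetic_hidden_states(n_per_class=200, dim=64, n_emotions=4,
                                 noise=0.3, seed=0):
    """Build fake 'hidden states': one direction per emotion plus Gaussian noise.

    Assumption: real LLM activations are replaced by synthetic vectors here.
    """
    rng = np.random.default_rng(seed)
    directions = rng.normal(size=(n_emotions, dim))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)
    labels = np.repeat(np.arange(n_emotions), n_per_class)
    states = 3.0 * directions[labels] + noise * rng.normal(size=(labels.size, dim))
    return states, labels


def explained_variance_ratio(states):
    """Fraction of total variance carried by each principal component."""
    centered = states - states.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return variances / variances.sum()


def add_bias(states):
    """Append a constant column so the probe can learn an offset."""
    return np.hstack([states, np.ones((len(states), 1))])


def fit_linear_probe(states, labels, n_classes):
    """Least-squares linear probe: regress one-hot labels on the states."""
    targets = np.eye(n_classes)[labels]
    weights, *_ = np.linalg.lstsq(add_bias(states), targets, rcond=None)
    return weights


def predict(weights, states):
    """Predict the class with the largest linear score."""
    return (add_bias(states) @ weights).argmax(axis=1)


states, labels = make_synthetic_hidden_states()

# Low-dimensional structure: with 4 classes, the class means span at most
# 3 dimensions after centering, so the top 3 components should dominate.
top3 = explained_variance_ratio(states)[:3].sum()

# Linear probe: train on a random 75% split, evaluate on the rest.
perm = np.random.default_rng(1).permutation(len(labels))
train, test = perm[:600], perm[600:]
weights = fit_linear_probe(states[train], labels[train], n_classes=4)
accuracy = (predict(weights, states[test]) == labels[test]).mean()

print(f"variance in top 3 components: {top3:.2f}")
print(f"held-out probe accuracy: {accuracy:.2f}")
```

On real models, `states` would be replaced by activations extracted from a chosen layer, and the probe would typically be compared across layers and datasets. The abstract does not say which probe, layers, or dimensionality-reduction method the authors used.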
