Trending
Articles related to "internal representation"
Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings
cs.AI updates on arXiv.org 2025-10-23T04:22:36.000000Z
Localist LLMs with Recruitment Learning
cs.AI updates on arXiv.org 2025-10-21T04:28:15.000000Z
Localist LLMs -- A Mathematical Framework for Dynamic Locality Control
cs.AI updates on arXiv.org 2025-10-13T04:10:23.000000Z
Probing the Difficulty Perception Mechanism of Large Language Models
cs.AI updates on arXiv.org 2025-10-08T04:14:46.000000Z
The View From Space: Navigating Instrumentation Differences with EOFMs
cs.AI updates on arXiv.org 2025-10-07T04:14:16.000000Z
Towards Atoms of Large Language Models
cs.AI updates on arXiv.org 2025-09-26T04:22:04.000000Z
Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models
cs.AI updates on arXiv.org 2025-09-23T06:08:22.000000Z
Hallucination Detection with the Internal Layers of LLMs
cs.AI updates on arXiv.org 2025-09-19T04:29:19.000000Z
Unsupervised Hallucination Detection by Inspecting Reasoning Processes
cs.AI updates on arXiv.org 2025-09-15T08:29:55.000000Z
RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
cs.AI updates on arXiv.org 2025-08-19T04:21:05.000000Z
Cross-Model Semantics in Representation Learning
cs.AI updates on arXiv.org 2025-08-06T04:02:14.000000Z
Decomposing Representation Space into Interpretable Subspaces with Unsupervised Learning
cs.AI updates on arXiv.org 2025-08-05T11:28:53.000000Z
Agent-centric learning: from external reward maximization to internal knowledge curation
cs.AI updates on arXiv.org 2025-07-31T04:48:02.000000Z
Linearly Decoding Refused Knowledge in Aligned Language Models
cs.AI updates on arXiv.org 2025-07-02T04:03:49.000000Z
Latent Functional Maps: A Robust Machine Learning Framework for Analyzing Neural Network Representations
MarkTechPost@AI 2024-12-10T18:19:12.000000Z