Articles related to "Training mechanisms"
(How) Do Language Models Track State?
cs.AI updates on arXiv.org
2025-11-03T05:20:18.000000Z
Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training
cs.AI updates on arXiv.org
2025-10-01T05:58:51.000000Z
Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings
cs.AI updates on arXiv.org
2025-09-23T06:11:49.000000Z
Why Is OpenAI's New Paper Being Mocked by the Industry as Marketing?
虎嗅 (Huxiu)
2025-09-12T11:50:08.000000Z
OpenAI's Latest Paper: Why Do Language Models Hallucinate?
oschina.net
2025-09-08T03:08:21.000000Z