热点
关于我们
xx
xx
"
Grokking
" 相关文章
Finding Features in Neural Networks with the Empirical NTK
少点错误
2025-10-16T18:21:44.000000Z
$\mathbf{Li_2}$: A Framework on Dynamics of Feature Emergence and Delayed Generalization
cs.AI updates on arXiv.org
2025-09-29T04:14:28.000000Z
黑客滥用 X 的 AI 助手传播恶意链接
HackerNews
2025-09-04T02:08:20.000000Z
Tracing the Path to Grokking: Embeddings, Dropout, and Network Activation
cs.AI updates on arXiv.org
2025-07-17T04:14:25.000000Z
Muon Optimizer Significantly Accelerates Grokking in Transformers: Microsoft Researchers Explore Optimizer Influence on Delayed Generalization
MarkTechPost@AI
2025-04-23T06:10:36.000000Z
This AI Research from Ohio State University and CMU Discusses Implicit Reasoning in Transformers And Achieving Generalization Through Grokking
MarkTechPost@AI
2024-07-09T06:01:27.000000Z
Grokfast:通过放大慢梯度加速格罗克学习
buzz
2024-06-04T16:33:14.000000Z