热点
关于我们
xx
xx
"
低秩投影
" 相关文章
NeurIPS 2025 | 仅用20B tokens蒸出SOTA,小模型的「低秩时刻」到了
PaperWeekly
2025-10-21T05:27:14.000000Z
NeurIPS 2025 | 仅用20B tokens蒸出SOTA,小模型的「低秩时刻」到了
PaperWeekly
2025-10-20T16:35:42.000000Z
Q-GaLore Released: A Memory-Efficient Training Approach for Pre-Training and Fine-Tuning Machine Learning Models
MarkTechPost@AI
2024-07-14T04:01:15.000000Z