热点
"矩阵梯度" 相关文章
Isotropic Curvature Model for Understanding Deep Learning Optimization: Is Gradient Orthogonalization Optimal?
cs.AI updates on arXiv.org 2025-11-05T05:26:13.000000Z