热点
"理论分析" 相关文章
Why Federated Optimization Fails to Achieve Perfect Fitting? A Theoretical Perspective on Client-Side Optima
cs.AI updates on arXiv.org 2025-11-05T05:24:25.000000Z
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
cs.AI updates on arXiv.org 2025-10-24T04:21:07.000000Z
Learning under Quantization for High-Dimensional Linear Regression
cs.AI updates on arXiv.org 2025-10-22T04:21:12.000000Z
Budget-constrained Active Learning to Effectively De-censor Survival Data
cs.AI updates on arXiv.org 2025-10-15T04:57:42.000000Z
Critical attention scaling in long-context transformers
cs.AI updates on arXiv.org 2025-10-08T04:12:20.000000Z
On the Limitations and Capabilities of Position Embeddings for Length Generalization
cs.AI updates on arXiv.org 2025-10-07T04:16:26.000000Z
Calibration Meets Reality: Making Machine Learning Predictions Trustworthy
cs.AI updates on arXiv.org 2025-09-30T04:05:10.000000Z
A Theoretical Analysis of Discrete Flow Matching Generative Models
cs.AI updates on arXiv.org 2025-09-29T04:16:40.000000Z
Adam的Update RMS为何总是0.2?噪声模拟到理论近似全讲透
PaperWeekly 2025-09-13T23:52:55.000000Z
求道之人,不问寒暑(九)
远东轶事 - 知乎专栏 2025-09-11T19:45:08.000000Z
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
cs.AI updates on arXiv.org 2025-08-12T04:39:25.000000Z
A Lower Bound for the Number of Linear Regions of Ternary ReLU Regression Neural Networks
cs.AI updates on arXiv.org 2025-07-23T04:03:18.000000Z
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory
cs.AI updates on arXiv.org 2025-07-17T04:14:40.000000Z
What Makes Local Updates Effective: The Role of Data Heterogeneity and Smoothness
cs.AI updates on arXiv.org 2025-07-02T04:03:48.000000Z
再次颠覆学界想象,何恺明发表新作:扩散模型不一定需要噪声条件
机器学习初学者 2025-02-23T05:56:23.000000Z
文本-图像全局对比对齐与 Token-Patch 级别的局部对齐
Jina AI 2025-01-10T16:14:15.000000Z