Trending
Articles related to "layer pruning"
The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models
cs.AI updates on arXiv.org 2025-10-29T04:22:15.000000Z
When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs
cs.AI updates on arXiv.org 2025-10-28T04:13:44.000000Z
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers, and Gradient Clipping
machinelearning apple 2025-09-29T16:32:53.000000Z
PruneCD: Contrasting Pruned Self Model to Improve Decoding Factuality
cs.AI updates on arXiv.org 2025-09-23T05:43:53.000000Z