热点
"混合精度量化" 相关文章
Mixed-Precision Quantization for Language Models: Techniques and Prospects
cs.AI updates on arXiv.org 2025-10-21T04:27:14.000000Z
MC#: Mixture Compressor for Mixture-of-Experts Large Models
cs.AI updates on arXiv.org 2025-10-14T04:19:31.000000Z
CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration
cs.AI updates on arXiv.org 2025-10-06T04:28:04.000000Z
AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
cs.AI updates on arXiv.org 2025-09-16T05:44:22.000000Z
MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
cs.AI updates on arXiv.org 2025-08-05T11:29:07.000000Z
扩散模型低位量化突破!有效扩散量化的极限推向2-4位,W2A4位宽下FID降低58%,超越SOTA方法
智源社区 2025-01-19T08:37:08.000000Z
扩散模型低位量化突破!有效扩散量化的极限推向2-4位,W2A4位宽下FID降低58%,超越SOTA方法
量子位 2025-01-19T07:39:33.000000Z