神经网络压缩新框架SQS实现高性能压缩

cs.AI updates on arXiv.org 10月13日 12:13

神经网络压缩新框架SQS实现高性能压缩

本文提出了一种基于贝叶斯变分学习的神经网络压缩框架SQS，通过同时进行剪枝和低比特量化，在保持性能的情况下实现更高的压缩率。

arXiv:2510.08999v1 Announce Type: cross Abstract: Compressing large-scale neural networks is essential for deploying models on resource-constrained devices. Most existing methods adopt weight pruning or low-bit quantization individually, often resulting in suboptimal compression rates to preserve acceptable performance drops. We introduce a unified framework for simultaneous pruning and low-bit quantization via Bayesian variational learning (SQS), which achieves higher compression rates than prior baselines while maintaining comparable performance. The key idea is to employ a spike-and-slab prior to inducing sparsity and model quantized weights using Gaussian Mixture Models (GMMs) to enable low-bit precision. In theory, we provide the consistent result of our proposed variational approach to a sparse and quantized deep neural network. Extensive experiments on compressing ResNet, BERT-base, Llama3, and Qwen2.5 models show that our method achieves higher compression rates than a line of existing methods with comparable performance drops.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

神经网络压缩贝叶斯变分学习剪枝低比特量化

相关文章

Rethinking Model Size: Train Large, Then Compress with Joseph Gonzalez - #378

Pruner-Zero: A Machine Learning Framework for Symbolic Pruning Metric Discovery for Large Language Models (LLMs)

The Next Big Trends in Large Language Model (LLM) Research

使用PEFT库进行ChatGLM3-6B模型的QLORA高效微调

基金经理业绩不好，确实可以批评，但以此来否定他的研究，甚至人身攻击，是有失偏颇的。在A股做主观投资，是一门艺术，而不是科学，有学识不一定能赚钱，反而在A...

交易难，难于上青天。早盘集合竞价，大众交通这种当红炸子鸡点的股，资金开始集合竞价加丹引诱量化，量化真就开盘突突了一大片无人驾驶的个股。看起来一片热热闹...

roots-4 - Track your digital dopamine, break your phone addiction

乡亲们，过分了哈！跌的时候天天骂转融通和量化，不停转融通和量化，坚决不入场。现在转融通暂停了，融券保证金提高，量化也在增本降速，这一系列措施中翻中就是...

Q-GaLore Released: A Memory-Efficient Training Approach for Pre-Training and Fine-Tuning Machine Learning Models

$上证指数(SH000001)$ $沪深300ETF(SH510300)$ 当大家以为会议开完，郭嘉不再护盘的时候，郭嘉队反而比前几天更大力度地护盘。从沪深300ETF的分时线来看，今天起...