热点
"量化" 相关文章
Integer-only Quantized Transformers for Embedded FPGA-based Time-series Forecasting in AIoT
cs.AI updates on arXiv.org 2025-11-05T05:31:36.000000Z
[推广] 低费率证券开户继续,送 v2 特产继续,感恩抽奖送千元!
V2EX 2025-11-03T02:00:11.000000Z
STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization
cs.AI updates on arXiv.org 2025-10-31T04:09:36.000000Z
[分享创造] 花了一小时写了个 v2 成分分析器,涵盖消费能力、情绪、贡献、影响力、真实度等方面
V2EX 2025-10-31T02:27:38.000000Z
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
cs.AI updates on arXiv.org 2025-10-30T04:20:25.000000Z
临门一脚
火鸟台风 2025-10-27T14:46:17.000000Z
只能说幻方不愧是做量化的,Deepseek 炒币这么猛啊 这个 N of 1 的项目整了个大活 6 个顶级 AI 模型,每个给 1 万美元本金,在真实的加密货币市场自主交易,看谁...
AI探索站 - 即刻圈子 2025-10-21T17:55:08.000000Z
微软BitDistill将LLM压缩到1.58比特:10倍内存节省、2.65倍CPU推理加速
机器之心 2025-10-20T13:33:30.000000Z
微软BitDistill将LLM压缩到1.58比特:10倍内存节省、2.65倍CPU推理加速
机器之心 2025-10-20T13:33:30.000000Z
Microsoft AI Proposes BitNet Distillation (BitDistill): A Lightweight Pipeline that Delivers up to 10x Memory Savings and about 2.65x CPU Speedup
MarkTechPost@AI 2025-10-19T05:39:34.000000Z
QeRL: NVFP4-Quantized Reinforcement Learning (RL) Brings 32B LLM Training to a Single H100—While Improving Exploration
MarkTechPost@AI 2025-10-16T04:32:29.000000Z
Unlock Faster, Smarter Edge Models with 7x Gen AI Performance on NVIDIA Jetson AGX Thor
Nvidia Developer 2025-10-15T18:40:15.000000Z
Unlock Faster, Smarter Edge Models with 7x Gen AI Performance on NVIDIA Jetson AGX Thor
Nvidia Developer 2025-10-15T18:40:15.000000Z
ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization
cs.AI updates on arXiv.org 2025-10-15T05:13:12.000000Z
Conformal Sparsification for Bandwidth-Efficient Edge-Cloud Speculative Decoding
cs.AI updates on arXiv.org 2025-10-14T04:17:01.000000Z
Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices
cs.AI updates on arXiv.org 2025-10-13T04:15:24.000000Z
SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions
cs.AI updates on arXiv.org 2025-10-13T04:13:48.000000Z
[远程工作] [全职远程] [18k~22k] 风控研发工程师( PHP / Python /React)
V2EX 2025-10-13T02:39:37.000000Z
[远程工作] [全职远程] [18k~22k] 风控研发工程师( PHP / Python /React)
V2EX 2025-10-13T02:17:49.000000Z
[远程工作] [全职远程] [18k~22k] 风控研发工程师( PHP / Python /React)
V2EX 2025-10-13T02:17:49.000000Z