热点
"dLLM" 相关文章
扩散语言模型也能强化学习?Meta田渊栋团队用“三明治梯度”打通RL闭环
PaperWeekly 2025-10-21T05:27:14.000000Z
扩散语言模型也能强化学习?Meta田渊栋团队用“三明治梯度”打通RL闭环
PaperWeekly 2025-10-20T16:35:38.000000Z
扩散语言模型也能强化学习?Meta田渊栋团队用“三明治梯度”打通RL闭环
PaperWeekly 2025-10-20T16:35:38.000000Z
推理速度10倍提升,蚂蚁集团开源业内首个高性能扩散语言模型推理框架dInfer
机器之心 2025-10-13T16:13:01.000000Z
推理速度10倍提升,蚂蚁集团开源业内首个高性能扩散语言模型推理框架dInfer
机器之心 2025-10-13T16:13:01.000000Z
dInfer: An Efficient Inference Framework for Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-13T04:13:06.000000Z
dInfer: An Efficient Inference Framework for Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-13T04:13:06.000000Z
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits
cs.AI updates on arXiv.org 2025-10-08T04:15:03.000000Z
Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs
cs.AI updates on arXiv.org 2025-10-07T04:05:12.000000Z
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
cs.AI updates on arXiv.org 2025-09-30T04:06:32.000000Z
免训练加速61倍!陈怡然团队新作DPad:仅关注「彩票token」
新智元 2025-09-27T09:31:03.000000Z
扩散大语言模型也能飞?DPad免训练加速61倍,全局规划照样稳
PaperWeekly 2025-09-21T15:33:02.000000Z
扩散大语言模型也能飞?DPad免训练加速61倍,全局规划照样稳
PaperWeekly 2025-09-20T03:50:04.000000Z
扩散语言模型有MoE版了!蚂蚁&人大从头训练LLaDA-MoE,将完全开源
机器之心 2025-09-13T23:52:39.000000Z
扩散语言模型有MoE版了!蚂蚁&人大从头训练LLaDA-MoE,将完全开源
机器之心 2025-09-13T07:18:07.000000Z
DPad: 扩散大语言模型的中庸之道,杜克大学陈怡然团队免训推理加速61倍
机器之心 - 知乎专栏 2025-09-11T19:55:26.000000Z
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
cs.AI updates on arXiv.org 2025-08-21T04:04:24.000000Z
dLLM的「Free Lunch」!浙大&蚂蚁利用中间结果显著提升扩散语言模型
机器之心 2025-08-20T07:29:33.000000Z
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing
cs.AI updates on arXiv.org 2025-08-14T04:18:58.000000Z