热点
"dLLMs" 相关文章
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-13T04:14:41.000000Z
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-13T04:14:41.000000Z
Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
cs.AI updates on arXiv.org 2025-10-08T04:08:33.000000Z
Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
cs.AI updates on arXiv.org 2025-10-08T04:08:33.000000Z
Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
cs.AI updates on arXiv.org 2025-10-07T04:18:08.000000Z
Principled and Tractable RL for Reasoning with Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-07T04:16:12.000000Z
Principled and Tractable RL for Reasoning with Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-07T04:16:11.000000Z
Principled and Tractable RL for Reasoning with Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-07T04:16:11.000000Z
Principled and Tractable RL for Reasoning with Diffusion Language Models
cs.AI updates on arXiv.org 2025-10-07T04:16:11.000000Z
Quant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion Large Language Models
cs.AI updates on arXiv.org 2025-10-07T04:13:14.000000Z
DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
cs.AI updates on arXiv.org 2025-10-06T04:27:51.000000Z
DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
cs.AI updates on arXiv.org 2025-10-03T04:18:39.000000Z
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
cs.AI updates on arXiv.org 2025-10-02T04:17:31.000000Z
A2D: Any-Order, Any-Step Safety Alignment for Diffusion Language Models
cs.AI updates on arXiv.org 2025-09-30T04:04:19.000000Z
免训练加速61倍!陈怡然团队新作DPad:仅关注「彩票token」
智源社区 2025-09-29T03:57:09.000000Z
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
cs.AI updates on arXiv.org 2025-08-21T04:04:24.000000Z
DLLMQuant: Quantizing Diffusion-based Large Language Models
cs.AI updates on arXiv.org 2025-08-21T04:04:08.000000Z
Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position
cs.AI updates on arXiv.org 2025-08-19T04:21:23.000000Z
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
cs.AI updates on arXiv.org 2025-08-13T04:15:03.000000Z
四款扩散大语言模型全部破防?上交&上海AI Lab发现致命安全缺陷
智源社区 2025-07-24T09:19:10.000000Z