DDTree: Block Diffusion Draft Trees for Speculative Decoding
liranringel.github.io · Local AI · Apr 13, 2026

DDTree builds a draft tree from DFlash's block diffusion distributions, then verifies the whole tree in one target-model forward pass with tree attention. Lossless. Qwen3-30B-MoE, HumanEval, T=0: 8.22x over autoregressive (AR) decoding (+2.13x over DFlash). Uses existing DFlash drafters, no retraining. MIT license.
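To make "verifies the whole tree in one forward pass with tree attention" concrete, here is a minimal sketch of the generic technique, not DDTree's actual code. It assumes the draft tree is stored as a parent-index array in topological order, and that the target model's greedy predictions for every node are already available from the single pass. The function names and the toy token ids are hypothetical.

```python
from typing import List, Tuple


def tree_attention_mask(parents: List[int]) -> List[List[bool]]:
    """mask[i][j] is True when node i may attend to node j.

    A node sees itself and its ancestors only, so sibling branches
    never leak into each other. parents[i] is the parent index of
    node i, or -1 for the root; parents must come before children.
    """
    n = len(parents)
    mask = [[False] * n for _ in range(n)]
    for i, p in enumerate(parents):
        if p >= i:
            raise ValueError("nodes must be in topological order")
        mask[i][i] = True
        if p >= 0:
            # Inherit everything the parent is allowed to see.
            for j in range(i):
                if mask[p][j]:
                    mask[i][j] = True
    return mask


def accept_greedy(
    parents: List[int], tokens: List[int], target_argmax: List[int]
) -> Tuple[List[int], int]:
    """Greedy (T=0) verification over the tree (assumed setup).

    target_argmax[i] is the token the target model predicts after the
    prefix ending at node i. Starting at the root, follow the child
    whose drafted token equals that prediction. Returns the accepted
    node indices and the first target token not covered by the draft.
    """
    children = {}
    for i, p in enumerate(parents):
        children.setdefault(p, []).append(i)
    accepted: List[int] = []
    node = 0  # root: the last already-committed token
    while True:
        want = target_argmax[node]
        match = next(
            (c for c in children.get(node, []) if tokens[c] == want), None
        )
        if match is None:
            return accepted, want
        accepted.append(match)
        node = match


# Toy tree: root 0 has children 1 and 2; node 1 has 3 and 4; node 2 has 5.
parents = [-1, 0, 0, 1, 1, 2]
tokens = [10, 11, 12, 13, 14, 15]
target_argmax = [11, 14, 99, 0, 7, 0]  # made-up target predictions
mask = tree_attention_mask(parents)
accepted, bonus = accept_greedy(parents, tokens, target_argmax)
```

In this toy case the target agrees with the branch 0 → 1 → 4, so two drafted tokens are accepted and the target's own prediction at node 4 is emitted as the next token. Because every emitted token is one the target model would have chosen anyway, the output matches plain greedy decoding, which is the sense in which this kind of verification is lossless.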
19 · Apr 14, 2026, 3:54 PM