Diffusion Beats Autoregressive in Data-Constrained Settings | Dark Hacker News