D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning | Dark Hacker News