Achieving 3X speedups on Google TPUs with diffusion-style speculative decoding