The Transformer as Renormalization Group Flow | Dark Hacker News