Universal Transformers Need Memory: Depth-State Trade-Offs in Adaptive Recursive | Dark Hacker News