Universal Transformers Need Memory: Depth-State Trade-Offs in Adaptive Recursive (arxiv.org)
1 point by che_shr_cat 66 days ago | 0 comments