Self-attention Does Not Need O(n^2) Memory | Dark Hacker News