A neural network layer is a key-value/attention memory | Dark Hacker News