“Attention”, “Transformers”, in Neural Network “Large Language Models” | Dark Hacker News