Long-range transformers in NLP: existing approaches, assumptions and trade-offs | Dark Hacker News