DashAttention: Differentiable and Adaptable Sparse Hierarchical Attention | Dark Hacker News