MiniMax teased M3 Sparse Attention: 9.7x prefilling, 15.6x decoding at 1M(twitter.com)3 points by rebekkamikkoa 3 hours ago | 0 commentsNo comments yet