Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention(substack.com)2 points by eigenBasis 6 days ago | 0 commentsNo comments yet