Length-Induced Embedding Collapse in Transformer-Based Models (arxiv.org)
3 points by Wheatman 1 year ago | 0 comments