State-space models can learn in-context by gradient descent(arxiv.org)2 points by trextrex 1 year ago | 0 commentsNo comments yet