1-bit inference of 0.8M param GPT running inside 8192 bytes of sram(twitter.com)3 points by montyanderson 81 days ago | 0 commentsNo comments yet