1-bit inference of 0.8M param GPT running inside 8192 bytes of sram | Dark Hacker News