LLM.int8(): 8-Bit Matrix Multiplication for Transformers at Scale (arxiv.org) | 7 points by ofirpress 3 years ago | 1 comment