On-Device LLM Inference Powered by X-Bit Quantization(github.com)15 points by dynamix 2 years ago | 0 commentsNo comments yet