Bitnet.cpp: Efficient Inference for 1.58bit LLMs(arxiv.org)1 points by galeos 1 year ago | 0 commentsNo comments yet