Toward Inference-Optimal Mixture-of-Expert Large Language Models(arxiv.org)24 points by zhiQ 2 years ago | 0 commentsNo comments yet