WINA: Weight informed Neuron activation for accelerating LLM inference(arxiv.org)2 points by Ratelman 1 year ago | 0 commentsNo comments yet