WINA: Weight informed Neuron activation for accelerating LLM inference(arxiv.org)2 points by Ratelman 350 days ago | 0 commentsNo comments yet