Large Transformer Model Inference Optimization(lilianweng.github.io)3 points by axit 3 years ago | 0 commentsNo comments yet