Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)(furiosa.ai)9 points by olibaw 213 days ago | 0 commentsNo comments yet