Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)(furiosa.ai)9 points by olibaw 258 days ago | 0 commentsNo comments yet