Experiments with Bitnet 1.5 (Ngmi)(huggingface.co) |
Experiments with Bitnet 1.5 (Ngmi)(huggingface.co) |
As most hardware speedups over the years came from decreasing precision, I'm sure NVIDIA tries everything to make FP4 work (and then get rid of FP8 multiplies if it can get away with it).