Addition is All You Need for Energy-efficient Language Models (huggingface.co)
It seems to me that we stumbled onto GPU-heavy matrix multiplication as the default for deep neural nets, and have only scratched the surface of alternative methods, such as Tsetlin Machines and hyperdimensional vectors, that are actually optimized for current CPU architectures.