Gemlite: Simple and fast low-bit matmul kernels in CUDA | Dark Hacker News