AI-Written CUDA Kernels Outperform Nvidia's Best Matmul Library | Dark Hacker News