Rust+OpenCL+AVX2 implementation of LLaMA inference code | Dark Hacker News