Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs | Dark Hacker News