Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs(rocm.blogs.amd.com)2 points by matt_d 2 days ago | 0 commentsNo comments yet