Fast transformer inference with Metal Performance Shaders | Dark Hacker News