Matmul() using PyTorch's MPS back end is faster than Apple's MLX | Dark Hacker News