Learning CUDA by optimizing softmax: A worklog – Maharshi's blog | Dark Hacker News