Deriving the gradient for the backward pass of Layer Normalization | Dark Hacker News