Deriving the gradient for the backward pass of Layer Normalization(shreyansh26.github.io)3 points by shreyansh26 1 year ago | 0 comments