Weight decay vs. L2 regularization | Dark Hacker News