Language Modeling by Estimating the Ratios of the Data Distribution | Dark Hacker News