3T Token Open Corpus for Language Model Pretraining | Dark Hacker News