FinePDFs: 3T token dataset made from internet PDFs | Dark Hacker News