Ask HN: Identifying duplicate data from a large dataset? | Dark Hacker News