Ask HN: anyone interested to build a hacker news with tags? Could anyone help me build a hacker news with tags?
I am asking only those who are interested to have it as well because I only have a budget for the hosting for this. The point is to be able to search through the whole archive using tags/keywords. example of tags: 'security' 'crm' 'a/b testing' 'optimization' 'http', 'ssl', 'domain name' 'scala', 'c++', 'php', etc 'lua' 'sql' 'marketing' 'website' 'landing page' => get all posts that relate to each tag (and combinations of tags) sorted by points of individual posts/comments. To do list: 1. import all hacker news database 2. insert in database all tags for all posts/comments, using an algorithm similar to the Kaggle Keyword Extraction algo (https://www.kaggle.com/c/facebook-recruiting-iii-keyword-extraction), which will need to be refined. 3. create great user interface to the new database ------- or if no-one has the time, could anyone advise me on how to download the whole hacker news database? |