Show HN: Top business podcasts dataset (7/12/20-7/18/20) We took the top 100 business / entrepreneurship podcasts, transcribed them, and ran them through an NLP algorithm (AWS Comprehend for now, looking to build our own in the future). This dataset is for 7/12/20-7/18/20 (last week Sat - Sun) The synthesized data is in Airtable https://topicbase.co/#week29 It categorizes key phrases of the audio transcripts into 9 different categories (https://docs.aws.amazon.com/comprehend/latest/dg/how-entities.html) Each of the topics has the top 20 results, except in the few cases where we removed duplicates (It counted Instagram and instagram as two different topics for example). This avoids the really long tails of single keyword mentions. The "Top 100" comes from Chartable's top 100 podcasts for the business / entrepreneurship category based on the apple podcast data out of the U.S. If this is interesting/cool/valuable for HN I can post a new one here each week. What other categories do you think would be cool to do? I like the idea of financial news to add to trading algorithms but would be curious what comes to mind. |