Ask HN: Tools for exploratory analysis of 10-100GB graphs I've never had to work with any dataset bigger than L3 cache, so I'm somewhat out of my depth here.
I have a sample of the (relational) data that's about 10GB, with another 80GB available that may or may not be mostly garbage. In the end I would like to have the graph in a visual interface to zoom and pan through it, and a way to experiment with different clustering algorithms based on some proximity measure (I have an idea for what those might look like). I'm not a data scientist so I have no overview of the tooling landscape here and find it difficult to filter through endless pages of marketing for vaguely ML/Big Data related products. I'm not looking for an expensive ready-made solution, I do like to hack on things after all :) |