Are we in another bubble? Empirical data from CrunchBase(kevenlin.com) |
Are we in another bubble? Empirical data from CrunchBase(kevenlin.com) |
There are a number of problems with your analysis:
I suspect you are using founding year. Unfortunately there is a tremendous lag between when a company is founded and when it is entered into CrunchBase. The same is true of funding data in particular where we saw only around 20% of fundings within a quarter of happening and only about 70% a year out. That is due to CrunchBase's continued growth (it's much better known now) as well as a natural reporting lag.
Second, CrunchBase is a very new product and as it turns out data is only reliable as far back as 2007 and even that took a lot of work. Some time has been spent pushing to get more accurate data further back but it is scattershot at best.
(Possibly, can't tell) CrunchBase investments are stored in a number of currencies, did you make sure to recalculate them? Yen can really cause problems :)
Lastly, your NASDAQ chart is from 94 - 2005 which never overlaps with reliable CrunchBase data even by your own admission. I suspect that graph will be a bit more telling and worrisome potentially: http://www.google.com//finance?chdnp=1&chdd=1&chds=1....
I do not necessarily think we are in a bubble and I am happy to see people diving in on data I just wanted to point these things out as it would be irresponsible not to.
Presumably there are also some sort of editorial policies over what sort of startups merit inclusion in CrunchBase? The 2010 drop in startups covered by could easily be a reflection of reduced interest in covering smaller startups that don't effectively court TC and don't disclose relevant funding data.
But there is definitely a huge selection bias of contributors. People love to disclose when they invested in the hot startup and neglect to mention their big mistakes retroactively.
I suspect the biggest reason for the drop is just a smaller team and less commitment like the OP said. There has been a lot of headcount flux at TC for awhile and even more so since the AOL deal.
CrunchBase does not by any means offer stable and comprehensive picture over the years. It has been maintained with different levels of commitment, especially for entering in old data (since Crunchbase did not exist in 2000).
CrunchBase is a great resource, but doing those kinds of statistics without appropriate research of how consistent is the coverage is at least sloppy if not willful negligence.
That is the reason I look at investments from 2005 since most investments prior to that were not entered.
I don't think you did a good job of disclosing or the problems that might lie in the data.
Since I've been following crunchbase both from the data perspective and from the perspective of how much resources TechCrunch is devoting to it, I can assure you it varies wildly.
Also the editorial policy has changed a lot in that period, in about 2007/2008 they started putting much more emphasis on international start-ups. So there are specific skews that you should be aware of and disclose them in the blog post.
So I think you did a nice job, but conclusions are not to be trusted at tall. Yes it might be the most well-maintained open data out there, but it does not make it in any more useful for this kind of analysis.
Also the underlying assumption seems to be exponential increase is the only reason for a bubble.
"This is actually a great time to be a startup founder" - yep, so was 1999, no business plan needed, just a half baked idea and people will start throwing money at you. This is not necessarily a good thing.
At least two Form-D watching services have been mentioned previously on HN:
Though, I think they're relying on easier electronic access that may not go back more than a few years.
Pffttt.... where did this come from? Is it not that after two years of a lousy economy everybody's saving were tapped out?
We are just entering the last quarter of 1998.