Full Text Search in the Cloud(blurpr.com) |
Full Text Search in the Cloud(blurpr.com) |
Search is fundamentally hard to put into the cloud, because it requires so many IO operations. In addition, delivering really high quality results requires machine learning and linguistics.
We have a few tricks up our sleeves to handle these issues, and I'm excited not only to shed the beta-ish feel, but to roll out some truly exiting features :) There's a "Review my startup" post coming pretty soon.
Obviously (for those of you who know the Solr codebase), there are some pretty extreme hacks to get around the lack of file system access, but nothing that couldn't be cleaned up.
I was a little surprised about the lack of interest in it when I emailed the solr-dev list.
From memory, I think Compass had a unique problem with the 30 second limit because it would try and re-sync the non-Lucene data with the Lucene indexes (I can't remember what the trigger was for this).
I had quite a lot of issues with Compass-GAE - my impression was that it wasn't really production ready. However, I did notice that Google is using it for their ThoughSite example app, so maybe it has improved.