InfiniSQL

InfiniSQL(infinisql.org)

49 points by ahassan 12 years ago | 60 comments

mtravis 12 years ago |

A few things (I'm the author of InfiniSQL)

1) I include keystore-like stored procedures in the source. They do get/set with integer key and string val. I haven't done thorough benchmarking, but I expect them to outperform the other benchmark I've published, which is quite a bit more complex workload

2) (camus2) agreed, nothing ever dies in IT. But roll back the clock a few years. How much noSQL would come into exisence if there was a free xzySQL that scaled across nodes, was fast, etc. I believe the answer is that there'd be very few network-based noSQL for operational workloads if that had been the case.

3) jwatte: Yeah! Jagged edges too!

4) stephen24: Also, I intend to change the license from AGPL to GPL next time I push out some code. No excuse not to try it out.

5) siliconc0w: There's an architectural write-up at High Scalability: http://highscalability.com/blog/2013/11/25/how-to-make-an-in... -- I believe that the actor model architecture is distinct in InfiniSQL.

6) diwu1989: Yes and no. Yes, MemSQL is more mature. No,

(a) I'm not sure how MemSQL scales horizontally (especially since that was a feature added after v1 of their code was released), and,

(b) MemSQL isn't free software

7) itsbits: for now InfiniSQL is mainly for hackers and early adopters--the dependencies are pretty clearly documented but it requires some effort to work with in its current state

jsmthrowaway 12 years ago | |

Please consider the Apache License or some other license instead of the GPL. There are many organizations that cannot use any flavor of GPL, including LGPL, for legal reasons. You can debate the wisdom of that amongst yourselves, but alas, that's how it is in some places.

(And I really want to try this...)

DannyBee 12 years ago | | |

"There are many organizations that cannot use any flavor of GPL, including LGPL, for legal reasons"

To be clear, there are no legal reasons I can think of that would ever prevent internal use of LGPL/GPL software.

You mean these companies (Apple, for example) have policies.

Policies like this often change because someone decides the cost vs risk tradeoff is worth it.

Changing a license because of bad policies of certain companies is not a great reason to change a license (in fact, it's, IMHO, an actively bad one).

You really should only change licenses if you find the license you chose does not suit the needs of your users (and policies are not really needs).

mtravis 12 years ago | | |

I assume these shops have Linux in their environments, including the GNU toolchain. There must be some contradiction somewhere that I'm not aware of.

Based on FSF feedback, I'm going to modify the license to include a Classpath-like exception. The intention is to allow people to write stored procedures that link against infinisql without triggering the copyleft. Only if the source to infinisql itself is modified (and distributed) will the copyleft apply.

I'm curious to know the rationale against the GPL in general (not just the AGPL), and how those shops allow Linux & gnu toolchains in spite of their rule against the GPL.

zobzu 12 years ago | | |

So many reasons to keep GPL. They can use GPL just fine, it's just that they don't wanna contribute if they modify it.

tintor 12 years ago | |

Regarding MemSQL: - we have just released v2.5 with full support for JSON and online ALTER TABLE across cluster - MemSQL performs great on both OLAP and OLTP - it scales well: we have several hundred node cluster in production at Zynga - license cost for startups is $1

mtravis 12 years ago | | |

Congratulations!

Do you have benchmark reports?

jacob019 12 years ago |

I'm supposed to use the perl api for user and schema management? Perl holds a special place in my heart, but I'm not too excited about managing my database with it. How about an interactive console?

I'm currently using MySQL, how similar is the SQL syntax?

mtravis 12 years ago | |

On backlog to fix. But InfiniSQL is for hackers and early adopters at this stage.

The SQL support is documented (http://www.infinisql.org/docs/index/)

coolsunglasses 12 years ago | | |

Hackers and early adopters are using Perl in 2013? Sure you aren't off by 12-15 years?

jacob019 12 years ago | | |

Awesome project and a killer concept. No one has been able to really solve relational database scalability yet. I'll have to study the implementation. I was just talking with some friends a few weeks ago about this problem and we concluded that if someone came up with a distributed relational database with decent scalable performance they would be very successful indeed. Will try it out and follow the progress. Hope it takes off.

arnorhs 12 years ago | | |

Did you mean to link to http://www.infinisql.org/docs/index ? I was getting an error on /docs/

camus2 12 years ago |

I believe the original subtitle is "Extreme Scale Transaction Processing" . "The NoSQL killer" is kind of childish, nothing is going to kill anything.

yeukhon 12 years ago | |

Same thought and it being at an early stage, ugh. And there goes at least a dozen of competitors out there trying to be different than MongoDB. I am just sort of happy that in the SQL world we usually either look at MySQL or PostgreSQL (well, Oracle and SQL servers are probably more relevant to corporate web service)... but I think people are trying to migrate too.

tracker1 12 years ago | | |

I think that even in a NoSQL driven domain, that a classic SQL based RDBMS has a place. It's that certain types of load have acceptable levels of relaxed constraints.. that can increase when your data is searched/read over 1000 times for every write. That joins are expensive and even mirroring data to a nosql store has benefits over purely rdbms.

I like document stores like MongoDB and RethinkDB and feel they are a great fit for most scenarios. I also feel that caching layers with Redis or Memcached can help...

Cassandra is interesting in the primary storage space as well, and imho has resolved a lot of issues, while others remain. I'm interested to see if this database can get there faster than Cassandra/CQL can get to more parity with traditional SQL systems.

While I appreciate the options, there is no one solution for everything... If you never break 100 simultaneous users, memory-mapped flat files and map/reduce could be sufficient.

ashah 12 years ago | |

sensationalism sells, probably why your "original" link was missed by poster

wimpycofounder 12 years ago |

So...uh...how does it work? Anyone know if there is an architecture overview somewhere? And why there isn't a link to it on the damn front page?

jfim 12 years ago | |

From their documentation:

> InfiniSQL currently is an in memory database. This means that all records are stored in system memory, and not written to disk. This provides very high performance--but it also means that InfiniSQL currently lacks the property of Durability. If the power goes out, all data is gone. This limitation is temporary.

They do mention that they'll implement persistence, but that's likely to lower performance, as you're limited to how fast the write ahead log can be written, even if updates to on-disk structures are batched.

They also mention:

> No sharding is necessary with InfiniSQL: it partitions data automatically across available hardware. Connect to any node, and all of the data is accessible.

I haven't looked at how joins are done across large tables that span over multiple nodes (or if it's even supported), but that's not likely to be fast either, for obvious reasons.

mtravis 12 years ago | | |

1) persistence: battery-backed UPS and synchronous replication. No WAL anywhere. I'm thinking about ways to do disk-based storage without synchronous IO, to provide decent performance with higher storage capacity

2) no joins supported yet. However, the benchmark that I performed (on the blog) involves 3 updates across random nodes. I designed InfiniSQL specifically to perform multi-node transactions very well, because that's the Achilles' heel of every other distributed OLTP system. I plan to implement joins, but expect them to perform decently for the workload you describe.

sb057 12 years ago | |

Front page > Documentation > Overview

It practically is on the front page.

jbellis 12 years ago |

Last week's discussion here: https://news.ycombinator.com/item?id=6795263

siliconc0w 12 years ago |

Can you compare InfiniSQL to existing in-memory clustered relational database solutions like Galera?

diwu1989 12 years ago |

I see this as fairly similar to memSQL, but less mature.

diger44 12 years ago |

I actually thought this was another joke at first...

stephen 12 years ago |

"Not just a teaser version". Nice!

glibgil 12 years ago |

It uses 2pc so it won't really scale.

mtravis 12 years ago | |

I think you mean 2PL.

It does really scale, check out the benchmark report on the blog. http://www.infinisql.org/blog/2013/1112/benchmarking-infinis...

For deadlock-prone workloads, it will likely not be as good, admittedly.

I'm considering a variation on MVCC that gets around the single transactionid bottleneck, but the currently implementation is based on 2PL. http://www.infinisql.org/docs/overview/#ftn.idp37098256

For concurrency management algorithms, there are no good ones. Only those that are less bad than others in some cases.

MichaelGG 12 years ago | | |

Have you given any more thought to ... not multithreading it? Since you're scaling across servers, apply the same concept across cores. Presto, no more bottleneck on atomically incrementing an ID.

itsbits 12 years ago |

so many dependencies to install...

jwatte 12 years ago |

Oooh! Shiny!