CockroachDB 2.0 Performance Makes Significant Strides

CockroachDB 2.0 Performance Makes Significant Strides (cockroachlabs.com)

384 points by awoods187 8 years ago | 174 comments

atombender 8 years ago |

Looks very promising! We've looked at Cockroach for a particular project, and we've been concerned that performance wasn't good enough.

Cockroach performance seems to scale linearly, but single-connection performance, especially for small transactions, seems rather dismal. Some casual stress testing against a 3-node cluster on Kubernetes showed that small transactions modifying a single row could take as much as 7-8 seconds, where Postgres would take just a few milliseconds.

The documentation recommends that you batch as many updates as possible, but obviously that doesn't work for low-latency applications like web frontends that need to be able to do small, fine-grained modifications.
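
The kind of single-connection, single-row test described above can be sketched roughly as follows. This is only an illustration: it uses Python's built-in sqlite3 as a local stand-in so it runs anywhere, and the `counters` table and the `measure_single_row_updates` helper are made-up names, not anything from the original test. Against a real cluster you would swap in a network connection, and the numbers would look very different.

```python
import sqlite3
import statistics
import time


def measure_single_row_updates(conn, n=200):
    """Time n small transactions that each modify the same single row."""
    # Hypothetical one-row table used only for this illustration.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS counters (id INTEGER PRIMARY KEY, value INTEGER)"
    )
    conn.execute("INSERT OR REPLACE INTO counters (id, value) VALUES (1, 0)")
    conn.commit()

    latencies_ms = []
    for _ in range(n):
        start = time.perf_counter()
        conn.execute("UPDATE counters SET value = value + 1 WHERE id = 1")
        conn.commit()  # one commit per update, i.e. no batching
        latencies_ms.append((time.perf_counter() - start) * 1000)

    latencies_ms.sort()
    return {
        "p50_ms": statistics.median(latencies_ms),
        "p99_ms": latencies_ms[int(0.99 * (n - 1))],
        "max_ms": latencies_ms[-1],
    }


if __name__ == "__main__":
    print(measure_single_row_updates(sqlite3.connect(":memory:")))
```

Looking at p50, p99 and max separately helps tell "everything is slow" apart from "a few transactions stall", which is the distinction the replies below are probing.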

thanatos_dem 8 years ago | |

7-8 seconds? Something definitely sounds misconfigured. I've been running a 1.1.x cluster for quite a while and I've never seen a single row transaction take that long. And even the slowest queries took at most ~500ms, and that was with:

  - Replication factor increased to 5x (rather than the 3x default)
  - 8 indexes on the table being modified which also needed to be updated
  - Nodes spread across North America, incurring higher RTT latency between nodes
  - Relatively high contention on the data triggering client-side retries
  - HDDs as the storage medium (RocksDB is optimized for SSDs)
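
For readers unfamiliar with the client-side retries mentioned in the list above: CockroachDB reports retryable serialization conflicts with SQLSTATE 40001, and the client is expected to run the transaction again. A minimal sketch of such a retry loop, assuming a hypothetical `RetryableError` exception in place of whatever your driver actually raises, and a hypothetical `run_with_retries` helper:

```python
import random
import time


class RetryableError(Exception):
    """Hypothetical stand-in for a driver error carrying SQLSTATE 40001."""


def run_with_retries(txn, max_attempts=5, base_delay=0.01, sleep=time.sleep):
    """Call txn() until it succeeds or max_attempts is exhausted.

    Returns (result, attempts). Backs off exponentially with jitter between
    attempts so that contending clients do not retry in lockstep.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return txn(), attempt
        except RetryableError:
            if attempt == max_attempts:
                raise
            # Sleep between 0 and base_delay * 2^(attempt - 1) seconds.
            sleep(random.uniform(0, base_delay * 2 ** (attempt - 1)))
```

Under high contention on a single row, each retry adds a full round of latency, which is one way a nominally small transaction can end up taking much longer than expected.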

jetrink 8 years ago | |

7-8 seconds seems extremely long. Human beings performing the raft consensus algorithm using paper and pencil over Skype wouldn't be much slower than that. Are you sure everything was working correctly?

lacker 8 years ago | | |

I don't know about you, but it would take me a lot longer than 7 seconds to perform the raft consensus algorithm with paper and pencil.

tyingq 8 years ago | |

" small transactions modifying a single row could take as much as 7-8 seconds"

That's surprising. I wasn't expecting CockroachDB to be really fast, given the constraints they work within. But that sounds more like a bug or config error. Unless perhaps you mean a really high number of processes trying to update the same row at the same time? Like a global counter or something?

lobster_johnson 8 years ago | | |

Indeed, the stress test updates just one row, which mirrors certain write patterns in our application. I just started this testing, so we'll see what happens when I extend it to more than one row.

orangechairs 8 years ago | |

[cockroach employee here] Have you hit us up on Gitter or Stack Overflow to help debug and tune? We'd also love to learn more about how you're using K8s, what your setup looks like, surprises you're running into with it, etc.

lobster_johnson 8 years ago | | |

Thanks, I'll do that when I get back to testing.

welder 8 years ago | |

Did you use the 2.0 beta version or the latest stable release? They improved performance a lot in the 2.0 beta released this month.

lobster_johnson 8 years ago | | |

I used 1.1.6. Looking forward to testing 2.0.

jnordwick 8 years ago | |

> low-latency applications like web frontends

...

atombender 8 years ago | | |

We have a collaborative, Google Docs-like application that currently issues a write every time someone types into a text field. Now, clearly it's suboptimal and something that should be optimized to batch the updates, but on the other hand, with Postgres we've had zero incentive to make such an optimization, because it's able to handle thousands of writes per node in real time with no queuing happening on the client. I don't expect this from Cockroach, but I would definitely want low latency.
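
The batching optimization mentioned above could look something like this. It is a sketch under assumptions, not a description of the actual application: `WriteCoalescer` and `flush_fn` are hypothetical names, and the idea is simply to keep only the latest text per field and write it out in one go.

```python
class WriteCoalescer:
    """Buffers per-field edits and flushes only the latest value of each field."""

    def __init__(self, flush_fn, max_pending=50):
        self._flush_fn = flush_fn        # hypothetical: performs one batched DB write
        self._max_pending = max_pending  # flush after this many recorded edits
        self._pending = {}               # field id -> latest text
        self._edits_since_flush = 0

    def record(self, field_id, text):
        """Remember the newest text for a field; later edits overwrite earlier ones."""
        self._pending[field_id] = text
        self._edits_since_flush += 1
        if self._edits_since_flush >= self._max_pending:
            self.flush()

    def flush(self):
        """Hand all pending values to flush_fn as one batch, then reset."""
        if self._pending:
            self._flush_fn(dict(self._pending))
        self._pending.clear()
        self._edits_since_flush = 0
```

In practice you would probably also call `flush()` on a short timer so that a user who stops typing still gets their last edit persisted quickly.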

zzzcpan 8 years ago | | |

Poor UI choices these days do not provide feedback and reload, hence making low-latency necessary to even be tolerated by users.

segmondy 8 years ago |

I like what Cockroach is doing, I'm rooting for them to grow and survive. Unfortunately the only time I hear about it is when they post blogs. I never hear about it from other people.

AlimJaffer 8 years ago | |

(raises hand) We're using them extensively. They're our database of choice that we've paired with Nakama[1] which is an open-source, distributed server. Have nothing but great things to say about the database itself in terms of growing performance and the team behind it :). They've been great to us since day-1.

[1] github.com/heroiclabs/nakama

netghost 8 years ago | | |

What kind of workload are you using it for? What's been your biggest win while using it?

wilbeibi 8 years ago |

The thing I really don't get is why CockroachDB avoids benchmarking against its rival TiDB (https://github.com/cockroachdb/docs/issues/1412). TiDB is already pretty mature and used in many big companies (for example Didi, which operates at a data scale similar to Uber's, and banks).

Even if I like CockroachDB's pg sql more, it would be helpful to have the comparison/benchmark to show something more.

etaioinshrdlu 8 years ago |

Project idea: globally hosted / managed CockroachDB that lets developers quickly start building small apps cheaply or free using this database.

This database has the potential to dethrone Spanner in a major way.

joris 8 years ago | |

That’s on their roadmap: https://www.cockroachlabs.com/docs/stable/frequently-asked-q...

SoulMan 8 years ago | | |

My org/team is too conservative to use this if they have to hire ops, and too frugal to use Spanner.

thinkloop 8 years ago | |

I just started playing with spanner. The API is nice. Simple txs are a bit slow - seconds instead of milliseconds - ok for my use-case. I don't hear much about it though, few articles on HN, I was wondering how mature it was and how widely used it is. Is it common knowledge that it is used by a lot of orgs?

etaioinshrdlu 8 years ago | | |

De-throne in the sense of being the top-tier multi-master consistent transactional DB. I doubt too many people use Spanner outside of Google in reality. Google would likely eventually adopt CockroachDB if it was clearly better long term.

Gigablah 8 years ago | |

There’s arguably nothing to dethrone, Spanner is too cost prohibitive for small developers in the first place.

qaq 8 years ago |

How is this meaningful without detailed setup description? http://www.tpc.org/tpcc/results/tpcc_results.asp?print=false... Looking at this list of results one wonders what those results actually mean?

ElijahLynn 8 years ago | |

At the bottom of the article it says that information is coming:

"Note: We have not filed for official certification of our TPC-C results. However, we will post full reproduction steps in a forthcoming whitepaper."

qaq 8 years ago | | |

I guess we'll have to wait. The progress they've made is obviously impressive, but it would really help if one could understand the overhead vs. a conventional RDBMS: 5X might be OK, 20X not so much.

pinars 8 years ago | |

I think you can still derive some insights. I clicked on the TPC-C results you shared and read their executive summaries.

The Oracle on SPARC cluster (at the top, 2010) performs 30.2M qualified tx/min vs the 16K tx/min in this blog post. The Oracle cluster also costs $30M, which is clearly higher than the Cockroach cluster's cost.

That said, the TPC-C benchmark is new to me. Happy to update this comment if I'm misreading the numbers.

(Edited to incorporate the reply below.)

arjunnarayan 8 years ago | | |

A short note that the total cost of that SPARC cluster was $30 million. You're not misreading those numbers, but it requires a little context.

We're focusing today on our improvements over CockroachDB 1.1, using a small-ish cluster. We'll be showing some more scalability with larger clusters in the coming weeks. If you've found CockroachDB performance slow in the past, you will be pleasantly surprised with this release!

Rafuino 8 years ago |

How much and what kind of memory and storage (SATA SSD, NVMe SSD, HDD?) is included in the 3 nodes used for testing? This benchmarking is really interesting but the next level is to understand the cost per tpmC measured. Memory especially, and storage, are big components of cost these days.

arjunnarayan 8 years ago | |

Short answer: 3 n1-highcpu-16 GCE VMs with Local SSDs attached. We're working on a complete disclosure document, with comprehensive reproduction steps to replicate all our numbers. This document should be out in a couple of weeks. We want to walk you through, command by command, on how to reproduce these numbers, and verify the results for yourself.

Rafuino 8 years ago | | |

Thanks for the short answer. Would be good to know how many local SSDs are attached though for the 850 warehouse scenario. The TPC-C documentation says each warehouse maintains 100,000 items in their stock, but I can't surmise from that how much storage is required to hold 850 warehouses' worth of data. I'm impatient though so let me try to work through the #s myself. I'm using GCP's monthly reserved pricing in the US-Iowa region as a reference as of today's pricing.

An n1-highcpu-16 GCE VM costs $289.84/month. Local SSDs are added at 375GB per drive, and they cost $30/month at $0.08 per GB. I highly doubt you could fit the ~1250 warehouses (what got you the peak TPM-C) on 375GB local SSD, but I have to make assumptions here! So, now you're paying $319.84 per instance per month, or $959.52 for 3 of these instances.

At 16,150 TPC-C, you're paying roughly $0.06 per TPC-C, or, looking at it the other way, you're getting 16.83 TPC-C per dollar spent each month. Is that good? I don't know!

Now, the really interesting question is, is that TPC-C/$ on CRDB 2.0 actually better than TPC-C/$ on CRDB 1.1? The answer lies in how many local SSDs you have to provision to reach that peak throughput. Peak is at ~1300 warehouses on CRDB 2.0, and ~800 warehouses on CRDB 1.1.

Does anyone with more knowledge here know how much storage you need per warehouse in the TPC-C test?
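
The back-of-the-envelope arithmetic above can be reproduced with a few lines of Python. The prices and the 16,150 throughput figure are taken from the comment; the function name and the `ssds_per_node` parameter are made up here, the latter so you can see how the answer shifts if more than one local SSD per node turns out to be needed. Note that 3 × $319.84 works out to $959.52 per month, which is the figure consistent with 16.83 per dollar.

```python
def monthly_cost_per_tpmc(vm_monthly, ssd_gb, ssd_price_per_gb, nodes, tpmc,
                          ssds_per_node=1):
    """Return (cluster $/month, $ per tpmC, tpmC per $) for a simple cluster."""
    ssd_monthly = ssd_gb * ssd_price_per_gb * ssds_per_node
    cluster_monthly = nodes * (vm_monthly + ssd_monthly)
    return cluster_monthly, cluster_monthly / tpmc, tpmc / cluster_monthly


cluster, dollars_per_tpmc, tpmc_per_dollar = monthly_cost_per_tpmc(
    vm_monthly=289.84,      # n1-highcpu-16, per the comment
    ssd_gb=375,             # one local SSD (assumed, as in the comment)
    ssd_price_per_gb=0.08,
    nodes=3,
    tpmc=16_150,
)
print(f"cluster: ${cluster:.2f}/month")                  # cluster: $959.52/month
print(f"${dollars_per_tpmc:.3f} per tpmC, "
      f"{tpmc_per_dollar:.2f} tpmC per dollar")           # $0.059 per tpmC, 16.83 tpmC per dollar
```

Rerunning with, say, `ssds_per_node=4` shows how quickly the storage assumption moves the result, which is why the storage-per-warehouse question matters.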

baconomatic 8 years ago |

I'd love to hear from someone who has implemented this in production. Seems like really cool tech, but haven't had a chance to use it on a project yet.

welder 8 years ago | |

Using it in production currently with dual-write and dual-read to compare perf. I'll do a write-up showing how Cockroach performs compared to Citus and Cassandra for my use case.

qeternity 8 years ago | | |

We use Citus and Memsql (big data analytics use cases). How does Cockroach handle joins and other OLAP style queries?

didip 8 years ago | | |

Please post your write-up on HN! I would love to read about CockroachDB performing in the real world.

some_account 8 years ago | | |

Please do, that would be a very popular read I think.

baconomatic 8 years ago | | |

Please do!

smnscu 8 years ago | |

Works great, just a tad slow. Hopefully this improves things. Deploying with Kubernetes is pretty seamless as well.

evrydayhustling 8 years ago |

Great stuff. I appreciated being educated about TPC-C, and the whole spirit of not focusing on vanity benchmarks!

itsdrewmiller 8 years ago | |

Same here, but in educating myself more I found that TPC-C seems to be a somewhat obsolete metric compared to TPC-E (see https://stackoverflow.com/questions/9246939/what-is-the-diff...). Why use the old one here?

edit: Looking into it even further, I agree with the co-author's response here that TPC-C is still an appropriate metric. TPC-E is different and newer but still not as widely used.

arjunnarayan 8 years ago | | |

I don't think it's true to claim that TPC-C is obsolete and subsumed by TPC-E. They are both different OLTP benchmarks, with different characteristics. TPC-C is more write heavy, TPC-E is far more read heavy. It's true that TPC-E is newer, but doesn't deprecate TPC-C (the way TPC-A, for instance, is now deprecated).

We chose TPC-C because it's far more understood than TPC-E in 2018. We wanted to provide understandable benchmarks that can be put into context with other databases. Other databases report TPC-C numbers, so we chose to do so as well.

sheeeep86 8 years ago |

I don't like when companies are not transparent about the pricing of their product. If you have a price page, show the price, so that I can decide if this is relevant for me or not ...

true_religion 8 years ago | |

It's not relevant to you.

Enterprise pricing generally scales with the size of your company/budget and how much trouble they think you'll be worth as a customer.

As a rule of thumb, it starts at just above 1000 USD per unit, and goes up from there.

Many contracts are bespoke orders especially when you're dealing with a small company, so you can't have transparency since there isn't a single product.

nhumrich 8 years ago | |

I would usually agree with you, but cockroach is so new, I doubt they have any type of fixed price. They probably work it out on a 1-by-1 basis.

ahmedalsudani 8 years ago | |

The only thing the Enterprise offering gives you is priority access.

mjibson 8 years ago | | |

Enterprise allows access to various features like distributed backup and restore.

skybrian 8 years ago |

I wonder how far apart those three nodes are and how much the latency between them matters?

d0ugie 8 years ago |

Hadn't heard of Cockroach but based on the article, this thread and the rest of their site it sounds at least worth installing on a few hobby nodes if only to get familiar with the behavior and configuration should a need arise - like Cassandra was years ago when I had already on my own learned the gist of it, sort of a road not taken relative to my then-firm's usual prescriptions (MySQL and Mongo), it turned out to be perfect for my team's needs (paperwork to get permission to use it notwithstanding). Thanks for posting and good luck!

hellofunk 8 years ago |

Nice pun there. Cockroaches do indeed have a habit of making significant strides.

Asdfbla 8 years ago |

Does someone have more information about how they implemented serializable in such a way that, as they claim, performance isn't negatively impacted? Seems pretty hard to achieve that.

elvinyung 8 years ago |

Since you only have 3 nodes, doesn't that mean every range is replicated to every node? Doesn't that make joins trivial (i.e. no different from non-distributed joins)?

d4l3k 8 years ago | |

Yeah, though from what I understand this benchmark is measuring both transactional read and write performance rather than just join performance.

Transactional writes are likely the slowest thing since they need to talk to all replicas.

elvinyung 8 years ago | | |

Actually hmm, do reads need to talk to all replicas in this case (serializable isolation)?
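
Some context on the replica question: in Raft, a write is committed once a majority (quorum) of replicas acknowledge it, rather than every replica. A minimal sketch of that arithmetic for the replication factors mentioned in this thread (3 by default, 5 in one commenter's cluster); the helper names are made up for illustration:

```python
def quorum_size(replicas: int) -> int:
    """Smallest number of replicas that forms a majority."""
    return replicas // 2 + 1


def tolerated_failures(replicas: int) -> int:
    """How many replicas can be unavailable while a majority still exists."""
    return replicas - quorum_size(replicas)


for n in (3, 5):
    print(n, quorum_size(n), tolerated_failures(n))
# 3 2 1
# 5 3 2
```

So with 3 replicas a write waits for 2 acknowledgements, and the latency that matters is the round trip to the nearest other replica rather than to the slowest one.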

some_account 8 years ago |

Congratulations to the cockroach team for putting out an awesome product :)

Would be great to see how it compares against postgres in similar scenarios.

strict9 8 years ago |

Before clicking the comments link, I always know what to expect in HN comment section for a CDB post announcing their latest milestone or feature:

A lot of congrats and excitement, questions about who uses it in a production environment, very specific use-case questions, and of course the name.

Weird how predictable the response to one company/tech always is.

misterbowfinger 8 years ago | |

So. For me, personally, I don't care about the name. I generally care that it's great tech, and it clearly has a great team behind it.

However....

If I worked at CockroachDB, and I saw the negative feedback around the name, I'd take it to heart. At the end of the day, the name is marketing for the hard work of their engineers, and marketing for the engineers that want to use this DB (remember, they need to sell it to their managers who may not be technical).

This issue can show up in unexpected ways. For example, for cloud providers like Compose (IBM company), would they be comfortable with putting "CockroachDB" on the front page? They might if it's good enough, but it's at least a consideration (i.e. another meeting, another stakeholder to convince).

Or how about an enterprise company that's going through due diligence: when their client asks them about their tech stack, do they say "CockroachDB", or do they obfuscate the name by saying "It's a high-performance distributed database"? That's a crucial moment to market CockroachDB, and it could get lost. As sad as it is, saying that you're using MySQL "because Oracle" is a point of leverage for some sales people.

Is the name worth it? Asking honestly.

latenightcoding 8 years ago | |

People complaining about the name and how they are never going to be able to use it in production because of how gross cockroaches are is definitely the most recurring point. I think it worked well for them, since everyone remembers the name, especially with all the distributed stores coming out lately.

ngsayjoe 8 years ago | | |

Sometimes I wonder whether the evolution of cockroaches' grossness has something to do with their high survivability.

johnmarcus 8 years ago | |

came here to comment on the name.

beamatronic 8 years ago | | |

I came here to upvote comments about the name

pieterhg 8 years ago |

Great stuff but this name really doesn’t work. Make it a name with positive connotations.

as1mov 8 years ago | |

While we are changing names for petty reasons, let's rename Python to something else, since a sizeable part of the population has a phobia of snakes.

expliced 8 years ago | |

I think it's a great name once you get the.. uh.. pun behind it.

arbitrage 8 years ago | | |

What is the pun?

mmilano 8 years ago | |

I agree, the unfortunately clever/punny name will detract from potential consideration based on illogical human subconscious thinking.

api 8 years ago |

What drug was the person who drew that graphic on?

evrydayhustling 8 years ago | |

Heavy doses of Hieronymus Bosch? https://en.wikipedia.org/wiki/Hieronymus_Bosch

2474 8 years ago | |

Ask him.

https://www.dalbertbv.com/about/

GrayShade 8 years ago | |

Windows 7 had some similar wallpapers (Scroll down):

https://blogs.msdn.microsoft.com/e7/2009/05/02/a-little-bit-...

johnmarcus 8 years ago |

I will not use this product based on its name alone; it gives me the jeepers. Petty? Damn straight it's petty. Doesn't make it less real though.