UUIDs are popular, but bad for performance (2019)

256 points by timeoperator 4 years ago | 232 comments

web007 4 years ago |

This article talks about random IDs leading to page thrashing, and MySQL b-tree indexes not handling them well. They are bad for _MySQL performance_.

It doesn't talk about NoSQL or sharding, where random IDs usually perform much better than sequential due to a lack of hot shards.

If you distribute your reads and writes at random across N machines you can get ~Nx performance vs one machine. If you make your writes sequential, you'll usually get ~1x performance because every insert goes to the same machine for this second/minute/day, then rolls to a new one for the next period. There are sharding schemes that can counter this, but they require insight into your data design before implementation.

nightpool 4 years ago | |

Sure, but each individual machine still has to do the same slow random lookups, right? Generally you want some deterministic component (for caching) and some random component (for sharding). Snowflakes work well for this, since you can use the upper bits for predictable caching and the lower bits for random entropy.

hifriends 4 years ago | |

its a MySQL blog, why would it talk about NoSQL?

m0rphling 4 years ago | | |

I don't know if you've actually been to their blog recently, but Percona's community includes NoSQL as they maintain their own distro of MongoDB--similar in spirit to their MySQL and PostgreSQL offerings.

https://www.percona.com/blog/category/mongodb

https://www.percona.com/software/mongodb

tehlike 4 years ago | |

Even in MySQL - one can use Sequential UUIDs.

inetknght 4 years ago | | |

I never understood why people would use sequential UUIDs. That rather defeats the purpose of a UUID. If you need something sequential then just use a much more simple number

yeldarb 4 years ago |

Interestingly, for other systems you sometimes want the exact opposite: for your key space to be distributed across indexes to balance the load (vs wanting them all to hit the same “hot” index for MySQL).

For example, Google’s Cloud Firestore is bottlenecked to 500 writes/second if your key is monotonically increasing (like a timestamp or these timestamp-based UUIDs) causing you to “hotspot” the index: https://cloud.google.com/datastore/docs/best-practices#high_...

ummonk 4 years ago | |

Yeah S3 has similar performance issues where accessing objects with the same prefixes has lower throughput because they get sharded onto the same server. It's very counterintuitive when you're used to how performance works on single computers where you want to optimize for cache-locality.

sterwill 4 years ago | | |

This used to be the case, but it's not true any more.

For example, previously Amazon S3 performance guidelines recommended randomizing prefix naming with hashed characters to optimize performance for frequent data retrievals. You no longer have to randomize prefix naming for performance, and can use sequential date-based naming for your prefixes.

https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimi...

nielsole 4 years ago | | |

Ugh really? Why would they not hash the whole filename for shard assignment?

mdasen 4 years ago | |

Is it that they truly want it distributed across the key space or distributed across the shards?

With MySQL, you're writing to a single box. Because of that, you'll want to be on hot pages. With Firestore, you're writing to many boxes. If you hotspot with Firestore, you're only writing to one box. If you write completely randomly to Firestore, maybe each box is bottlenecked to 400 wires/second (rather than 500) because there's no locality on each box. But if Google isn't charging you based on locality, there's no penalty for you since it'll scale up to more boxes (invisibly behind the scenes).

If you're only writing to one box, might as well take advantage of locality. If you have the opportunity to write to many boxes, you don't want all of the load going to one box.

However, one could imagine a distributed system where you could try and get both. Create an ID that has the ShardId to route it to the correct box and then something sequential after the ShardId. If you had 1,000 shards and 25 boxes, each box would be responsible for 40 shards so you wouldn't be perfectly sequential, but you'd kinda end up with 40 hot pages rather than randomly writing all over the place. It would also give you ample room to scale up your cluster.

So there is room to apply this technique to a distributed system as well.

Oh, one thing I'd note is that MySQL's InnoDB uses index-oriented (clustered) tables as they note in the article (which is why this is important with MySQL). PostgreSQL doesn't use index-oriented tables so new rows are (I believe) just written sequentially by default regardless of ID. It will have to update the primary-key index, but because the PK will be a lot smaller than the whole rows, you probably don't have to worry as much about randomly writing there. It's easier to keep an index hot than to keep a whole table hot.

kyrra 4 years ago | |

Your other option is to hash your monotonically increasing numbers. You don't want to use something like SHA, as it does not give good distribution. You use something like murmur hash (https://en.m.wikipedia.org/wiki/MurmurHash). I've used it before for indexes at Google for values that may hotspot. See Google's impl here: https://github.com/google/zetasketch/blob/master/java/com/go...

rossmohax 4 years ago | | |

> You don't want to use something like SHA, as it does not give good distribution

How come? Even distribution is one of requirements for crypto hash functions.

isoprophlex 4 years ago | | |

Can you elaborate on the claim about SHA? If I'm not mistaken, all these cryptographic hashes have a 50% chance of flipping any one output bit upon changing a single input bit -- is there some hidden gotcha I'm unaware of?

espadrine 4 years ago | |

That is generally true for most if not all NewSQL databases, as far as I know. Spanner, CockroachDB, TiDB, … Maybe even Aurora?

The second argument on secondary indexes’ size still holds water, if it matters for a given application.

qalmakka 4 years ago |

Isn't this easily solved by supporting 128 bit keys and using UUIDs as intended, i.e. as integers and not in their string serialization? This is as nonsensical as storing IPv4 as strings instead of 32 bit integers.

mhoad 4 years ago |

I recently read a book by Google’s head guy on API design that was specifically about designing APIs and it had a big section on what makes a good identifier and why people reach for UUIDs and why specifically it is a problem on multiple levels.

The thing that he ended up recommending however was super interesting in that I had never seen it mentioned before but it was basically to use this instead http://www.crockford.com/base32.html

drenei 4 years ago |

Bad for performance as primary keys.

But, still provide strong value as a unique identifier which is what makes them popular.

I’ve used integers as primary keys, with UUIDs as alternate keys for external-to-the-data-store queries.

roenxi 4 years ago |

> and purely random (version 3)

This is a typo, v3 isn't random. It is generated deterministically from inputs.

> The only “repeated” value is the version, “4”, at the beginning of the 3rd field. All the other 124 bits are random.

And this is close but not quite correct. UUID v4 has a couple of other fixed bits, there are only 121-122 random ones. There are patterns in the text representation other than constant numbers. :)

gkop 4 years ago | |

This blog post is over two years old, that 3 vs 4 error is inexcusable.

joshstrange 4 years ago |

I'm sure I take bigger penalty hits for less, sorry but UUID's "click" in my head and they prevent a whole slew of foot-guns. It would be one thing if auto-inc was the same as UUID but there are really annoying things with auto-inc (not knowing the id until after insert being top of mind) and also you can generate UUID's client-side/offline if needed. Yes, I know some people argue for auto-inc as primary and still use UUID for client-facing but I don't understand what that really gets you and it seems way more complicated.

postalrat 4 years ago | |

Typically, it gets your performance. And if performance is what you need it may be one of the simpler things you can do.

olliej 4 years ago |

I am very much not a database person, so forgive me if this is a dumb question.

I'm reading this article and it says that UUID are compared byte by byte, and seems to be indicating they're stored as string. Is that actually the case? I would have assumed that SQL supported 128 bit ints, but this seems to imply it does not.

Another question: if a column is set to char(fixed size) do the various sequel engines really not optimise to do multi word comparisons? (e.g. 8byte at a time, then 4, 2, 1, as size requires)

jrochkind1 4 years ago | |

Not a dumb question, I think you've hit on a key oddity here.

This article is about MySQL, apparently it's really the case in MySQL?

It's not the case in every rdbms universally. Postgres has a uuid type that stores them how you would (rightfully) expect.

I have no idea why MySQL does it this way, it does seem odd.

paulryanrogers 4 years ago | | |

MySQL doesn't have a UUID type, so naive implementations use (VAR)CHAR. Those in the know use BINARY(16) and MySQL 8 now has helper functions to convert to and from hex. Apparently MariaDB will soon have a native UUID type. PostgreSQL has had them from for years.

sudhirj 4 years ago |

I recently wrote about how encoding ULIDs in the UUID format could help with some of these problems

https://news.ycombinator.com/item?id=29794186

https://sudhir.io/uuids-ulids

sandman008 4 years ago | |

Hi Sudhir, Love your blog posts. Keep them coming.

gls2ro 4 years ago |

A lot of things that we do for security or privacy are bad for performance, but I think they are still good tradeoffs.

eerikkivistik 4 years ago | |

Agreed. Using UUID-s for keys is useful to exclude entire classes of security issues. Most notable are many kinds of enumeration attacks.

piaste 4 years ago | | |

Also entire classes of bugs. You screwed up a JOIN, or an application-level lookup for that matter? With UUIDs you'll get no results, with sequential ints you'll get a valid but wrong result.

Or worse, the right result for the wrong reason. I've actually seen a case where creating a new entity in the application populated X records in X child tables, each with a sequential ID, and as a result all of them had the same surrogate PK. They were 1:N relationships in principle, but the software wasn't feature complete yet so the actual records were all 1:1.

Years later one of those tables finally received some extra records, and it caused a really weird bug because a query had accidentally used the PK instead of the FK as a join key, but for years it had happily chugged along because the two columns were in sync.

datavirtue 4 years ago | | |

Quit exposing your keys.

tluyben2 4 years ago | |

Yep, so we need solutions for using these practices in a performant way rather than hearing they are not 'good' and then having to explain over and over again why they are there. Our datasets that use UUIDs have not had issues with performance but of course we keep looking for ways to keep using UUIDs while improving performance. It would be better to provide solutions on how to do that and spend time to get performance on par with int keys. Like someone else said; 32 bit int keys are no good anyway for many cases, so let's go to 128 bit, optimize for that and everyone is happy.

Edit: like https://news.ycombinator.com/item?id=29851653

manuelabeledo 4 years ago | |

And for data safety as well.

In an environment where millions of events are processed every second, being able to uniquely identify them is a must, and temporal keys are not always an option.

kgeist 4 years ago |

Just a month ago we migrated many of the columns from integers to UUIDs (encoded as binary(16)) in several critical tables which are pretty large (Percona server, too), and so far I haven't heard about any serious performance degradation after the release.

jcelerier 4 years ago | |

what's pretty large for you though ? There are fields where 100k entries is a pretty large dataset and others where "large" starts at petabyte

kgeist 4 years ago | | |

I didn't mention that our DB setup uses sharding, and every tenant has their own DB shard (there are tens of thousands of shards). I just checked that one of the largest tenants has 2.2 mln rows in one of the affected tables, which is usually joined with 2-4 more related tables using UUIDs (another such table is 1.1 mln rows, for example), and they're on the hot code path because it's the core of the system. Maybe with sharding the difference is negligible? During code review I raised the concern that inserts and joins can become much slower after we migrate it to UUIDs, but so far my fears haven't materialized. Usually with these tables we have performance problems on the application side, not in the DB, such as ORM fetching data in a very inefficient way, or using too much RAM. Maybe it'll bite us in the long term as the tenants' shards grow in size, who knows.

ebingdom 4 years ago |

> Let’s begin by the base64 notation. The cardinality of each byte is 64 so it takes 3 bytes in base64 to represent 2 bytes of actual value.

Wait, what? I thought it takes 4 base-64 digits to represent 3 bytes of data. Not 3 base-64 digits to represent 2 bytes of data.

brabel 4 years ago | |

base64 means the "vocabulary" used has 6 bits (2^6 = 64). Hence, to complete a full number of bytes (without using any padding), you need 4 b64-letters:

    4 * 6 = 24 = 8 * 3

So, you're correct... the exact amount of bytes that it takes to represent "actual" bytes goes like this:

    Actual bytes | b64 bytes required | overhead
    1            | 2                  | 2x
    2            | 3                  | 1.5x
    3            | 4                  | 1.33x
    4            | 6                  | 1.5x
    5            | 7                  | 1.4x
    6            | 8                  | 1.33x
    7            | 10                 | 1.43x
    8            | 11                 | 1.37x
    9            | 12                 | 1.33x

EDIT: As you can see, this averages with an overhead of between 33% (best case scenario where the encoding requires no padding, happens every 3 rows above) and something like 37%, decreasing with the number of bytes being encoded and approaching the minimum, 33% (e.g. to encode 1024 bytes, you need 1366 b64 digits, an overhead of 1.333984375x).

rawling 4 years ago | |

Maybe they're using a 9-bit byte? :)

immutology 4 years ago |

Microsoft SQL Server / Azure SQL support sequential UUIDs to solve the index distribution problem: https://docs.microsoft.com/en-us/sql/t-sql/functions/newsequ...

It's better than nothing, but one of the values of UUIDs for identifiers is that you can create new ones client-side while offline. These "sequential" UUIDs will fail standard UUID validation because of the byte swapping and, in my experience, when used offline-capable apps, will result in sparse clusters of sequential UUIDs that yield an unpredictable improvement over truly random UUIDs.

NicoJuicy 4 years ago |

I think the author is missing an overall picture, eg. Event driven scenario's.

Where you don't have to check collisions with a db. He mentioned generating the pk's on remote client, but that doesn't capture the interesting bits.

You generate the newly created object with the guid.

You send it to the API/Microservices and it's generated, fire-and-forget style. And the remote client has an Id of the newly created object to do something with.

VWWHFSfQ 4 years ago | |

But now the remote client has an ID of something that may or may not exist the next time they try to use it depending on whether or not it actually made its way into the database.

I've seen this kind of architecture before. It sounds nice but is loaded with consistency problems.

nijave 4 years ago | | |

There's patterns for implementing it. It's not really any different than using more than 1 database which is unavoidable in many scenarios (interacting with 3rd parties)

https://chrisrichardson.net/post/sagas/2019/08/04/developing...

NicoJuicy 4 years ago | | |

That's why the pattern is eventual consistency.

You receive a message "{entity}Created" and it contains the Id of the full object.

globular-toast 4 years ago | | |

Well, you would have some kind of synchronisation protocol where the database confirms it has received that record and it now exists.

kune 4 years ago |

People should have a look as k-sortable unique identifiers (KSUID). Binary they are represented by 20 bytes and their string representation has 27 characters, which is shorter than UUIDs since the use a base62 encoding. They are sortable since the 20 bytes start with a 32 bit UNIX timestamp followed by random 128 bits. They should be very efficient for clustered indexes / B+-Trees.

Note also that as long as you have a single central database you don't need UUIDs. They are only needed if you have several processes creating objects without coordination.

itsdrewmiller 4 years ago | |

Sounds like a ULID that doesn't fit in a UUID column - https://github.com/ulid/spec

hn_throwaway_99 4 years ago |

Note some of the newly proposed UUID formats would take care of some of these issues [1]. They are time-ordered but still have a good bit of random entropy.

1. https://news.ycombinator.com/item?id=28088213

abujazar 4 years ago |

The author seems to have forgotten the obvious alternative of using ints for database primary keys and uuids externally.

andix 4 years ago |

Performance is really bad, if you save them as strings and use those strings as keys. If you take them as 128 bit integers, it’s kind of fine.

Often 32 bit integers are anyway not long enough as a primary key, so you need at least 64 bit keys. UUID is just double the size then.

Zigurd 4 years ago |

Only if you are creating persistent objects in multiple places that may not be in communication with one another, and that at some point need to be distinguished from each other, do you need UUIDs.

Even in mobile apps, which seem like the most common use case for needing them: Unless your app creates these objects while not connected, you don't need UUIDs.

If your backend database is where objects get created, you don't need UUIDs.

Aardwolf 4 years ago |

> The missing 4 bits is the version number used as a prefix to the time-hi field.

Why would you use 4 bits for a version number in something that's supposed to be unique? What is the benefit of following this specification despite such cost, versus creating 128 unique bits based on time / random generators / machine IDs yourself?

Borealid 4 years ago | |

The ability to mix different types of ID in one column or one business data store.

When you start with random UUIDs, and then decide you actually wanted per-host-namespaced ones halfway through, if you have allocated zero bits for the version ID you're up the creek.

You're trading off present-day efficiency for future-day flexibility. Whether that's wise for a particular case depends on that case.

MauranKilom 4 years ago | |

Because 124 random bits are still way enough to make collisions extremely unlikely (needs on the order of 2^62 UUIDs even with birthday paradox - good luck storing them all), yet they are still recognizable as being of that specific format.

busymom0 4 years ago |

Slightly related question-

does anyone know what data type Reddit uses for their post/comment id? It seems like a short alphanumeric yet unique identifier and much shorter than a uuid.

Also what does twitter use for their post/comment id? Seems like some sort of big int?

Dachande663 4 years ago | |

Twitter created Snowflake many years ago to generate IDs. Unsure whether it’s still in use.

https://blog.twitter.com/engineering/en_us/a/2010/announcing...

erwincoumans 4 years ago | | |

Twitter uses strings instead of ints to represent the 64 bit UID, because Javascript only supports 53bit ints instead of 64bit... https://developer.twitter.com/en/docs/twitter-ids

erwincoumans 4 years ago | | |

That sounds nice and simple, and fits in 64 bits (vs 128):

"we settled on a composition of: timestamp, worker number and sequence number. Sequence numbers are per-thread and worker numbers are chosen at startup via zookeeper"

damagednoob 4 years ago |

> The remaining of the UUID value comes from the MD5 of a random value and the current time at a precision of 1us.

I might be misunderstanding something here but if your random seed is based on time, under high-concurrency, doesn't this risk collisions? I can't see any thread-safety guarantees in the documentation.[1]

[1] https://dev.mysql.com/doc/refman/8.0/en/mathematical-functio...

hashimotonomora 4 years ago | |

If the column is UNIQUE there’s no collisions it will just fail to INSERT.

damagednoob 4 years ago | | |

Then it doesn't have the same quality as a UUID which is supposedly guaranteed to be unique across space _and_ time[1].

[1] https://datatracker.ietf.org/doc/html/rfc4122

busymom0 4 years ago |

Is this specific to MySQL or does it apply to Postgres too?

pachico 4 years ago |

This topic has been raised more than once at my work and it scared a lot of people. It's important to understand your use case before you embrace any other solution.

This will affect you only when you are frequently creating and storing new IDs, which leads to reshaping btrees.

If you have IDs and its number is under control and doesn't change a lot you're just fine.

dorongrinstein 4 years ago |

That's the reason I've chosen postgres over mysql years ago and I don't use clustered indexes.

peheje 4 years ago | |

What does postgres do differently here to still have good performance?

jetzzz 4 years ago |

What's the purpose of using non-random data such as time and MAC address in UUID? It increases probability of collision compared to purely random bits and if you need time or MAC it seems better to store them as separate fields - easier to perform selects, groupings, etc.

kazinator 4 years ago |

> Even if you use pseudo-ordered UUID values stored using binary(16), it is still a very large data type which will inflate the size of the dataset

Every IPv6 datagram has a pair of source and destination UUIDs. :)

yalogin 4 years ago |

Why would anyone use a random value as a primary key? I haven’t done databases in a long time but isn’t it standard practice to use a sequential incrementing value for the primary key?

tapas73 4 years ago | |

reasons are: 1. no need for coordination in distributed system, no need to check what is the next available id. 2. if userid is visible to users, sequential userid gives away information about the amount of users and allows guessing other valid userids.

yalogin 4 years ago | | |

This is exactly over engineering and solving for the wrong problem. It can be solved easily d efficiently in other ways.

tehlike 4 years ago |

Sequential UUIDs fix this problem.

lepetitchef 4 years ago |

Fun improvement from my project a long time ago, uuid has 36 char and to save space, we created a shorten version by removing all the dash character in uuid. The result is 4 char could be removed, the field in MySQL table only needs 32 char.

isoprophlex 4 years ago | |

The point is, a uuid contains 128 bits of data, so a varchar(32) is still somewhat excessive

Too 4 years ago | | |

Why varchar over char? The length is always the same.

jimmaswell 4 years ago |

This seems to be just about using UUIDs as indexes in a DB, not using UUIDs in general as an ID for things which I'm not seeing any reason not to continue doing.

nightpool 4 years ago | |

Sure, as long as you never want to look things up or reference them by ID, then there's no reason to worry. Otherwise, yes, you'll have the exact same problems laid out in the article