Clever ideas that failed (2010)(yacoset.com) |
--- King James Programming (http://kingjamesprogramming.tumblr.com/post/136036727910/55-...)
Spark has a scheduler that takes data locality into account (although I don't have a ton of experience with it and have no idea how well it works at a fine-grained scale). That itself is probably inspired by Borg and MapReduce, both of which have some kind of data-locality-based job scheduling logic.
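To make "data locality-based scheduling" concrete, here is a toy sketch of the general idea: prefer a node that already holds a task's input block, and fall back to any node with a free slot. This is not how Spark, Borg, or MapReduce actually implement it; `assign_tasks`, the node names, and the block names are all made up for illustration.

```python
from typing import Dict, Set


def assign_tasks(task_blocks: Dict[str, str],
                 block_locations: Dict[str, Set[str]],
                 free_slots: Dict[str, int]) -> Dict[str, str]:
    """Greedy sketch: prefer a node that already holds the task's input block."""
    slots = dict(free_slots)  # copy so the caller's dict is left untouched
    assignment: Dict[str, str] = {}
    for task, block in task_blocks.items():
        # Nodes that hold the block locally and still have capacity.
        local = [n for n in sorted(block_locations.get(block, set()))
                 if slots.get(n, 0) > 0]
        # Fallback: any node with capacity (data would be read remotely).
        anywhere = [n for n in sorted(slots) if slots[n] > 0]
        candidates = local or anywhere
        if not candidates:
            break  # cluster is full; remaining tasks would have to wait
        node = candidates[0]
        assignment[task] = node
        slots[node] -= 1
    return assignment


plan = assign_tasks(
    task_blocks={"t1": "b1", "t2": "b2", "t3": "b1"},
    block_locations={"b1": {"node-a"}, "b2": {"node-b"}},
    free_slots={"node-a": 1, "node-b": 2},
)
print(plan)
```

Here `t3` wants `b1` on `node-a`, but that node is already full, so it spills over to `node-b`. A real scheduler would presumably weigh waiting for a local slot against running remotely right away.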
This is brilliant.
Any time you are storing data, the acceptable number of items to handle is none, exactly one, or a number limited only by machine resources. Don't design a container to hold, say, up to 5 items.
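A minimal sketch of what that looks like in practice, assuming a hypothetical `Contact` record (the class and field names are invented for illustration): the single-valued field is one optional value, and the multi-valued field is a growable list rather than a fixed set of slots like `phone1` through `phone5`.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Contact:
    name: str
    # "None or exactly one": a single optional value.
    primary_email: Optional[str] = None
    # "Limited only by machine resources": a growable list, not five fixed slots.
    phone_numbers: List[str] = field(default_factory=list)

    def add_phone(self, number: str) -> None:
        # No arbitrary cap here; the list grows as needed.
        self.phone_numbers.append(number)


contact = Contact("Ada")
for i in range(12):
    contact.add_phone(f"555-01{i:02d}")
print(len(contact.phone_numbers))
```

If a limit really is needed for business reasons, it can live in validation logic where it is easy to change, instead of being baked into the shape of the container.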