Time, Clocks, and the Ordering of Events in a Distributed System – Paper Review

Time, Clocks, and the Ordering of Events in a Distributed System – Paper Review(percisely.xyz)

85 points by ekez 2 years ago | 22 comments

Thanks for the link, really interesting!

We have a distributed system (clouds & cars) that message events which need to be processed in order. However, the clock of some system participants (cars) are drifting quite often. We plan to use a logical clock for the ordering instead soon.

AdieuToLogic 2 years ago | |

> We have a distributed system (clouds & cars) that message events which need to be processed in order.

Just out of curiosity, would a vector clock[0] be applicable for your problem domain?

0 - https://en.wikipedia.org/wiki/Vector_clock

Phelinofist 2 years ago | | |

AFAIU Vector clocks are hybrid clocks and we plan to use a hybrid one, should've mentioned that from the start.

michaelt 2 years ago | |

No way of accessing your cars' GPS receivers' 100-nanosecond-precise time signal, huh?

econner 2 years ago | | |

Yea I always liked the way Spanner solved this. Let's not bother with any of the distributed system theory and just use that Google money to build in to the system extremely precise clocks :-D.

(Obv I know it's more complicated than that, but where else can you be reductionist if not the internet)

willis936 2 years ago | | |

It probably won't be 100 ns accurate in motion, but still quite accurate. Practically speaking you won't get better than ~1 s unless you connect the PPS pin to an interrupt or PLL.

With a bare minimum implementation you should be able to get the GPS time from the NMEA strings. You just need to guarantee you'll have GPS lock at least intermittently, which is probably true for a fleet of monitored cars in nearly all situations.

Phelinofist 2 years ago | | |

That's what I asked them as well and apparently... no.

myk9001 2 years ago | |

You might also find this paper interesting. To be fair, so far I only had a chance to give it a very superficial look.

https://dl.acm.org/doi/pdf/10.1145/112600.112601

YetAnotherNick 2 years ago | |

Is it P2P connection or something? You could easily order events using kafka, redis or any alternative in cloud if they are less than million per second range.

myk9001 2 years ago | |

Disclaimer: I'm not a distributed systems expert by any stretch of the imagination. Just picked up an interest in the subject some time ago. So, the things I'm about to say may sound silly.

Do the cars participating in the system broadcast/multicast messages to each other? If so, logical clocks like Lamport clocks or vector clocks can be of great use. Logical clocks help capture the order of events in a distributed system (sending or receiving a message is one kind of event, but not the only).

To give an example, let's say we have cars A, B, and C, they broadcast messages with Lamport timestamps. B broadcasts (lts: 33, msg: y), A broadcasts (lts: 18, msg: x). No matter in what order C receives the two messages, it knows A could not have sent "x" in response to "y", as 18 < 33. However, the converse isn't true -- i.e., 18 < 33 doesn't imply B sent "y" in response to "x", the two messages could've been broadcasted concurrently.

If you do want to be able to tell if one event might've caused another based on their logical timestamps, a vector clock is a great choice.

All that said, if this isn't as much about cars broadcasting messages to each other, as it's about cars sending messages to the server, pure logical clocks are not a tool meant for this job. This scenario calls for real-time ordering -- i.e., an imaginary omnipotent observer who could assigns a timestamp based on their own watch to each message could solve this. In the unfortunate absence of such an observer, the approaches to deal with this I'm aware of are:

- A central timestamp server. While this comes with all the obvious downsides, it's also a relatively simple and straightforward solution.

- Keeping the drift as small as possible and establishing an upper bound on it. Then waiting out the uncertainty interval before acquiring a timestamp. Basically, something akin to what Spanner is doing with their atomic clocks. A caveat here is if the cars are competing for offers with these messages, they might not be exactly happy with this strategy.

- Similar to the previous point, but without atomic clocks? Maybe you can adapt some ideas from CockroachDB[^1]?

- Using hybrid logical/physical clocks might be an option[^2][^3]

---

[^1]: https://www.cockroachlabs.com/blog/living-without-atomic-clo...

[^2]: http://users.ece.utexas.edu/~garg/pdslab/david/hybrid-time-t...

[^3]: https://cse.buffalo.edu/tech-reports/2014-04.pdf

Phelinofist 2 years ago | | |

Thanks for your comment, very interesting and great links.

Broadcasts are always done via the Cloud, not P2P. We plan to use a hybrid clock, should have mentioned that.

andsoitis 2 years ago |

The paper: https://amturing.acm.org/p558-lamport.pdf

reality_inspctr 2 years ago |

this is great! very clever logic model.

I wrote something I think is highly relevant about a distributed L1 for time, that you might enjoy, OP?

[oc] https://www.seanmcdonald.xyz/p/the-clockchain-protocol-the-l...

jpgvm 2 years ago |

Possibly the most important distributed systems paper to read, along with the rest of Lamport naturally.

tincholio 2 years ago | |

This is the first Lamport paper I read (over 20 years ago now), and it has stuck with me since then, it's so clearly explained...

frenchLeaf 2 years ago |

What about ticsync?