ELT Schedules Can Improve Root Cause Analysis for Data Engineers (montecarlodata.com)
We put together this blog post showing how an orchestration DAG (such as a dbt schedule DAG) can be converted into a Bayesian network (BN). You can then ask causal attribution questions as conditional probability queries against the BN. The idea is still fairly basic and preliminary, but I think it could be extended in all sorts of interesting ways, e.g. attributing bad row-level data to upstream transformations.
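To make the idea concrete, here is a minimal sketch in plain Python. It is not the code from the post: the job names, the failure probabilities, and the rule that a job fails with a fixed probability once any parent has failed are all assumptions made up for illustration. It treats each job as a binary failed/ok variable and answers "given that a downstream job failed, how likely is it that a given upstream job failed?" by brute-force enumeration.

```python
from itertools import product

# Hypothetical schedule DAG: each job maps to the upstream jobs it depends on.
PARENTS = {
    "extract_orders": [],
    "extract_users": [],
    "join_orders_users": ["extract_orders", "extract_users"],
    "daily_report": ["join_orders_users"],
}

# Assumed probability that a job fails on its own when no parent failed.
BASE_FAIL = {
    "extract_orders": 0.05,
    "extract_users": 0.01,
    "join_orders_users": 0.02,
    "daily_report": 0.01,
}

# Assumed P(job fails | at least one of its parents failed).
PROPAGATE = 0.95


def p_fail(job, state):
    """Conditional probability that `job` fails given its parents' states."""
    if any(state[parent] for parent in PARENTS[job]):
        return PROPAGATE
    return BASE_FAIL[job]


def joint(state):
    """Joint probability of one full failed/ok assignment (BN factorization)."""
    prob = 1.0
    for job, failed in state.items():
        pf = p_fail(job, state)
        prob *= pf if failed else 1.0 - pf
    return prob


def posterior(query, evidence):
    """P(query job failed | evidence), summing over all 2^n assignments."""
    jobs = list(PARENTS)
    numerator = 0.0
    denominator = 0.0
    for values in product([False, True], repeat=len(jobs)):
        state = dict(zip(jobs, values))
        # Skip assignments that contradict what was observed.
        if any(state[job] != value for job, value in evidence.items()):
            continue
        p = joint(state)
        denominator += p
        if state[query]:
            numerator += p
    return numerator / denominator


if __name__ == "__main__":
    observed = {"daily_report": True}  # the report job was seen to fail
    for job in ("extract_orders", "extract_users", "join_orders_users"):
        print(job, round(posterior(job, observed), 3))
```

Enumeration is exponential in the number of jobs, so this only works for toy DAGs; a real schedule would need a proper inference method, and the probabilities would have to be estimated from run history rather than hard-coded.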
Would be interested to hear what people think.