I would have assumed the way to evaluate this method would be to back-test it: figure out when in the past this would have predicted that there would be an earthquake, and measure the accuracy/precision/etc. of that prediction.
It often surprises me how often disciplines don't have take a predictive approach by convention: if you make predictions at least it's possible to be *wrong*.
[1] - https://archive.is/tuNnf