As discussed on [this thread](https://twitter.com/natematias/status/1495793971523641346) and expanded on in the [DIFF paper](http://www.bailis.org/papers/diff-vldb2019.pdf) * Support for the other metrics  * Introduce p-values and multiple hypothesis testing 