In the first part of this piece I pointed out why it can be difficult to validate TAR using control set metrics. When the overall proportion of responsive documents is very low, it becomes ...
We revisit the problem of determining the sample size for a Gaussian process emulator and provide a data analytic tool for exact sample size calculations that goes beyond the n = 10d rule of thumb and ...
Back in the day, we learned in statistics that you need a sample size of at least 2% of the size of population to make statistically significant conclusions about the behavior of the population. In ...