CURRENT RESEARCH TOPICS

 

EM P-values. The standard definition of a P-value for testing composite hypotheses based on a statistic T is the tail probability maximised over the null. We prove what has previously been understood informally, namely that this M-step produces the smallest possible P-value subject to the ordering induced by T and test validity. Thus, partially maximised P-values as suggested by Berger and Boos (1994) are inadmissible. On the other hand, allowing for the worst case through maximisation will be more attractive when the test statistic has properties that are stable over the null hypothesis. We suggest replacement of the unknown parameter by an estimate under the null and call this the E-step. The resulting P-value is not valid until the M-step has been applied to produce the EM P-value. The E-step can be applied more than once to further stablise the properties of the test statistic. We illustrate the ideas on the Behrens-Fisher problem and reproduce the better known exact solutions. We also look at testing independence in a 2x2 table and show that the E-step is successful in stabilising performance and that EM P-values appear to be more powerful, and computationally less difficult, than other methods. A third draft of this paper is HERE and my Auckland talk is HERE.