Hypothesis Testing Through Bayesian Factors
Buildup
To be able to use classical hypothesis testing, one must have a mathematical understanding of concepts such as rejection region, p-value, and even a statistical sense intuitive enough to understand them. It is no surprise that many students, even at the freshman college level, spend hours being taught and still fail to properly understand hypothesis testing. It is similar to how many students learn statistics in high school, find problem-solving easy, yet do not grasp its true meaning.
Hypothesis Testing 1
On the other hand, Bayesian statistics allows for very easy hypothesis testing through something called the Bayes Factor.
Let’s assume null hypothesis and alternative hypothesis are given as vs .
- is called prior information for each null hypothesis and alternative hypothesis respectively.
- is called posterior information for each null hypothesis and alternative hypothesis respectively.
- is called the Bayes factor supporting .
Looking closely at the Bayes factor in , and can freely enter each position. Therefore, there is no need to memorize the formula in a complicated way, just remember that is at the very top and is at the very bottom.
In Bayesian analysis, hypothesis testing is simple: if is greater than , it supports the null hypothesis; if smaller, it supports the alternative hypothesis. In particular, thinking of it this way makes understanding much simpler. In simple terms, if the probability of the null hypothesis is higher when actually calculated with the data, it supports the null hypothesis. There’s no need to think about rejection regions or p-values.
If it said , it means the posterior information supports to the extent that it is times more than it supports .
Jeffrey’s Interpretation
Regarding the extent to which the null hypothesis is supported, Jeffrey proposed the following interpretation. From the perspective of supporting , the Bayes factor is interpreted as follows:
- : Weak evidence
- : Positive evidence
- : Strong evidence
- : Very strong evidence
The advantage of this interpretation is much more flexible compared to the extreme dichotomy of whether ’the p-value exceeds the significance level or not’ of frequentist hypothesis testing. If you frequently use regression analysis, you might have wanted to set the significance level to , but the p-value turned out to be , leading you to discard the regression coefficient. Honestly, as analysts are human, experiencing this can only be frustrating. Hence, one looks for solutions in every possible way, but most end up futile.
In contrast, Bayesian hypothesis testing simply accepts the data as is, whether sufficient or not.
Example
When , to conduct a Bayesian test against vs . The prior probabilities of and are the same, under and the observation is . Calculate the Bayes factor .
Solution
Therefore, is weak evidence supporting the null hypothesis.
김달호. (2013). R과 WinBUGS를 이용한 베이지안 통계학: p159~161. ↩︎