Proof of the Neyman-Pearson Lemma
📂Mathematical StatisticsProof of the Neyman-Pearson Lemma
Theorem
Hypothesis Testing:
H0:H1:θ=θ0θ=θ1
In the hypothesis testing above, let θ0,θ1 have a probability density function or probability mass function denoted by f(x∣θ0),f(x∣θ1), and let the rejection region be R, and some constant k≥0, then if
- (i): f(x∣θ1)>kf(x∣θ0), then x∈R
- (ii): f(x∣θ1)<kf(x∣θ0), then x∈Rc
- (iii): α=Pθ0(X∈R)
the following two propositions are equivalent:
Explanation
The parameter space of the given hypothesis test is Θ={θ0,θ1}, and note that the alternative hypothesis is θ∈Θ0c⟺θ=θ1.
Power Function:
- The function β(θ):=Pθ(X∈R) for the parameter θ with the rejection region being R is called the Power Function.
- supθ∈Θ0β(θ)=α then the given hypothesis test is called a size α hypothesis test.
- supθ∈Θ0β(θ)≤α then the given hypothesis test is called a level α hypothesis test.
That all most powerful tests at level α are exactly most powerful tests at the size α means that they satisfy condition (iii). Those hypothesis tests that are the size α
Pθ(X∈R)=Pθ0(X∈R)=α
and Θ0 is a singleton set, so they are also hypothesis tests at the level α.
Proof
Strategy: Let’s prove only for the case of a probability density function, i.e., the continuous case. For discrete random variables, simply change ∫ to ∑. To simplify the proof, let’s define a Test Function ϕ using the indicator function as follows.
ϕ(x):=χR(x)={10,if x∈R,if x∈/R
Thus,
- ϕ is the test function satisfying conditions (i)-(iii)
- β is the power function for ϕ
- ϕ′ is another test function for any other level α
- β′ is the power function for ϕ′
(⟸)
In conditions (i), (ii),
- (i): f(x∣θ1)>kf(x∣θ0) then x∈R⟹ϕ(x)=1
- (ii): f(x∣θ1)<kf(x∣θ0) then x∈Rc⟹ϕ(x)=0
Meanwhile, since 0≤ϕ’(x)≤1,
- (A): x∈R⟹ϕ(x)−ϕ’(x)≥0
- (B): x∈/R⟹ϕ(x)−ϕ’(x)≤0
Therefore, whether (i), (A) or (ii), (B), multiply ϕ(x)−ϕ’(x) by f(x∣θ1)−kf(x∣θ0) to obtain the following inequality.
[ϕ(x)−ϕ’(x)][f(x∣θ1)−kf(x∣θ0)]≥0
Integrating over the entire sample space ∫Ω⋅dx yields
0≤===∫Ω[ϕ(x)−ϕ’(x)][f(x∣θ1)−kf(x∣θ0)]dx∫Ωϕ(x)f(x∣θ1)−ϕ’(x)f(x∣θ1)−ϕ(x)kf(x∣θ0)+ϕ’(x)kf(x∣θ0)dx∫Rf(x∣θ1)−f(x∣θ1)−kf(x∣θ0)+kf(x∣θ0)dxβ(θ1)−β’(θ1)−kβ(θ0)+kβ’(θ0)
By definition, ϕ′ was regarding the test at level α≥β’(θ), and ϕ=supβ(θ) was the test at size α, therefore
β(θ0)−β’(θ0)=α−β’(θ0)≥0
and since k≥0,
0≤β(θ1)−β’(θ1)−[kβ(θ0)−β’(θ0)]≤β(θ1)−β’(θ1)
In summary,
- β’(θ1)≤β(θ1), and
- ϕ′ was any level α, and
- θ1 is the only element of Θ0c,
it has been shown that the hypothesis test satisfying conditions (i)-(iii) is the most powerful test.
Most Powerful Test:
Let C be the set of hypothesis tests as mentioned above.
In C, a hypothesis test A with the power function β(θ), for all θ∈Θ0c and against all power functions β′(θ) of all hypothesis tests in C,
β′(θ)≤β(θ)
is called the (Uniformly) Most Powerful Test, UMP.
(⟹)
Now, let’s assume ϕ′ is the test function for the level α most powerful test.
Since ϕ satisfies conditions (i)-(iii), the corresponding hypothesis test is also the most powerful test, and the value of the power function is the same for all θ∈Θ0c. That is β(θ1)=β’(θ1) and,
0≤=β(θ1)−β’(θ1)−k[β(θ0)−β’(θ0)]0−k[β(θ0)−β’(θ0)]
- By organizing the inequality obtained above, we get β(θ0)≤β’(θ0), and since ϕ was the size α=supθ∈Θ0β(θ) hypothesis test, then α≤β’(θ0).
- From the premise, ϕ′ was the level α≥supθ∈Θ0β′(θ) hypothesis test, therefore β′(θ)≤α.
According to both inequalities, β′(θ)=α, and the hypothesis test of ϕ′ is exactly a size α hypothesis test. However, this inequality seems to hold only outside the set A⊂Ω given ∫Af(x∣θ)dx=0, thus in the summary, A should be considered an exception.
■