logo

Laplace's Succession Rule 📂Mathematical Statistics

Laplace's Succession Rule

Theorem 1

Let’s say the prior distribution of the binomial model p(yθ)=(ny)θy(1θ)ny\displaystyle p(y | \theta) = \binom{ n }{ y} \theta^{y} (1- \theta)^{n-y} follows a uniform distribution U(0,1)U (0,1) and the posterior distribution follows a beta distribution β(y+1,ny+1)\beta (y+1 , n-y+1), hence p(θy)θy(1θ)nyp( \theta | y ) \sim \theta^{y} (1- \theta)^{n-y}. Then, for the data obtained so far yy, the probability of observing a new y~\tilde{y} being 11 is p(y~=1y)=y+1n+2 p(\tilde{y} = 1| y) = {{y+1} \over {n+2}}

Explanation

From a frequentist’s perspective, the probability of y~=1\tilde{y} = 1 will be close to the sample rate yn\displaystyle {{y} \over {n}}. However, fundamentally as nn increases, since y+1n+2yn\displaystyle {{y+1} \over {n+2}} \simeq {{y} \over {n}}, both frequentists and Bayesians will eventually make similar estimates as the sample size increases. On the other hand, if we consider the case of having conducted no trials, that is n=0n=0, it matches well with the uniform distribution as the prior, which is p(y~=1y)=12\displaystyle p(\tilde{y} = 1| y) = {{1} \over {2}}. This effectively demonstrates, through formulas, that our inference started from θ=12\displaystyle \theta = {{1} \over {2}}.

Proof

p(y~=1y)=01p(y~=1θ,y)p(θy)dθ=01p(y~=1θ)p(θy)dθ=01θp(θy)dθ=E(θy) \begin{align*} p (\tilde{y} = 1| y) =& \int_{0}^{1} p(\tilde{y} = 1| \theta , y) p (\theta | y) d \theta \\ =& \int_{0}^{1} p(\tilde{y} = 1| \theta ) p (\theta | y) d \theta \\ =& \int_{0}^{1} \theta p (\theta | y) d \theta \\ =& E( \theta | y) \end{align*}

Since θp(θy)\theta p (\theta | y) follows the beta distribution β(y+1,ny+1)\beta (y+1 , n-y+1),

E(θy)=y+1(y+1)+(ny+1)=y+1n+2 E( \theta | y) = {{y+1} \over {(y+1) + (n-y+1)}} = {{y+1} \over {n+2}}


  1. 김달호. (2013). R과 WinBUGS를 이용한 베이지안 통계학: p95. ↩︎