
Relative Entropy (Kullback-Leibler Divergence) in Classical Information Theory


For probability mass functions $p, q$ of discrete random variables, the relative entropy of $q$ with respect to $p$ is defined as follows.

$$ D(p \| q) := \sum p(x) \log_{2} \dfrac{p(x)}{q(x)} \tag{1} $$

For $p \ne 0$, we define $p \log_{2} \frac{p}{0} := \infty$; likewise, by the standard convention, $0 \log_{2} \frac{0}{q} := 0$.
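As a concrete illustration, here is a minimal Python sketch of definition $(1)$, including the conventions above (the function name `relative_entropy` and the example distributions are our own choices, not from the source):

```python
import math

def relative_entropy(p, q):
    """D(p || q) in bits, for two discrete distributions given as
    equal-length sequences of probabilities."""
    d = 0.0
    for px, qx in zip(p, q):
        if px == 0:
            continue            # by convention, 0 * log2(0/q) contributes 0
        if qx == 0:
            return math.inf     # by convention, p * log2(p/0) := inf
        d += px * math.log2(px / qx)
    return d

# Example: a fair coin p versus a biased coin q
p = [0.5, 0.5]
q = [0.9, 0.1]
print(relative_entropy(p, q))  # ≈ 0.737 bits
print(relative_entropy(p, p))  # 0.0
```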

For continuous random variables, it is defined by integration.

$$ D(p \| q) := \int p(x) \ln \dfrac{p(x)}{q(x)} dx $$
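As a sanity check, the integral can be evaluated numerically. The sketch below (assuming SciPy is available; the two normal densities are illustrative choices, not from the source) compares the numerical integral with the well-known closed form $\ln\frac{\sigma_2}{\sigma_1} + \frac{\sigma_1^2 + (\mu_1 - \mu_2)^2}{2\sigma_2^2} - \frac{1}{2}$ for two normal distributions.

```python
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

# Two illustrative normal densities: p = N(0, 1), q = N(1, 2^2)
p = norm(loc=0.0, scale=1.0)
q = norm(loc=1.0, scale=2.0)

# D(p || q) = ∫ p(x) ln(p(x)/q(x)) dx, evaluated numerically
numeric, _ = quad(lambda x: p.pdf(x) * np.log(p.pdf(x) / q.pdf(x)), -12, 12)

# Closed form for two normal distributions, for comparison
closed = np.log(2.0 / 1.0) + (1.0**2 + (0.0 - 1.0)**2) / (2 * 2.0**2) - 0.5

print(numeric, closed)  # both ≈ 0.4431 nats
```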

Description

Relative entropy is also known as the Kullback-Leibler divergence (KLD) and is denoted by the following notations.

$$ D(p \| q) = D_{\text{KL}}(p \| q) = H(p \| q) $$

$D(p \| q)$ measures how inadequate it is to assume that the distribution of $X$ is $q$ when the actual distribution of $X$ is $p$; in other words, how much $q$ differs from $p$. Since $-\log q$ represents the information content of $q$, the definition $(1)$ is the average difference in information content between $q$ and $p$.

$$ \begin{align*} \sum p(x) \log_{2} \dfrac{p(x)}{q(x)} &= \sum p(x) \big[ -\log_{2}q(x) - (-\log_{2}p(x)) \big] \\ &= \sum p(x) \big[ I(q(x)) - I(p(x)) \big] \\ &= E \big[ I(q) - I(p) \big] \end{align*} $$
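This identity is easy to verify numerically; the short sketch below (the two distributions are arbitrary examples of ours) computes the sum directly and as the expected difference of information contents:

```python
import math

p = [0.2, 0.5, 0.3]
q = [0.4, 0.4, 0.2]

# Direct form: sum of p(x) * log2(p(x)/q(x))
direct = sum(px * math.log2(px / qx) for px, qx in zip(p, q))

# Expected difference of information contents: E_p[I(q) - I(p)]
info = lambda r: -math.log2(r)   # information content in bits
expected = sum(px * (info(qx) - info(px)) for px, qx in zip(p, q))

print(direct, expected)  # identical: ≈ 0.1365 bits
```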

Properties

  1. Non-symmetry: $D(p \| q) \ne D(q \| p)$ (illustrated numerically in the sketch after this list)

  2. Non-negativity: $D(p \| q) \ge 0$, with equality if and only if $p = q$.
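Both properties can be observed numerically. A minimal sketch, reusing the `relative_entropy` function from the earlier sketch (the distributions are arbitrary examples):

```python
# Reuses relative_entropy from the sketch in the definition section
p = [0.1, 0.9]
q = [0.5, 0.5]

print(relative_entropy(p, q))  # ≈ 0.531
print(relative_entropy(q, p))  # ≈ 0.737 — differs, so D is not symmetric
print(relative_entropy(p, p))  # 0.0 — nonnegative, and 0 exactly when p = q
```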

Proof

2.

If $p = q$, then by definition $D(p \| q) = 0$, so let us consider $p \ne q$.

$$ \begin{align*} -D(p \| q) &= \sum p(x) \log_{2} \dfrac{q(x)}{p(x)} \\ &\le \log_{2} \left( \sum p(x) \dfrac{q(x)}{p(x)} \right) \\ &= \log_{2} \left( \sum q(x) \right) \\ &= \log_{2} 1 \\ &= 0 \end{align*} $$

The inequality holds by Jensen's inequality, applied with $\lambda_{k} = p(x)$ and $x_{k} = \frac{q(x)}{p(x)}$, since the logarithm is a concave function.

Jensen’s inequality

If $f$ is a concave function, then the following holds for weights $\lambda_{k} \ge 0$ with $\sum_{k=1}^{n} \lambda_{k} = 1$.

$$ f\left( \sum\limits_{k=1}^{n}\lambda_{k}x_{k} \right) \ge \sum\limits_{k=1}^{n} \lambda_{k} f(x_{k}) $$

Therefore, multiplying both sides by $-1$,

$$ 0 \le D(p \| q) $$
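As an aside, Jensen's inequality for the concave function $f = \log_{2}$ used in this proof is easy to check numerically (the weights and points below are arbitrary choices of ours):

```python
import math

lam = [0.2, 0.3, 0.5]   # weights λ_k ≥ 0 summing to 1
x = [1.0, 4.0, 9.0]     # arbitrary positive points x_k

lhs = math.log2(sum(l * xi for l, xi in zip(lam, x)))  # f(Σ λ_k x_k)
rhs = sum(l * math.log2(xi) for l, xi in zip(lam, x))  # Σ λ_k f(x_k)

print(lhs, rhs, lhs >= rhs)  # ≈ 2.561 ≥ 2.185, as Jensen guarantees
```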
