2.4 Relationship Between Entropy and Mutual InformationΒΆ

The mutual information \(I(X; Y)\) can be rewritten as

\[\begin{split}I(X; Y) & = \sum_{x, y} p(x, y) \log \frac{p(x, y)}{p(x)p(y)} \\ & = \sum_{x, y} p(x, y) \log \frac{p(x \mid y)}{p(x)} \\ & = - \sum_{x, y} p(x, y) \log p(x) + \sum_{x, y} p(x, y) \log p(x \mid y) \\ & = H(X) - H(X \mid Y)\end{split}\]

Thus, the mutual information \(I(X; Y)\) is the reduction in the uncertainty of \(X\) due to the knowledge of \(Y\).

Theorem 2.4.1 (Mutual information and entropy).

\[\begin{split}I(X; Y) & = H(X) - H(X \mid Y) \\ I(X; Y) & = H(Y) - H(Y \mid X) \\ I(X; Y) & = H(X) + H(Y) - H(X, Y) \\ I(X; Y) & = I(Y; X) \\ I(X; X) & = H(X)\end{split}\]