2.3 Relative Entropy and Mutual InformationΒΆ
The relative entropy \(D(p \mid\mid q)\) is a measure of the distance between two distributions. It measures the inefficiency of assuming that the distribution is \(q\) when the true distribution is \(p\).
Definition. The relative entropy or Kullback-Leibler distance between two probability mass functions \(p(x)\) and \(q(x)\) is defined as
It is not a true distance between distributions since it is not symmetric and does not satisfy the triangle inequality.
Definition. Consider two random variables \(X\) and \(Y\) with a joint probability mass function \(p(x, y)\) and marginal probability mass functions \(p(x)\) and \(p(y)\). The mutual information \(I(X; Y)\) is the relative entropy between the joint distribution and the product distribution \(p(x)p(y)\):