Shannon.Entropy.Joint

Joint distributions on product types, marginals, conditional entropy, and the chain rule for Shannon entropy.

These definitions and theorems supply the infrastructure for multi-variable entropy identities, on which the Section 6 properties rely.

Main definitions

marginalFst, marginalSnd: marginal distributions from a joint distribution
prodDist: product (independent) distribution from two marginals
IsIndependent: predicate for independence of a joint distribution
condEntropy: conditional entropy H_X(Y) = -∑_{x,y} p(x,y) log(p(x,y) / p_X(x)), where p_X is the first marginal
mutualInfo: mutual information I(X;Y) = H(X) + H(Y) - H(X,Y)

Main results

chain_rule: H(X,Y) = H(X) + H_X(Y)
entropyNat_prodDist: H(X,Y) = H(X) + H(Y) when the joint distribution is a product (independent) distribution
marginalFst_prodDist, marginalSnd_prodDist: marginals of product distributions
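
The definition of mutualInfo and the chain_rule identity listed above combine into the familiar form I(X;Y) = H(Y) - H_X(Y). The following is a minimal Lean sketch of that algebra only, assuming Mathlib is available. The real variables HX, HY, HXY and HcondY are hypothetical stand-ins for the entropy values and are not names from this library.

```lean
import Mathlib

-- Hypothetical stand-ins: HX = H(X), HY = H(Y), HXY = H(X,Y), HcondY = H_X(Y).
-- `chain` plays the role of `chain_rule`; the left-hand side mirrors the
-- formula given for `mutualInfo`.
example (HX HY HXY HcondY : ℝ) (chain : HXY = HX + HcondY) :
    HX + HY - HXY = HY - HcondY := by
  rw [chain]  -- substitute H(X,Y) = H(X) + H_X(Y)
  ring        -- the H(X) terms cancel
```

The sketch says nothing about how the library states or proves these facts; it only shows that the two listed formulas are consistent with each other.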

Marginals and product distributions

def LeanPool.Shannon1948Formalization.marginalFst {α β : Type} [Fintype α] [Fintype β] (p : ProbDist (α × β)) : ProbDist α

First marginal: (marginalFst p)(a) = ∑_b p(a, b).

Equations

LeanPool.Shannon1948Formalization.marginalFst p = ⟨fun (a : α) => ∑ b : β, ↑p (a, b), ⋯⟩

def LeanPool.Shannon1948Formalization.marginalSnd {α β : Type} [Fintype α] [Fintype β] (p : ProbDist (α × β)) : ProbDist β

Second marginal: (marginalSnd p)(b) = ∑_a p(a, b).

Equations

LeanPool.Shannon1948Formalization.marginalSnd p = ⟨fun (b : β) => ∑ a : α, ↑p (a, b), ⋯⟩
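
The elided proof term (⋯) in each marginal has to show, among other things, that the marginal weights still sum to 1. A minimal sketch of the summation step, assuming Mathlib and its lemma Fintype.sum_prod_type; the raw weight function w is a hypothetical stand-in for the coercion of a ProbDist, whose actual fields are not shown here.

```lean
import Mathlib

open BigOperators

-- `w` is a hypothetical raw weight function on the product type,
-- not the library's `ProbDist` structure.
example {α β : Type} [Fintype α] [Fintype β] (w : α × β → ℝ)
    (hsum : ∑ ab : α × β, w ab = 1) :
    ∑ a : α, ∑ b : β, w (a, b) = 1 := by
  -- Split the sum over pairs into an iterated sum; the inner sum over `b`
  -- is exactly the first-marginal weight at `a`.
  rw [Fintype.sum_prod_type] at hsum
  exact hsum
```

The same argument with the order of summation swapped covers the second marginal. Whether the library proves it this way is not visible from the rendered equations.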

def LeanPool.Shannon1948Formalization.prodDist {α β : Type} [Fintype α] [Fintype β] (p : ProbDist α) (q : ProbDist β) : ProbDist (α × β)

Product distribution: (prodDist p q)(a, b) = p(a) * q(b).

Equations

LeanPool.Shannon1948Formalization.prodDist p q = ⟨fun (ab : α × β) => ↑p ab.1 * ↑q ab.2, ⋯⟩
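
Summing the product weights p(a) * q(b) over b recovers p(a), which is the pointwise content of a statement like marginalFst_prodDist. A minimal sketch, assuming Mathlib; pa and q are hypothetical raw real weights rather than ProbDist values, and the library's own statement and proof may differ.

```lean
import Mathlib

open BigOperators

-- `pa` stands for the value p(a) at a fixed `a`; `q` is a hypothetical
-- weight function on β that sums to 1.
example {β : Type} [Fintype β] (pa : ℝ) (q : β → ℝ)
    (hq : ∑ b : β, q b = 1) :
    ∑ b : β, pa * q b = pa := by
  -- Pull the constant factor out of the sum, then use ∑ q = 1.
  rw [← Finset.mul_sum, hq, mul_one]
```

The symmetric computation, pulling q(b) out of a sum over a, gives the second marginal.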

Independence, conditional entropy, mutual information

def LeanPool.Shannon1948Formalization.IsIndependent {α β : Type} [Fintype α] [Fintype β] (p : ProbDist (α × β)) : Prop

A joint distribution is independent when it factors as the product of its marginals.

Equations

One or more equations did not get rendered due to their size.

noncomputable def LeanPool.Shannon1948Formalization.condEntropy {α β : Type} [Fintype α] [Fintype β] (p : ProbDist (α × β)) : ℝ

Conditional entropy H_X(Y) = -∑_{x,y} p(x,y) log(p(x,y) / p_X(x)).

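This measures the average uncertainty remaining in Y once X is known. The formula relies on Mathlib's conventions x / 0 = 0 and Real.log 0 = 0, which keep every term well-defined: when p_X(x) = 0, nonnegativity of p forces p(x,y) = 0 for every y, so the corresponding term is 0 * log(0 / 0) = 0.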
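
The conventions mentioned above, and the per-term identity that links this formula to the chain rule, can be checked in isolation. A minimal sketch, assuming Mathlib; pxy and px are hypothetical real numbers standing for p(x,y) and p_X(x), not library definitions.

```lean
import Mathlib

-- Totality conventions for real division and the real logarithm.
example : (0 : ℝ) / 0 = 0 := by simp
example : Real.log 0 = 0 := by simp

-- A term with p(x,y) = 0 and p_X(x) = 0 contributes nothing.
example : (0 : ℝ) * Real.log (0 / 0) = 0 := by simp

-- For nonzero weights, log p(x,y) splits into a marginal part and a
-- conditional part; multiplying by -p(x,y) and summing over (x, y)
-- gives H(X,Y) = H(X) + H_X(Y).
example (pxy px : ℝ) (hxy : pxy ≠ 0) (hx : px ≠ 0) :
    Real.log pxy = Real.log px + Real.log (pxy / px) := by
  rw [Real.log_div hxy hx]
  ring
```

The last example is only the algebraic core of the chain rule; the library's chain_rule must also handle the zero-weight terms and the sums themselves.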

Equations