LeanPool.FormalLearningTheory.Complexity.Generalization.Tail #

source

def blockExtract {α : Type u_1} (k m : ℕ) (S : Fin (k * m) → α) (j : Fin k) :

Fin m → α

Extract block j from a flat array of k*m elements, using finProdFinEquiv.

Equations

blockExtract k m S j i = S (finProdFinEquiv (j, i))

Instances For

source

def majorityVote (k : ℕ) (votes : Fin k → Bool) :

Bool

Boolean majority vote: returns true iff strictly more than half the votes are true.

Equations

majorityVote k votes = decide (2 * {j : Fin k | votes j = true}.card > k)

Instances For

source

theorem block_extract_disjoint (k m : ℕ) (j₁ j₂ : Fin k) (hne : j₁ ≠ j₂) :

Disjoint (Finset.image (fun (i : Fin m) => finProdFinEquiv (j₁, i)) Finset.univ) (Finset.image (fun (i : Fin m) => finProdFinEquiv (j₂, i)) Finset.univ)

Block index sets are disjoint for distinct blocks.

source

theorem block_extract_measurable {X : Type u_1} [MeasurableSpace X] (k m : ℕ) (j : Fin k) :

Measurable fun (ω : Fin (k * m) → X) => blockExtract k m ω j

Block extraction is measurable: extracting block j from a pi-type is measurable.

source

theorem iIndepFun_block_extract {X : Type u_1} [MeasurableSpace X] (k m : ℕ) (D : MeasureTheory.Measure X) [MeasureTheory.IsProbabilityMeasure D] :

ProbabilityTheory.iIndepFun (fun (j : Fin k) (ω : Fin (k * m) → X) => blockExtract k m ω j) (MeasureTheory.Measure.pi fun (x : Fin (k * m)) => D)

Block extractions are independent under the product measure. Key infrastructure for boosting (D4) and probability amplification.

source

theorem shatters_realize {X : Type u} {C : ConceptClass X Bool} {T : Finset X} (hT : Shatters X C T) (f : ↥T → Bool) :

∃ c ∈ C, ∀ (x : ↥T), c ↑x = f x

Shattering lifting: if T is shattered by C and f : ↥T → Bool, then there exists c ∈ C that agrees with f on all of T.

source

theorem shatters_hard_labeling {X : Type u} {C : ConceptClass X Bool} {T : Finset X} (hT : Shatters X C T) (h : ↥T → Bool) (hcard : 2 ≤ T.card) :

∃ c ∈ C, T.card < 4 * {x : ↥T | c ↑x ≠ h x}.card

Key counting lemma: for any h : ↥T → Bool on a shattered set T with |T| ≥ 2, there exists c ∈ C with #{x ∈ T | c x ≠ h x} > |T|/4. Lifts exists_many_disagreements through shattering.

source

theorem nfl_per_sample_shattered {X : Type u} {C : ConceptClass X Bool} {T : Finset X} (hT : Shatters X C T) {m : ℕ} (hTcard : 2 * m < T.card) (xs : Fin m → X) (h : X → Bool) :

∃ c ∈ C, (∀ (i : Fin m), xs i ∈ T → c (xs i) = h (xs i)) ∧ T.card < 4 * {x ∈ T | c x ≠ h x}.card

NFL per-sample lemma for shattered sets: for ANY fixed sample xs and ANY hypothesis h, there exists c ∈ C agreeing with h on sample points but with high error (> 1 / 4) on the shattered set T. Uses the counting argument on unseen points via disagreement_sum_eq.

source

theorem vcdim_infinite_not_pac (X : Type u) [MeasurableSpace X] [MeasurableSingletonClass X] (C : ConceptClass X Bool) (hinf : VCDim X C = ⊤) :

¬PACLearnable X C

If VCDim = ⊤, then C is not PAC learnable. Proof: for any learner L with sample function mf, pick ε = 1 / 4, δ = 1 / 4. Let m = mf(1 / 4, 1 / 4). Since VCDim = ⊤, ∃ shattered set S with |S| ≥ 2m. Put D = uniform on S. For random labeling, any m-sample learner has expected error ≥ 1 / 4 on unseen points. This is the core of pac_imp_vcdim_finite (contrapositive direction).

source

@[reducible, inline]

abbrev DriftRate :

Type

Drift rate: how fast the target concept changes over time.

Equations

DriftRate = ℝ

Instances For