Lyapunov Contraction Step 1: Descent Lemma and Parameter Identities #
Descent Lemma #
For L-smooth f with step x₊ = x - (1/L)∇f(x): f(x₊) ≤ f(x) - (1/(2L))‖∇f(x)‖²
Parameter Identities #
With ρ = (1-a)/(1+a) and λ = (1+a)²/(2(1-a)): (1+a)ρ = 1-a λρ² = (1-a)/2
Velocity Coefficient Bound #
If ε·η ≤ a and 0 ≤ a < 1: (1-a)² + ε·η·(1-a) ≤ 1-a (1-a)(1+a) ≤ (1+a)² = 2(1-a)λ
theorem
PLAcceleratedNesterovLean.descent_lemma
(fx fy grad_sq L : ℝ)
(hL : 0 < L)
(hsmooth : fy ≤ fx - 1 / L * grad_sq + L / 2 * (1 / L) ^ 2 * grad_sq)
:
Descent lemma: f(x - η·∇f(x)) ≤ f(x) - (η/2)·‖∇f(x)‖² for η = 1/L. This is stated abstractly: if f(y) ≤ f(x) + ⟨∇f(x), y-x⟩ + (L/2)‖y-x‖² and y = x - (1/L)·∇f(x), then f(y) ≤ f(x) - (1/(2L))·‖∇f(x)‖².
theorem
PLAcceleratedNesterovLean.lsmooth_qub
{d : ℕ}
(f : E d → ℝ)
(L : NNReal)
{V : Set (E d)}
(hV : IsOpen V)
(hf_diff : DifferentiableOn ℝ f V)
(hf_lip : LipschitzOnWith L (gradient f) V)
(x w : E d)
(hx_V : x ∈ V)
(hseg_V : ∀ (t : ℝ), 0 ≤ t → t ≤ 1 → x + t • w ∈ V)
:
Quadratic Upper Bound (QUB) from L-smoothness: for any x, w, f(x + w) - f(x) ≤ ⟨∇f(x), w⟩ + L/2·‖w‖².
theorem
PLAcceleratedNesterovLean.lsmooth_descent_at
{d : ℕ}
(f : E d → ℝ)
(L : NNReal)
(hL : 0 < ↑L)
{V : Set (E d)}
(hV : IsOpen V)
(hf_diff : DifferentiableOn ℝ f V)
(hf_lip : LipschitzOnWith L (gradient f) V)
(x : E d)
(hx_V : x ∈ V)
(hseg_V : ∀ (t : ℝ), 0 ≤ t → t ≤ 1 → x + t • -((1 / ↑L) • gradient f x) ∈ V)
(η : ℝ)
(hη : η = 1 / ↑L)
:
L-smooth descent: f(x - η·∇f(x)) ≤ f(x) - (η/2)·‖∇f(x)‖² when η = 1/L and the gradient of f is L-Lipschitz. Combines the quadratic upper bound from L-smoothness with the arithmetic simplification (descent_lemma).