Modified Nesterov Scheme and Supporting Definitions #
Shared definitions for the proof of local accelerated convergence:
- Modified Nesterov iteration (state, step, sequence)
- Hessian quadratic form
- Lyapunov function and auxiliary quantities
- Fiber-saturated sets
- Tangent/normal space of an embedded manifold
Hessian quadratic form #
Modified Nesterov scheme #
One step of the modified Nesterov scheme: x' = x + √η · v (look-ahead) g = ∇f(x') (gradient at look-ahead) x₊ = x' - η · g (gradient step) v₊ = ρ(v - √η · g) (momentum update)
Equations
Instances For
The Nesterov sequence starting from x₁ with v₁ = 0. Index 0 corresponds to iteration 1 in the paper.
Equations
- PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ 0 = { x := x₁, v := 0 }
- PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n.succ = PLAcceleratedNesterovLean.nesterovStep f η ρ (PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n)
Instances For
The gradient computed at the look-ahead point at step n.
Equations
- PLAcceleratedNesterovLean.nesterovGrad f η ρ x₁ n = gradient f ((PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n).lookahead η)
Instances For
Step between consecutive look-ahead points: h_n = x'_{n+1} - x'_n.
Equations
- PLAcceleratedNesterovLean.nesterovH f η ρ x₁ n = (PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ (n + 1)).lookahead η - (PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n).lookahead η
Instances For
Geometric quantities #
Normal displacement: e_n = x'_n - π(x'_n), the error from the manifold. Here π is the nearest-point projection.
Equations
- PLAcceleratedNesterovLean.normalDisp π f η ρ x₁ n = (PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n).lookahead η - π ((PLAcceleratedNesterovLean.nesterovSeq f η ρ x₁ n).lookahead η)
Instances For
Auxiliary variable: u_n = P⊥ v_n + √μ' · e_n, where P⊥ = Id - P is the normal projector (P projects onto tangent space).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Curvature error: ξ_n = e_{n+1} - e_n - P⊥ h_n. Measures the deviation from the affine-case recursion.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Potential and Lyapunov functions #
The potential Ψ(x) = f(x) - f⋆ + (μ'/2) · dist(x, S)². Combines the optimality gap with a quadratic distance penalty.
Equations
- PLAcceleratedNesterovLean.psi f μ' S x = f x - PLAcceleratedNesterovLean.fStar f + μ' / 2 * Metric.infDist x S ^ 2
Instances For
The Lyapunov function: L_n = (f(x_n) - f⋆) + ½‖u_n‖² + lam‖P v_n‖² where lam = (1+a)²/(2(1-a)) and a = √(μ'·η).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Fiber saturation #
A set A is fiber-saturated with respect to a projection π if for every x ∈ A, the segment from π(x) to x lies entirely in A.
Equations
Instances For
Tangent space of an embedded manifold #
The tangent subspace of the embedded manifold at a point m, defined as the range of the manifold derivative of ι at m.
Equations
- PLAcceleratedNesterovLean.tangentSubspace ι m = (↑(mfderiv% ι m)).range