Newton-system layer #
This file connects the HSDE/HLP equations with the Newton direction used in the interior-point proof. Conceptually, this is the linear-algebra layer: the later algorithm file should not need to know the details of the Newton matrix.
Lean-reading hints for beginners:
intro hproves an implication or a universal statement by introducing its premise/variable as the local nameh.have h : P := by ...creates an intermediate lemma namedh.rcases h with ⟨a, b, c⟩decomposes a conjunction/existential proof into its components.simpa [defs] using hmeans: simplify the goal and the type ofhusingdefs, then close the goal byh.
Short Lean proof-command guide #
The file is written for readers who know the interior-point algebra but may be new to Lean.
introintroduces an assumption or quantified variable from the current goal.have h : P := by ...proves and names an intermediate claim.rcases h with ⟨...⟩unpacks conjunctions, existentials, and structures.simp only [...]performs controlled rewriting using only the listed facts.simpa [defs] using hsimplifies the goal and the type ofh, then appliesh.linarithcloses linear real-arithmetic goals from the available hypotheses.nlinarithis the same idea for nonlinear arithmetic such as products/squares.ringproves polynomial identities over rings such asℝ.
Active HLP/YTM proof layer #
This file contains the parts that are still under active proof engineering:
- the full Newton block operator and its finite-dimensional solvability proof;
- Schur-complement complementarity arguments;
- the remaining YTM predictor/corrector local estimates.
The stable definitions and elementary lemmas are imported from
HSDInteriorPointLP.PrimalDualData.
Pull a scalar out of a finite sum. This is the only finite-sum scalar rule
used below; the block-operator proof invokes it through matVec, tMatVec, and
dot scalar-multiplication lemmas, rather than by broad rewriting inside the
large product goal.
Component projections for scalar multiplication on the HLP block product space. These
lemmas keep the Newton-block linearity proof away from broad simp.
Component projections for addition on the HLP block product space.
The full Newton block operator as a linear self-map on the product block space.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Zero residual of the block operator is exactly the homogeneous Newton block.
Hitting the inhomogeneous right-hand side is exactly solving the Newton block.
In a finite-dimensional vector space, an injective linear self-map is surjective.
The concrete HLP nullspace equations imply the reduced skew-operator nullspace condition. This is the explicit place where the skew-symmetry of the YTM HLP coefficient matrix must be formalized.
Schur-complement form of the homogeneous Newton-kernel proof #
The following small definitions and lemmas make the proof of
HLP_newton_block_kernel_trivial_from_full_row_rank follow the standard Newton
matrix argument: eliminate (ds, dκ) by the positive diagonal complementarity block,
form the Schur complement in (dy, dτ, dθ), first prove the Schur residual and
dτ vanish, and only then remove the remaining multiplier variables.
The Schur-complement residual
r = Aᵀ dy - c dτ + cbar dθ. In the homogeneous dual block we have ds = -r.
Equations
- HSDInteriorPointLP.HLPSchurResidual P D j = HSDInteriorPointLP.tMatVec P.A D.dy j - P.c j * D.dtau + HSDInteriorPointLP.cbar P j * D.dtheta
Instances For
The positive diagonal ratio Θ = S^{-1}X, written componentwise.
Equations
- HSDInteriorPointLP.HLPThetaDiag w j = w.x j / w.s j
Instances For
The homogeneous dual block gives ds = -r.
The complementarity block and the eliminated dual slack give
dx = Θ r, the usual first step in forming the Schur complement.
Schur-complement complementarity identity.
After eliminating ds, dx, and dκ through the positive diagonal
complementarity block, the homogeneous HLP orthogonality relation becomes
∑ᵢ (xᵢ/sᵢ) rᵢ² + (κ/τ) dτ² = 0.
This is the finite-dimensional inner-product identity used in the standard
interior-point nonsingularity argument.
A finite-dimensional positivity lemma for the reduced complementarity form.
If all diagonal Schur weights are strictly positive and eta > 0, then the identity
∑ᵢ thetaᵢ rᵢ² + eta t² = 0 forces every residual component rᵢ and the scalar
t to vanish. This is the positive-diagonal part of the interior-point
nonsingularity argument, rather than positive-definiteness of the whole
(dy, dτ, dθ) Schur matrix.
The reduced complementarity identity forces the Schur residual and dτ to be
zero.
Once the Schur residual and dτ vanish, the reduced primal and scalar directions
vanish.
Final algebra after the Schur core has vanished: the definitions of bbar, cbar,
and zbar imply dθ = 0; then full row rank gives dy = 0.
Kernel triviality of the full HLP Newton block operator.
This is the part where the proof should use:
- full row rank of
A, - strict positivity of
x,tau,s, andkappa, - the skew-symmetric HLP block identities.
Mathematically, one first eliminates ds and dκ using the complementarity
blocks. With r = Aᵀdy - c dτ + cbar dθ, this gives dx = Θ r, where
Θ = S^{-1}X is positive diagonal. The Schur-complement identity is then
∑ᵢ Θᵢ rᵢ² + (κ/τ)dτ² = 0, so r = 0 and dτ = 0. The remaining HLP equations,
together with the definitions of bbar, cbar, and zbar, give dθ = 0, and full
row rank removes dy.
Finite-dimensional square linear systems: zero kernel implies solvability.
The HLP Newton block system is square. Once the homogeneous block operator has trivial kernel, finite-dimensional linear algebra gives surjectivity, hence existence of a solution for the inhomogeneous Newton right-hand side.
Concrete nonsingularity/surjectivity statement for the full HLP Newton block system. This is now the only finite-dimensional linear-algebra theorem that remains for search-direction existence.
Mathematically, this is proved by eliminating ds and dκ with the positive diagonal
coefficients xᵢ and τ, and then using full row rank of A together with the
self-dual/skew-symmetric HLP block structure.
A full HLP direction equation gives the reduced direction equation used by the rest of the proof skeleton.
Concrete full HLP Newton linear-algebra theorem.
This is stronger and more explicit than the reduced HLPNewtonLinearAlgebra: it
states solvability of the full YTM direction system containing dy and dθ.
The proof should use the full-row-rank assumption on A and the positive diagonal
terms from the interior iterate.
- full_direction_system_solvable : LPStandardAssumptions P → ∀ (w : HSState n), Interior w → ∀ (γ : ℝ), ∃ (D : HLPFullDirection m n), HLPFullDirectionEquation P w D γ
Instances For
Main concrete Newton solvability obligation for the YTM HLP system.
Summing the componentwise complementarity equations, including the scalar
τκ equation, gives the aggregate first-order gap identity.
The skew-symmetry consequence is derived from the bundled HLP nullspace relation. This is the separated-variable version of Theorem 5(ii) in YTM.
The componentwise search-direction equation implies the aggregate complementarity equation.
A solution of the HLP search-direction system gives a valid step direction.
Linear-algebra theorem needed for the YTM/HLP Newton system.
This packages the remaining finite-dimensional linear-algebra task: from the concrete
HLP Newton matrix and the full-row-rank assumption on A, one proves that the
predictor/corrector direction equations are solvable at every interior iterate.
The theorem is kept as an explicit field rather than hidden inside the algorithm. In a later file, this should be replaced by an actual matrix nonsingularity proof.
- direction_system_solvable : LPStandardAssumptions P → ∀ (w : HSState n), Interior w → ∀ (γ : ℝ), ∃ (d : HSDirection n), HSDDirectionEquation w d γ
Instances For
The concrete full Newton linear algebra induces the reduced linear algebra used by the existing convergence skeleton.
Main finite-dimensional linear-algebra obligation for the HLP Newton system.
This is the place where the full-row-rank assumption on A should be used to prove
that the concrete Ye--Todd--Mizuno Newton system has a solution at each interior
iterate. The current skeleton routes this obligation through the explicit full-system
solvability statement above.
Solvability of the YTM/HLP Newton system at a given point and parameter.
- exists_direction : ∃ (d : HSDirection n), HSDDirectionEquation w d γ
Instances For
Full-row-rank linear algebra supplies a solution of the search-direction system.
Once solvability of the HLP Newton system is available, existence of a search direction is immediate.
A noncomputable selector for the search direction once existence is known.
Equations
Instances For
The selected search direction satisfies the search-direction equations.
The selected search direction is a valid step direction.
Algebraic expansion of the complementarity gap under a line step.
Gap update for a direction satisfying the HLP nullspace and complementarity equations.
Predictor step guarantee from the Mizuno--Todd--Ye local analysis.
- neighborhood_next : HSDNeighborhood βwide (addStep w d α)
Instances For
Corrector step guarantee from the Mizuno--Todd--Ye local analysis.
- neighborhood_next : HSDNeighborhood βtight (addStep w d 1)
Instances For
The tight neighborhood parameter used in the YTM proof.
Equations
Instances For
The wide neighborhood parameter used in the YTM proof.
Equations
Instances For
The predictor step-size constant appearing in YTM Theorem 6.
Mathematically this is 8^{-2.5}.
Instances For
Corrector local estimate #
The corrector half of the YTM local analysis is a componentwise estimate for the full step. In the usual notation, set
u = (x, τ)andv = (s, κ),Δu = (dx, dτ)andΔv = (ds, dκ),N = n + 1andμ = gap w / N.
For a corrector direction (γ = 1), the linearized complementarity equations give
uᵢ Δvᵢ + vᵢ Δuᵢ = μ - uᵢ vᵢ,
and skew orthogonality gives Σ Δuᵢ Δvᵢ = 0. The Mizuno--Todd--Ye
componentwise estimate proves that, if w ∈ N(1/2), then the full corrector step
w + d is positive and lies in N(1/4). The proof is the standard scaled estimate
‖Δu ∘ Δv‖ ≤ μ / 4,
obtained by writing pᵢ = Δuᵢ/uᵢ, qᵢ = Δvᵢ/vᵢ,
aᵢ = sqrt(uᵢvᵢ) pᵢ, and bᵢ = sqrt(uᵢvᵢ) qᵢ.
This lemma is stated separately so that the top-level corrector theorem has exactly
the same shape as the IPM proof: local positivity/centrality estimate plus the
already-formalized gap identity.