RL Theory in Lean #
Source: arxiv:2511.03618
Authors: Shangtong Zhang
Status: verified
Main declarations: StochasticMatrix.stationary_distribution_exists
Tags: probability, reinforcement-learning, stochastic-matrices
MSC: 62L20, 60J10
Provenance #
Imported from https://github.com/ShangtongZhang/rl-theory-in-lean (MIT-licensed
upstream; relicensed into Lean Pool under Apache 2.0 with the upstream author's
copyright preserved). Accompanies the paper Towards Formalizing Reinforcement
Learning Theory (arXiv:2511.03618). Ported from Lean v4.28.0-rc1 to Lean Pool's
v4.30.0-rc2. This Lean Pool import keeps the warning-clean core infrastructure.
Mathematical overview #
The project mirrors Mathlib's directory layout and develops stochastic row-stochastic matrices, Doeblin minorization and geometric mixing for finite chains, finite Markov-chain kernels and trajectory measures, measure/kernel helper lemmas, and discrete Gronwall inequalities used in stochastic approximation.
Alias of StochasticMatrix.Aperiodic.
A stochastic matrix is aperiodic if for each state, the GCD of return times is 1
Equations
Instances For
Alias of StochasticMatrix.DoeblinMinorization.
A Doeblin minorization for a finite stochastic matrix.
Instances For
Alias of StochasticMatrix.GeometricMixing.
Geometric convergence to stationarity in total variation/L¹ distance.
Equations
Instances For
Alias of StochasticMatrix.RowStochastic.
A matrix whose rows are stochastic vectors.
Equations
Instances For
Alias of StochasticMatrix.cesaroAverage.
The Cesaro average of the first n + 1 iterates of a stochastic vector.
Equations
Instances For
Alias of StochasticMatrix.cesaro_average_is_svec.
Alias of StochasticMatrix.eventually_positive.
Alias of StochasticMatrix.multi_step_stationary.
Alias of StochasticMatrix.pos_of_stationary.
Alias of StochasticMatrix.returnTimes.
The set of positive return times for state i
Equations
Instances For
Alias of StochasticMatrix.return_times_add_mem.
Return times are closed under addition (used via AddSubmonoid.closure)
Alias of StochasticMatrix.smatAsOperator.
The affine action of a stochastic matrix on the probability simplex.
Equations
Instances For
Alias of StochasticMatrix.smat_as_operator_iter.
Alias of StochasticMatrix.smat_mul_smat_is_smat.
Alias of StochasticMatrix.smat_nonexpansive_in_l1.
Alias of StochasticMatrix.smat_pow_is_smat.
Alias of StochasticMatrix.stationary_distribution_uniquely_exists.
Alias of StochasticMatrix.svec_mul_smat_is_svec.
Alias of StochasticMatrix.uniformDistribution.
The uniform probability distribution on a nonempty finite type.
Instances For
Alias of StochasticMatrix.StochasticVec.le_one.
Alias of MeasureTheory.Integrable.finset_sum.
Alias of Filter.EventuallyEq.finset_sum.
Alias of ProbabilityTheory.MarkovChain.Kernel.iter.
Iterates of the transition kernel of a Markov chain.