Convergence
Histograms of the output of n tosses of a fair coin (head = 1, tail = 0).

Convergence

In this article, we are reviewing the three distinct and popular ways of convergence in Probability Theory.

We let X1, X2, ... and X all defined on the same probability space (Ω, T, P).

  1. Simple convergence

The sequence X1, X2, ... simply converges if

(1)

This notion is too strong in most cases. For example, assume we toss a fair coin infinitely. On average we'll see a head (or tail) half of the time. Assume Xn=1 if the output is a head, and 0 otherwise, at the toss n, for any n. Then there exist an eventuality ω1="HHHHHHHH..." for which the average (X1+X2+...)/n evaluated on ω1 is 1, and an eventuality ω2="FFFFFFFF..." for which the average evaluated on ω2 is 0. This convergence does not coincide with the intuition issued from our empirical observation.

2. Convergence almost surely

The sequence X1, X2, ... almost surely (a.s.) converges if

(2)

Perhaps you may have an incredible lack of luck, you may observe indeed that the average proportion of heads (or tails) is half.

It is interesting to keep in mind with some practical criterii for the convergence almost surely.

  • if the sequence ΣP(|Xn-X|>ε) converges for all ε>0, then the sequence X1, X2, ... a.s. converges.

Hint: use the Borel-Cantelli lemma.

  • if ΣE|Xn-X| converges, then the sequence X1, X2, ... a.s. converges.

Hint: use the Markov inequality and the previous point.

When you perform an reductio ad absurdum, you can prove that the sequence X1, X2, ... a.s. converges if and only if

(3)

Indeed, you can start from:

(4)

(the little "c" means "complement").

3. Convergence in probability

The sequence X1, X2, ... converges in probability if

(5)

This means that the set of Xn's which move away from X is reducing to 0 when n tends to infinity.

In the example above, all the "averages" getting away from half tend to collapse in proportion.

There is a non-straightforward characterization of convergence in probability with the mathematical expectation.

  • Consider a function ψ from R to R+ and which is positive, continuous, bounded, non-decreasing and (strictly) increasing in the neighborhood of 0, such that ψ(0)=0. Then the sequence X1, X2, ... converges in probability if and only if

(6)

  • From this, setting ψ(x)=min(x,1), we deduce: the sequence X1, X2, ... converges in probability if and only if

(7)

This last property is often used to prove convergence in probability.

4. Comparison

To remember:

  • Convergence almost surely implies convergence in probability (use the definitions to prove this)
  • Conversely, if there is convergence in probability, one can extract a sub-sequence which converges almost surely.

Note: there also is the convergence in law, but it might be for next time!

Jad Tabbouch

Data scientist

2 个月

Thank you for this article ! I wish you had included a bit more "real life" examples as you did in the beginning of the article, are there concrete examples from everyday life/data where these modes of convergence are useful ? Of course we have the central limit theorem, the law of large numbers. But are there practical applications to the interactions between these modes of convergence in your experience ? I never had the chance of applying borel-cantelli lemmas in everyday data analyses ??

Estelle Detrembleur MBA ChPP FAPM PDHyp PMO-CP

Strategic PMO Consultant & International Relations Expert: Driving Global Project Excellence PMO Global Culture and DEI insight Team AI & Data Analytics | Cybersecurity & Risk | Business Continuity & ESG Advocate

2 个月

Love this ??

要查看或添加评论,请登录

Julien Riposo, Ph.D, CQF的更多文章

  • On the sum/product of two random variables

    On the sum/product of two random variables

    Let X and Y be two independent random variables admitting a differentiable density on the set of real numbers (without…

    16 条评论
  • The Feynman-Kac Formula for Multi-Dimensional Diffusion Process

    The Feynman-Kac Formula for Multi-Dimensional Diffusion Process

    This post is the immediate sequence of The Feynman-Kac Formula for 1D Diffusion Process | LinkedIn . On proving the…

  • The Feynman-Kac Formula for 1D Diffusion Process

    The Feynman-Kac Formula for 1D Diffusion Process

    On proving the formula We consider the following partial differential equation (PDE) problem: (1) with boundary…

    12 条评论
  • (Very) quick overview of Category Theory

    (Very) quick overview of Category Theory

    Category theory is a branch of mathematics that focuses on the high-level relationships between mathematical…

  • Wigner Theorem for Random Matrices

    Wigner Theorem for Random Matrices

    We give the main steps for a proof of the theorem. The Catalan number For any natural integer n, we define the nth…

    5 条评论
  • Mathematics and Blockchain

    Mathematics and Blockchain

    Whether intertwined with cryptocurrencies or a myriad of other secure applications, blockchain unveils itself as a…

    3 条评论
  • Stochastic Optimal Control

    Stochastic Optimal Control

    The Hamilton-Jacobi-Bellman (HJB) equation is a partial differential equation that arises in optimal control theory…

    5 条评论
  • Lie Groups and Lie Algebras - Applications to Finance and Blockchain

    Lie Groups and Lie Algebras - Applications to Finance and Blockchain

    A Lie group is a group (in the algebraic sense) which is also a differential manifold, meaning that it is a smooth…

    5 条评论
  • Quotient Spaces, Tori and Finance

    Quotient Spaces, Tori and Finance

    The quotient space of a topological space is formed by partitioning the space into equivalence classes based on some…

    5 条评论
  • Topological manifolds and finance

    Topological manifolds and finance

    A topological manifold is a space that locally resembles Euclidean space near each point, but globally may have a more…

    8 条评论

社区洞察