7 Convergence Theorems

1 Convergence Theorem

1.1 LLN

Suppose we have X1,,Xn, where n is very large. We are curious about Snn=i=1nXin. What value it converges to? What is its distribution? Law of Large Numbers (LLN) and Central Limit Theorem (CLT) will answer these questions.

Theorem (Weak Law of Large Numbers, WLLN)

Let X1,,Xn be a sequence of iid RVs with a finite mean μ and finite variance σ2. Let Sn=X1++Xn. Then ε>0, limnP(|Snnμ|>ε)=0.
I.e. Snnpμ. (see below)

The requirement of finite variance can be neglected, if we consider a stronger version

Theorem (Strong Law of Large Numbers, SLLN)

X1,,Xn is a sequence of iid RVs with finite mean μ. Then P(limnSnn=μ)=1.
I.e. Snna.s.μ.

1.2 CLT

Theorem (Central Limit Theorem, CLT)

Again assume μ,σ2<. Let Sn=X1++Xn. Then limnP(nσ(Snnμ)x)=Φ(x),xR,
where Φ(x)=x12πet22dt is the c.d.f of N(0,1).
I.e. nσ(Snnμ)dN(0,1).

1.2.1 Extensions of CLT

If X1,X2, are independent but not identically distributed, CLT can apply with additional conditions like Lyapunov or Lindeberg condition. A special case is for bounded RVs. Suppose X1,X2, independent, with E[Xi]=μi<, Var(Xi)=σi2<, Sn=X1++Xn, then if

  1. a constant M>0: P(|Xi|<M)=1,iN.
  2. limnVar(Sn)=limni=1nσi2=,

we have limnP(SnE[Sn]Var(Sn)<x)=Φ(x),xR.

2 Types of Convergence for RV

X1,X2, is an infinite sequence of RVs. X is another RV. Assume they are all defined on the same probability space. Of course we can define pointwise convergence limnXn(ω)=X(ω),ωΩ.
This notion of convergence however, turns out to be too strong.

Almost Sure Convergence

Almost sure convergence, also called strong convergence, convergence with probability 1, is defined as P(limnXn=X)=1.
Denote as Xna.s.X,n.

More precisely, P({ωΩ|limnXn(ω)=X(ω)})=1.
I.e., ε>0,N(ω,ε)N,nN(ω,ε):|Xn(ω)X|<ε. This is actually a definition for pointwise convergence.

Convergence in Probability

{Xn} converges in probability if ε>0, limnP(|XnX|>ε)=0.
Denote as XnpX,n.

Convergence in r th mean

For r>0, limnE[|XnX|r]=0.
Denote as XnrX,n.

Convergence in Distribution

Convergence in distribution is also called convergence in law. Here X does not need to be defined on the same probability space as {Xn}. And limnFXn(x)=FX(x),xC(FX), where C(FX)={xR|FX(x) is continuous at x}.
Denote as X1dX,n.

The three convergence theorems correspond to different convergence.

Almost sure convergence Convergence in probability Convergence in distribution
SLLN CLT WLLN

3 Convergence Relations

Theorem (Relations between Different Convergence Concepts)

The relations can be shown below:
Pasted image 20241130003132.png|400

By Lemma 1, 0E[|XnX|r]E[|XnX|s]rs, so if limnE[|XnX|s]=0 for RHS, then limnE[|XnX|r]=0.

Suppose Xna.s.X. Since {|XmX|<ε,mn}=m=n{|XmX|<ε} and P(A)P(B) if BA, and by Lemma 2 1P(|XnX|<ε)P(|XmX|<ε,mn)1, so limnP(|XnX|<ε)=1.

Since XnpX, limnP(|XXn|>ε)=0,ε>0. So case 1 and 2 together imply FX(xε)lim infnFXn(x)lim supnFXn(x)FX(x+ε).
If xC(FX), then limε0FX(xε)=FX(s)=limε0FX(x+ε), then limnFXn(x)=FX(x) if xC(FX).

4 Rate of Convergence

Theorem (Berry-Eseen)

There exists a constant C s.t. if X1,,Xn are iid RVs with finite mean μ, finite variance σ2 and finite ρ=E[|Xiμ|3], then nN, supxR|Fn(x)Φ(x)|cρσ3n, where Sn=X1++Xn, Zn=nSnnμσ, Fn(x)=P(Znx).