10 Binomial Approximation

Consider I1,,InBernoulli(p), and SnBinomial(n,p), so Sn=dI1++In. We know P(Sn=k)=(nk)pk(1p)nk=n!k!(nk)!pk(1p)nk,0kn.
We want to estimate the Binomial distribution, because factorials are troublesome to work with. The key tool is to use Stirling Approximation e112n+1<n!(ne)n2πn<e112n.

1 Entropy Approximation

We can use entropy to approximate Binomial distribution. To be more specific, KL divergence.

Theorem (Entropy Approximation)

Let SnBinomial(n,p) and define f=kn. Then for k=1,,n1,12πnf(1f)enKL(f||p)>P(Sn=k)>[1112nf(1f)]12πnf(1f)enKL(f||p), where KL(f||p)=flog(pf)(1f)log(1p1f).

2 Normal Approximation

Taylor expansion of KL(f||p) (see as a function about f) about f=p gives KL(f||p)=(fp)22p(1p)+2g16g2(1g)2(fp)3 for some g between f,p.

To obtain a normal approximation, we need the remainder R quantity to be small. Notice that R=|n(2g1)6g2(1g)2(fp)3|n|fp|36[min(f,p)min(1f,1p)]2.

Good news is that when I1,,Ini.i.dBernoulli(p), by SLLN, Snn=d1n(I1++In)a.s.p.
So normal approximation is that P(Sn=k)12πnp(1p)e(knp)22np(1p).
This is accurate if n|fp|36[min(f,p)min(1f,1p)]21.
However, in general, normal approximation is less accurate than the entropy approximation.

We can also use CLT:

Theorem (CLT for Binomial Distribution)

Let SnBinomial(n,p) for 0<p<1. Then a,bR where a<b, limnP(anSnnpp(1p)b)=ab12πet22dt.

2.1 Application of CLT for Binomial(n,p)

Let p be the proportion of the population supporting Trump.
n be the number of people polled u.a.r from the population.
Sn be the number in the sample who support Trump.
p^n=Snn be the estimator of p.
How large should n be, s.t. P(p[p^nε,p^n+ε])1α for given ε>0 and 0<α<1?

This is the #ConvidenceLevel . For each trial, p^ moves, and this moving interval should cover p at least 100(1α)% of the time.

Denote Ij={1,j-th person polled supported Trump,0,otherwise.
Then I1,,Ini.i.dBernoulli(p),Sn=I1++InBinomial(n,p). By CLT, np^npp(1p)dZN(0,1). so P(p^nεpp^n+ε)=P(εp^npε)=P(εnp(1p)np^npp(1p))Φ(z)Φ(z)=12Φ(z),
so

12Φ(z)1αΦ(z)α2n[Φ1(α/2)ε]2p(1p).

Since p(1p)14,p[0,1], this yields n14[Φ1(α/2)ε].

If ε=0.05,α=0.05, we have n385.
If ε=0.01,α=0.05, we have n9604.