Alice and Bob are playing the following guessing game: 1. Bob writes down two different numbers on two separate cards: . 2. Alice picks one of the cards u.a.r. and looks at the number. 3. Alice wins if she correctly guesses which of the two cards has a larger number. Can Alice do better than random guess?
Yes! Suppose Bob's numbers are . Let e the number Alice picked.
if , call the other card as having a larger number.
if , call to be the larger number.
Denote , then
1 Distribution Derived from 0-1 Sequence
Consider a sequence of s and s. (call this Bernoulli Process) Denote each index a RV .
1.1 Bernoulli Distribution
If , then . .
If , then call this a Bernoulli process. Let . This leads to:
Denote as the waiting time between the th and th successes. Then . .
Then
These moments can be computed more easily using MGF.
By here we know the MGF for is so
1.4 Negative Binomial Distribution
. Use to denote the total waiting time to the th success. Then , . Then
Use to denote number of failures before th success, then . Denote .
Then
2 Hypergeometric Distribution
There are blue balls and red balls. . Sample size .
If sample balls u.a.r with replacement, let be the number of blue balls in (sample space). Then , .
If sample balls u.a.r without replacement, let be number of blue balls. Obviously, when or ,
For , consider as an event that . Denote
@ Key observation: for another event with the same condition, the probability is the same.
So there are distinct sequences with blue balls and red balls. So
This distribution is called Hypergeometric Distribution, .
as .
2.1 Expectation
Define indicator RV
Then , is exchangeable. So , so
This is the same as expectation of .
2.2 Variance
By exchangeability,
Since
then and
Yellow part is the variance of ; green part .
3 Multinomial Distribution
Consider repeated trials. Each with types of outcome. (For Bernoulli trials, ; for rolling a dice, )
Denote as number of trials, as number of times type is observed.
Denote ( are not independent given ), then , with .
A possible outcome should satisfy . So where .
Poissonization of the Multinomial
Now suppose number of trials is a RV . We want to determine distribution of . So
So if ,