Bernoulli trials come out in one of two ways. But many trials come out in multiple different ways, all of which we might want to track. A die can land six different ways. A jury member can have one of several different educational levels. In general, an individual might belong to one of several classes.
The multinomial distribution is a joint distribution that extends the binomial to the case where each repeated trial has more than two possible outcomes. Let’s look at it first in an example, and then we will define it in general.
A box contains 2 blue tickets, 5 green tickets, and 3 red tickets. Fifteen draws are made at random with replacement. To find the chance that there are 4 blue, 9 green, and 2 red tickets drawn, we could start by writing all possible sequences of 4 B’s, 9 G’s, and 2 R’s.
Each such sequence has chance , so all we need for completing the probability calculation is the number of sequences we could write.
There are ways of choosing places to write the B’s.
For each of these ways, there are ways of choosing 9 of the remaining 11 places to write the G’s.
The remaining 2 places get filled with R’s.
So
Notice how this simply extends the binomial probability formula by including a third category.
Analogously, or formally by induction, you can extend the formula to any finite number of categories or classes.
6.3.1Multinomial Distribution¶
Fix a positive integer . Suppose we are running i.i.d. trials where each trial can result in one of classes. For each , let the chance of getting Class on a single trial be , so that .
For each , let be the number of trials that result in Class , so that .
Then the joint distribution of is given by
where for and .
This is called the multinomial distribution with parameters and .
When there are just two classes, then and the formula reduces to the familiar binomial formula written as the joint distribution of the number of successes and the number of failures:
Answer
6.3.2Binomial Marginals¶
No matter how many classes there are, the marginal distribution of each is binomial .
You don’t have to sum terms in the joint distribution to work this out.
is the number of Class individuals in the sample
Each sampled individual is in Class with probability
There are independent draws.
That’s the binomial setting.
6.3.3A Roulette Example¶
A Nevada roulette wheel has 18 red pockets, 18 black pockets, and 2 green pockets. Each time the wheel is spun, all 38 pockets are equally likely to win, independent of all other spins.
Question: If the wheel is spun 10 times, what is the chance that red and black win an equal number of times?
Answer: Let be the number of times red wins, the number of times black wins, and the number of times green wins.