Probability by Conditioning
The theory in this section isn’t new. It’s the old familiar multiplication rule. We are just going to use it in the context of processes indexed by time, in a method that we are going to call conditioning on early moves.
Winning a Game of Dice
Suppose Jo and Bo play the following game. Jo rolls a die, then Bo rolls it, then Jo rolls again, and so on, until the first time one of them gets the face with six spots. That person is the winner.
Question. What is the chance that Jo wins?
Answer. Before you do any calculations, notice that the game isn’t symmetric in the two players. Jo has the advantage of going first, and could win on the first roll. So the probability that Jo wins should be greater than half.
To see exactly what it is, notice that there’s a natural recursion or “renewal” in the setup. For Jo to win, we can condition on the first two moves as follows:
- either Jo wins on Roll 1;
- or Jo gets a non-six on Roll 1, then Bo gets a non-six on Roll 2, and then the game starts over and Jo wins.
So at Time 0 (that is, before the dice are rolled), let x be the chance that Jo is the winner. Then x satisfies an equation:
x=16+(56)2xThis is easy to solve.
1136x = 636 and so x=611which is greater than half as we had guessed.
Gambler’s Ruin: Fair Coin
Let a and b be two positive integers. Suppose a gambler starts with a dollars and bets on the tosses of a coin. Every time the coin lands heads, the gambler wins a dollar. Every time it lands tails, the gambler loses a dollar.
Now suppose the gambler has a stopping rule: he will stop once his net gain is b dollars or he has no money left, whichever happens first. If the gambler ends up when has no money, he is ruined. Our goal in this example is to find the probability that the gambler is ruined.
At each toss we will keep track of the gambler’s net gain. So he will start out at 0 and stop when the he gets to b or −a, whichever happens first.
It’s a good idea to start visualizing the random trajectory of the gambler’s net gain as a path. Here are two graphs that assume a=3 and b=7. The first graph shows a path that leads to the gambler reaching a net gain of $b. The second shows a path to ruin.
Question. What is the probability that the gambler is ruined?
Answer. You can see from the paths above that at the first step the gambler’s net gain will be either -1 or 1, and then we will have to work out the probability of ruin from that point.
For any k, let pk be the chance that the gambler is ruined given that he starts with a net gain of k dollars.
The chance that we are looking for is p0.
By conditioning on the first move, we can see that pk satisfies an equation:
pk=12pk−1+12pk+1, −a+1≤k≤b−1with the “edge cases” defined as
p−a=1 and pb=0Write the left hand side of the equation as 12pk+12pk and rearrange it to see that
pk−pk−1=pk+1−pkThe successive differences are equal, which means that pk is a linear function of k.
Here is the line assuming a=3 and b=7 as before. The red lines show that p0=0.7.
For general a and b, the line starts at (−a,1) and has slope −1/(a+b). So the chance of ruin is
p0 = 1−aa+b = ba+bThe chance that the gambler ends up gaining b dollars is
1−p0=aa+bFor fixed a, this is a decreasing function of b. That makes sense. For fixed a, the larger b is, the harder it is for the gambler to end up making b dollars.
Gambler’s Ruin: Unfair Coin
If the gambler bets on tosses of a coin that lands heads with p≠1/2, then the equations become
pk=q⋅pk−1+p⋅pk+1, −a+1≤k≤b−1where q=1−p, and the edge cases are
p−a=1 and pb=0as before. Now the rearrangement is
q(pk−pk−1)=p(pk+1−pk)which means that the ratio of the successive differences is constant and equal to r=qp. So the probabilities pk are the sums of the terms in a geometric progression with common ratio r. You can check that this works out to
pk=ra+k−ra+b1−ra+b, −a≤k≤band therefore the chance of ruin is
p0 = ra−ra+b1−ra+bNote that if p<1/2 then r>1 and both the numerator and denominator are negative.
Here is a graph of the ruin probabilities, for a=3 and b=10 as before, but now with a coin that is biased towards heads with p=0.6. Not surprisingly, as the initial fortune increases the probability of ruin falls more sharply for this coin than for the fair coin. Even when the gambler starts with only $3, his chance of ruin is less than 30%.