We know what it means for two numbers to be equal: they are at the same spot on the number line. Equality of random variables, however, can be of more than one kind.
3.3.1Equal¶
Two random variables and defined on the same outcome space are equal if their values are the same for every outcome in the space. The notation is and it means that
This is the usual definition of the equality of two mathematical functions. Informally, it says that when has the value 10 then must be 10 too; when is 11, must be 11; and so on.
An example will make this clear. Let be the number of heads in three tosses of a coin, and let be the number of tails in the same three tosses.
Now consider the new random variable . The two random variables and are equal. For every possible outcome of the three tosses, the value of is equal to the value of .
We write this simply as . Equivalently, .
🎥 See More
3.3.2Equal in Distribution¶
and , as defined above, are not equal. For example,
However, there is a sense in which the number of heads “behaves in the same way” as the number of tails. The two random variables have the same probability distribution.
The outcome space is three_tosses:
coin = make_array('H', 'T')
three_tosses = list(product(coin, repeat=3))
three_tosses[('H', 'H', 'H'),
('H', 'H', 'T'),
('H', 'T', 'H'),
('H', 'T', 'T'),
('T', 'H', 'H'),
('T', 'H', 'T'),
('T', 'T', 'H'),
('T', 'T', 'T')]There are only eight outcomes, so it is easy to inspect the table and write the distributions of and . Both take the values with probabilities respectively. This distribution is shown in the table below.
dist = Table().values(np.arange(4)).probabilities(make_array(1, 3, 3, 1)/8)
distWe say that and are equal in distribution.
In general, two random variables and are equal in distribution if they have the same probability distribution.
That is, they have the same set of possible values and the same probabilities for all those values.
Equality in distribution is denoted as
3.3.3Relation between the Equalities¶
Equality is stronger than equality in distribution. If two random variables are the same, outcome by outcome, then they must have the same distribution because they are the same function on the outcome space.
That is, for any two random variables and ,
But as the example of heads and tails in three tosses shows, the converse need not be true.
Answer
Fill (i), (ii), (iii), and (v) with and (iv) with .
3.3.4Example: Two Cards Dealt from a Small Deck¶
A deck contains 10 cards, labeled 1, 2, 2, 3, 3, 3, 4, 4, 4, 4. Two cards are dealt at random without replacement. Let be the label on the first card and be the label on the second card.
Question 1: Are and equal?
Answer 1: No, because for example the outcome could be in which case and .
Question 2: Are and equal in distribution?
Answer 2: Let’s find the two distributions and compare. Clearly the possible values are 1, 2, 3, and 4 in each case. The distribution of is easy:
When a distribution is defined by a formula like this, you can define a function that does what the formula says:
def prob1(i):
return i/10You can create a probability distribution object for using values as before but now with the probability_function method.
The argument to probability_function is the name of the function that takes as its argument and returns .
possible_i = np.arange(1, 5, 1)
dist_X1 = Table().values(possible_i).probability_function(prob1)
dist_X1Convince yourself that the function prob2 below returns for each . The event has been partitioned according to the value of .
def prob2(i):
if i == 1:
return (9/10)*(1/9)
else:
return (i/10)*((i-1)/9) + ((10-i)/10)*(i/9)dist_X2 = Table().values(possible_i).probability_function(prob2)
dist_X2The two distributions are the same! Here is yet another example of symmetry in sampling without replacement. The conclusion is
Answer
Both are true. For (ii), calculate as an example to see what’s going on.