1. A bridge hand is a simple random sample of 13 cards from a standard deck of 52 cards. The face cards are the Jacks, Queens, and Kings; there are 12 face cards in all. Let $X$ be the number of face cards in a bridge hand.
(a) Fill in the blank with the name of a distribution along with the appropriate parameters: $X$ has the ________ distribution.
(b) Find .
(c) Find .
2. Let $X$ have the distribution given in the table below.
| value | -2 | -1 | 0 | 1 |
|---|---|---|---|---|
| probability | 1/8 | 1/2 | 1/4 | 1/8 |
Find
(a)
(b)
(c)
(d)
(e)
3. A student is taking a True/False test in which there are 30 questions. For each correct answer, the student will get 2 points. For each wrong answer, the student will lose 1 point. If the student leaves a question unanswered, the student will get 0 points for that question.
The student answers all of the questions by rolling a die. If the die shows 1 or 2 spots, the student doesn’t answer the question. If it shows 3 or 4 spots, the student chooses False. If it shows 5 or 6 spots, the student chooses True. Find the student’s expected score on the test.
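One way to sanity-check an answer to this problem is a short simulation of the die-rolling strategy. The sketch below is only illustrative: the function names, the default answer key (all True), and the number of simulated tests are assumptions made for the example, not part of the problem. By symmetry the choice of answer key should not affect the long-run average.

```python
import random


def simulate_score(rng, answer_key):
    """Score one test answered by rolling a die for every question."""
    score = 0
    for correct_answer in answer_key:
        roll = rng.randint(1, 6)
        if roll <= 2:
            # 1 or 2 spots: question left unanswered, 0 points
            continue
        # 3 or 4 spots: choose False; 5 or 6 spots: choose True
        guess = roll >= 5
        score += 2 if guess == correct_answer else -1
    return score


def average_score(num_tests=100_000, seed=0, answer_key=None):
    """Average score over many simulated tests.

    answer_key is a hypothetical key; it defaults to 30 questions whose
    correct answer is True.
    """
    if answer_key is None:
        answer_key = [True] * 30
    rng = random.Random(seed)
    total = sum(simulate_score(rng, answer_key) for _ in range(num_tests))
    return total / num_tests


if __name__ == "__main__":
    print(average_score())
```

The printed average should be close to the expected score obtained by hand using additivity of expectation over the 30 questions.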
4. Let and let be the number of spots showing on a flattened die that shows its six faces according to the following chances:
(a) Find .
(b) Find . Explain algebraically and also by an intuitive argument why the answer is an increasing function of .
5. A fair -faced die is rolled times. Find the expected number of distinct faces seen.
6. Fix a positive integer $n$. Suppose you decide to run i.i.d. Bernoulli trials until either a trial results in 1 or you have run $n$ trials, whichever happens first. Let $X$ be the number of trials you run.
(a) Find $E(X)$ by using the tail sum formula.
(b) Find $E(X)$ by using the non-linear function rule.
7. A group of 100 students contains 20 Data Science majors. I pick students from the group one by one at random, until I pick a Data Science major. Let $X$ be the number of students I pick.
In each part below, provide a numerical answer as an integer or as a ratio of integers, without using a calculator or computer.
(a) Find $E(X)$ if the students are picked with replacement.
(b) Find $E(X)$ if the students are picked without replacement.
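The problem asks for answers by hand, but a simulation can be used afterwards to check them. The sketch below estimates the average number of picks under both sampling schemes. The function names, the labeling of students 0 through 19 as the Data Science majors, and the number of trials are all assumptions made for illustration.

```python
import random


def picks_until_major(rng, with_replacement, group_size=100, num_majors=20):
    """Number of students picked up to and including the first major.

    Students are labeled 0..group_size-1; labels below num_majors are
    treated as the Data Science majors (an arbitrary labeling).
    """
    if with_replacement:
        picks = 0
        while True:
            picks += 1
            if rng.randrange(group_size) < num_majors:
                return picks
    # Without replacement: a random ordering of the whole group
    order = rng.sample(range(group_size), group_size)
    for picks, student in enumerate(order, start=1):
        if student < num_majors:
            return picks


def estimate_mean_picks(with_replacement, trials=100_000, seed=0):
    """Monte Carlo estimate of the average number of picks."""
    rng = random.Random(seed)
    total = sum(picks_until_major(rng, with_replacement) for _ in range(trials))
    return total / trials


if __name__ == "__main__":
    print("with replacement:   ", estimate_mean_picks(True))
    print("without replacement:", estimate_mean_picks(False))
```

The two estimates should be close to, but not equal to, each other, with the without-replacement average slightly smaller because a major must appear within the first 81 picks.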
8. A robot types on a 26-letter keyboard that has lowercase letters only. Each letter is chosen independently and uniformly at random from the alphabet. If the robot types letters (), what is the expected number of times the sequence probability appears?
9. A box contains red balls, blue balls, and green balls. Balls are drawn one by one without replacement until all the red balls are drawn. Let be the number of draws made. Find . Then evaluate it (no calculators or computers) when .
10. Let and be positive integers, and suppose n draws are made at random with replacement from . Find the expectation of the minimum of the values drawn.
11. Suppose there are distinguishable pairs of socks, but a prankster chooses of these socks at random and pokes holes in them. Let be the number of undamaged pairs of socks. Find .
12. The distribution of a random variable involves an unknown parameter .
(a) Find .
(b) Let be i.i.d. with the same distribution as , and let be the sample average. Use to construct an unbiased estimator of .
13. Use probability theory to explain why the math identity below is true for all .
14. Survey respondents understandably don’t like to answer questions about sensitive topics such as illegal drug use. If data scientists want to estimate the proportion of illegal drug users in a population, they have to devise methods of getting the information they need while maintaining the privacy of the individual respondents.
Randomized response schemes are often used in such situations. In one such scheme, each surveyed person is given a coin and asked to answer YES or NO after following these instructions out of sight of the surveyor:
Toss the coin.
If it lands heads, then truthfully answer, “Do you use illegal drugs?”
If it lands tails, then toss it again and answer, “Did the second toss land heads?”
This way each respondent answers YES or NO but the surveyor doesn’t know which question was answered. The data scientists then have to estimate the proportion of illegal drug users based on the overall proportion of YES answers, which includes the YES answers to the second question.
Let the unknown proportion of illegal drug users in a large population be $p$, and suppose a random sample of size $n$ is surveyed using the scheme above. You can assume that the sampling is equivalent to drawing at random with replacement.
(a) Let $X$ be the proportion of sampled people who answer YES. Find $E(X)$.
(b) Use $X$ to construct an unbiased estimate of $p$.
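The randomized response scheme described above can be simulated directly, which gives a way to check an answer to part (a) numerically. The sketch below is a minimal illustration; the function name `yes_proportion`, its parameter names, and the example values of the true proportion are assumptions chosen for the example.

```python
import random


def yes_proportion(true_proportion, sample_size, rng):
    """Proportion of YES answers in one simulated survey.

    Each respondent is drawn independently (sampling with replacement)
    and follows the two-toss instructions with a fair coin.
    """
    yes_count = 0
    for _ in range(sample_size):
        uses_drugs = rng.random() < true_proportion
        if rng.random() < 0.5:
            # First toss heads: answer the sensitive question truthfully
            answer_yes = uses_drugs
        else:
            # First toss tails: toss again and report whether it was heads
            answer_yes = rng.random() < 0.5
        yes_count += answer_yes
    return yes_count / sample_size


if __name__ == "__main__":
    rng = random.Random(0)
    for true_proportion in (0.0, 0.2, 0.5, 1.0):
        print(true_proportion, yes_proportion(true_proportion, 200_000, rng))
```

Plotting or tabulating the simulated YES proportion against the true proportion shows a straight-line relationship, which is what makes an unbiased estimate in part (b) possible.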