Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

1. Let XX have the distribution given by

                         x~~~~~~~~~~~~~~~~~~~~~~~~~x123
P(X=x)P(X=x)0.20.50.3

Find E(X)E(X) and SD(X)SD(X).

2. Suppose P(X=x0)=1pP(X = x_0) = 1-p and P(X=x1)=pP(X = x_1) = p for values x0x_0 and x1x_1 such that x1>x0x_1 > x_0.

Write XX as a linear function of an indicator that has value 1 with probability pp, and hence find SD(X)SD(X) in terms of pp, q=1pq=1-p, and d=x1x0d=x_1-x_0.

3. Let XX have the Poisson (μ)(\mu) distribution and let YY have the Poisson (λ)(\lambda) distribution independent of XX.

(a) What is the distribution of X+YX+Y?

(b) Which of the following statements is (or are) true? Pick all that are true and justify your choices.

(i) E(X+Y)=E(X)+E(Y)E(X+Y) = E(X) + E(Y)

(ii) SD(X+Y)=SD(X)+SD(Y)SD(X+Y) = SD(X) + SD(Y)

(iii) Var(X+Y)=Var(X)+Var(Y)Var(X+Y) = Var(X) + Var(Y)

4. Let p(0,1)p∈(0,1) and let XX be the number of spots showing on a flattened die that shows its six faces according to the following chances:

P(X=1)=P(X=6)P(X=1)=P(X=6)

P(X=2)=P(X=3)=P(X=4)=P(X=5)P(X=2)=P(X=3)=P(X=4)=P(X=5)

P(X=1P(X=1 or 6)=p6)=p

Find SD(X)SD(X) and explain why it is an increasing function of pp. Compare your answer with the answer you got for the mean absolute deviation in Chapter 8.

5. Consider a sequence of i.i.d. Bernoulli (p)(p) trials. Let TT be the number of trials till the first success and let FF be the number of failures before the first success. You know that TT has the geometric (p)(p) distribution on {1,2,3,}\{1, 2, 3, \ldots\} and that E(T)=1pE(T) = \frac{1}{p}. We will show later, by conditioning, that SD(T)=qpSD(T) = \frac{\sqrt{q}}{p} where q=1pq = 1-p. For now you can just assume that it is true.

(a) Write FF as a function of TT and hence find E(F)E(F) and SD(F)SD(F).

(b) Find the distribution of FF. It is called the geometric (p)(p) distribution on {0,1,2,}\{0, 1, 2, \ldots \}.

6. A random variable XX has expectation 20 and SD 2. Find the best upper and lower bounds you can on

(a) P(15<X<25)P(15 < X < 25)

(b) P(15<X<30)P(15 < X < 30)

7. Consider a probabilistic model that has a numerical parameter θ\theta. A “probabilistic model” is just a set of assumptions about randomness. Let the random variable TT be an estimator of θ\theta. Frequently, TT is a statistic based on a random sample.

Recall that the bias of TT is defined as Bθ(T) = Eθ(T)θB_\theta (T) ~ = ~ E_\theta (T) - \theta, where the subscript θ\theta reminds us that θ\theta is the true value of the parameter.

The mean squared error of the estimator TT is MSEθ(T) = Eθ((Tθ)2)MSE_\theta (T) ~ = ~ E_\theta \big( (T - \theta)^2 \big).

Follow the calculation in Section 12.2 of the textbook to show the bias-variance decomposition given by

MSEθ(T) = Varθ(T)+Bθ2(T)MSE_\theta (T) ~ = ~ Var_\theta(T) + B_\theta^2(T)

Note that the square in the bias term makes sense. Bias has the same units as TT, whereas the MSE and variance are in the square of those units.