Exam on WebCT.
Under construction
1Question 1¶
Let $X_1, \ldots, X_n$ be a random sample from the density function
$$f(x) = \begin{cases} \displaystyle \frac{4}{\theta} x^3 e^{-x^4/\theta} & \text{if } x > 0, \\ 0 & \text{if } x \leq 0,\end{cases}$$
where $\theta > 0$.
(a) Find a sufficient statistic for $\theta$.
(b) Find the maximum likelihood estimator $\hat \theta$ of $\theta$.
(c) Show that your MLE of part (b)1 is unbiased.
(d) Is your MLE an MVUE? Why?
1.1Solution¶
1.2Accuracy and discussion¶
2Question 2¶
(a) Given $\alpha$ with $0 < \alpha < 1$, let $F_{\alpha, n_1, n_2}$ be such that
$$Pr\{Y > F_{\alpha, n_1, n_2}\} = \alpha$$
where $Y$ has the $F$-distribution with $n_1, n_2$ degrees of freedom. Show that $\displaystyle F_{1-\alpha,n_1, n_2} = \frac{1}{F_{\alpha, n_1, n_2}}$.
(b) Given independent random samples of sizes $n_1$ and $n_2$ from the population $N(\mu_1, \sigma_1^2)$ and $N(\mu_2, \sigma_2^2)$ respectively, use the pivot
$$\frac{s_1^2 / \sigma_1^2}{s_2^2 / \sigma_2^2}$$
to construct a $(1-\alpha) \times 100\%$ confidence interval for $\sigma_2^2/\sigma_1^2$.
(c) Two soft drink bottling machines 1 and 2 are to be compared. Five ten ounce bottles are filled by each machine and the resulting contents in fluid ounces were 8.0, 8.3, 7.2, 7.7, 8.1 for machine 1 and 6.0, 9.3, 7.3, 6.7, 8.9 for machine 2. Assuming that the data come from normal populations $N(\mu_1, \sigma_1^2)$ and $N(\mu_2, \sigma_2^2)$, find a 90% confidence interval for $\sigma_2/\sigma_1$.
2.1Solution¶
Later
2.2Accuracy and discussion¶
Later
3Question 3¶
1200 U.S. stores are classified according to type and location, with the following results:
Observed cell frequencies
N | S | E | W | |
---|---|---|---|---|
Clothing stores | 219 (195) | 200 (195) | 181 | 180 |
Grocery stores | 39 (60) | 52 | 89 | 60 |
Other | 42 (45) | 48 | 30 | 60 |
Some expected cell frequencies were calculated and are given in parentheses. Using level 0.05 of significance, test the hypothesis that type and location are independent.
(b) A group of rats, one by one, proceed down a ramp to one of five doors, with the following results:
Door | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|
Number of rats choosing this door | 36 | 23 | 30 | 31 | 30 |
Are the data sufficient to indicate that the rats show a preference for certain doors? That is, test the hypotheses
$$\begin{align*}H_0 & : p_1 = p_2 = p_3 = p_4 = p_5 = \frac{1}{5} \\ H_1 & : \text{not } H_0,\end{align*}$$
where $p_i$ = probability of choosing door $i$. Use $\alpha = 0.01$.
3.1Solution¶
3.2Accuracy and discussion¶
4Question 4¶
An experiment was conducted to determine the effect of three methods A, B and C of soil preparation on the first year growth of pine seedlings. Four locations 1, 2, 3, and 4 were selected, and each location was divided into three plots. A randomised block design was employed using locations as blocks. On each plot, the same number of seedlings was planted, and the average first-year growth in centimetres was recorded. These observations are given in the following table.
Method | Location 1 | Location 2 | Location 3 | Location 4 |
---|---|---|---|---|
A | 11 | 13 | 16 | 10 |
B | 15 | 17 | 20 | 12 |
C | 10 | 15 | 13 | 10 |
(a) Complete the following ANOVA table.
Source of variation | Degrees of freedom | Sum of squares | Mean square | F |
---|---|---|---|---|
Treatments | 38 | |||
Blocks | 61.67 | |||
Errors | ||||
Total | 111 |
(b) Do the data provide sufficient evidence to indicate differences in the mean growth for the three soil preparations? Use $\alpha = 0.05$.
(c) Is there evidence to indicate differences in the mean growth for the four locations? Use $\alpha = 0.05$.
4.1Solution¶
4.2Accuracy and discussion¶
5Question 5¶
We observe a binomial random variable $X$ with parameters $n$ and $\theta$, i.e.
$$P[X = k] = \left ( \frac{n}{k} \right ) \theta^k(1-\theta)^{n-k}, \quad k = 0, 1, \ldots, n.$$
(a) Use the Neyman-Pearson lemma to find a most powerful critical region of size $\alpha$ for testing
$$\begin{align*} H_o\,:\, & \theta = \theta_0 \\ H_1 \,:\, & \theta = \theta_1\end{align*}$$
where $\theta_1 > \theta_0$.
(b) Suppose that $n = 20$ an we want to test
$$\begin{align*} H_o\,:\, & \theta = 0.3 \\ H_1 \,:\, & \theta = 0.5\end{align*}$$
at level $\alpha = 0.05$. What is the critical region?
5.1Solution¶
5.2Accuracy and discussion¶
6Question 6¶
The median sale prices for new single-family houses is given in the following table for the eight years 1972 through 1979 (i.e. year 3=1974).
Year x | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
Price $y$ (in thousands of dollars) | 27.6 | 32.5 | 35.9 | 39.3 | 44.2 | 48.8 | 55.7 | 62.9 |
The model $y = \beta_0 + \beta_1 x + \epsilon$ is to be fitted to this data.
(a) Find least squares estimates of $\beta_0$ and $\beta_1$. Hint: $\bar y = 43.3625$, $S_{xy} = 203.35$, $S_{yy} = 1460.25$.
(b) Is there sufficient evidence to indicate that the median sales price increased over the period from 1972 through 1979? Use $\alpha = 0.01$. Recall that
$$t = \frac{(\hat \beta_1 - \beta_1) \sqrt{\displaystyle S_{xx}}}{\hat \sigma}, \quad \hat \sigma^2 = \frac{SSE}{n-2}, \quad SSE = S_{yy} - \hat \beta_1 S_{xy}.$$
(c) Find a 99% confidence interval for $\beta_1$.
6.1Solution¶
Later
6.2Accuracy and discussion¶
Later
-
It says part (c) in the exam booklet but that's probably wrong. ↩