For each question, show your work or give a reason explaining your answer. 4 points for the reason, 1 point for the correct answer.
Ans: b. 2. The sample mean and SD completely describe a univariate normal dataset.
Ans: d. occupation. Nominal variables are categorical or nonnumeric variables.
Ans: Q1 is the median of the bottom half of the dataset (bottom six numbers) is the mean of 0.028 and 0.030, which is 0.029. Q3 is the median of the top half (top six numbers), is the mean of 0.039 and 0.049, which if 0.044. IQR = 0.044 - 0.039 = 0.015.
Ans: a. This box plot has a long whisker to the left. b is symmetric, and c and d are skewed to the right.
Ans: a. z = (340 - 336) / 3 = 1.33. Looking up the bin (-∞ 1.33] in the normal table gives an area of 0.9082. The area under the bin [1.33, ∞) is 1 - 0.9082 = 0.0918 = 9.2%.
Ans: c. Look up 0.8000 in the body of the normal table. The closest z-score is 0.84. Since IQ scores are scaled to have a mean of 100 and SD = 15, we have 0.84 = (x - 100) / 15. Solving for x gives 112.6.
Ans: a. Extrapolation (going past the range of the data for predictions) is never a good idea because the response curve might be nonlinear in an unexpected way.
Ans: d. The r-squared value is the percentage of variation in y that can be explained by x. R2 = 0.852 = 0.7225.
Ans: Compute the average of the products:
Show all of your work. You may use a calculator.
Bin | Percentage of Observations |
---|---|
[1,3) | 30% |
[3,4) | 40% |
[4,5) | 10% |
[5,7) | 10% |
[7,11] | 10% |
Ans: Recall that the area, not the height, of each bar is proportional to the number of observations in that bin. The heights are 30/(3-1) = 15, 40/(4-3) = 40, 10/(5-4) = 10, 10/(7-5) = 5, 10/(11-7) = 2.5.
Ans: The cumulative frequencies are 30, 70 80, 90, 100.
Q1: 25% is 5/6 of the
way from 0 to 30, so Q1 must be 5/6 of the way from 1 to 3:
1 + (5/6)(3-1) = 2.667.
Q2: 50% is half of the way from 30 to 70, so Q2 is half of the way from
3 to 4: 3.5.
Q3: 75% is half of the way from 70 to 80, so Q3 is half of the way from
4 to 5: 4.5.
IQR = Q3 - Q1 = 4.5 - 2.667 = 1.833.
Ans: (2 × 30 + 3.5 × 40 + 4.5 × 10 + 6 × 10 + 9 × 10) / (30 + 40 + 10 + 10 + 10) = 3.95.
Ans: 4.5 to 5 is half of the bin [4,5) and 5 to 7 is the compute bin, so the percentage is (1/2)10 + 10 = 15%.
Ans: Pick the numbers 4, 4, 4, 4, x. No matter what x is, the median is 4. Since the mean is 7, solve for x in (4 + 4 + 4 + 4 + x) / 5 = 7 to get x = 19.
y = 65 SDy = 20
r = 0.6
Ans: z = (80 - 65) / 20 = 0.75. The normal table gives 0.7704 for the area over the bin [0.75, ∞), so the answer is 1 - 0.7704 = 0.2296 or 23%.
Ans: The regression line is
y - 65 = (0.6)(20/25)(x - 60)
y = 0.48(x - 60)
y = 0.48x + 36.2
Ans: Plug x = 80 into the regression equation: 74.6
Ans: Plug x = 50 into the regression equation: 60.2
Perform the following analyses with SPSS. Save your output file as a Word .doc file. Type any interpretation of the output into the output file itself. Questions marked with an asterisk (*) require typed output in addition to the SPSS output.
Ans: Q0 = 576.8, Q1 = 577.64, Q2 = 578.81, Q3 = 579.76, Q4 = 581.56, mean = 578.86, SD = 1.31, SE(ave) = 0.252.
Ans: [578.34, 579.37]
Ans: No outliers on the boxplot.
Ans:
Ans: The plot is relatively homoscedastic, but biased. There is a trend from higher to lower water levels.
Ans: r = -0.819, a negative relationship
Ans:y = -0.007 x + 41.607
Ans: The residual plot shows that the residuals are relatively unbiased and homoscedastic.
Ans: Except for one point on the upper right, the residuals are fairly normal.