To Lecture Notes

IT 223 -- 2/28/11

 

Review Questions

  1. Suppose that S = x1 + ... + xn, where the x1 are independent, all with expected value E(x) and standard deviation σx.
     
    1. What is E(S)?

      Ans: np.

    2. What is σS?
    3. Ans: sqrt(np(1 - p))

  2. You flip a coin ten times and obtain 3 heads. How many ways are there of obtaining 3 heads out of 10?

    Ans: 10C3 = 10! / (3! (10-3)! = (10*9*8) / (3*2*1) = 5*3*8 = 120.

  3. What is Pascal's Triangle? How can you use it to solve Problem 2?

    Ans: See the 2/24 Notes.

  4. A major league batter has a batting average of 0.360.

    1. What are his chances of getting exactly 6 out of 12 hits in a game?

    2. What are his chances of getting 8 or more hits out of 12 in a game?

  5. What does the Law of Large (LLN) numbers say?

    Ans: That S / n will get closer and closer to E(S)/n with high probability as n gets large.

  6. Who first proved the LLN?

    Ans: Bernoulli in Switzerlans.

  7. What does it mean to say that LLN works by swamping?

    Ans: That even if S/n is not very close to E(S)/n, if enough extra outcomes are obtained, they will swamp the outcomes previously obtained, so S/n will then be close to E(S)/n, with high probability.

  8. What does the Central Limit Theorem (CLT) say?

    Ans: That even if x is not normally distributed, S will be approximately normally distributed if n is large.

  9. Who first stated the CLT?

    Ans: DeMoivre from France.

  10. Who first proved the CLT in its modern form?

    Ans: Lyapunov, from Russia.

  11. Fifty four percent of 1000 voters surveyed in an exit survey say that they voted for Candidate A. Assuming that the voters that participated in the survey are telling the truth, and assuming that the sample is representative of all voters, give a 95% confidence interval for the true probability that a randomly chosen voter will vote for Candidate A.

    By the CLT, the sum S of independent random variables is approximately normally distributed, even if the original random variables are not. Therefore, if z = (S - E(S)) / σS is standard normal and a 95% confidence interval for z is [-2, 2]. (A more accurate confidence interval is [-1.96, 1.96].)

    Ans: n = 1000, S = 540, p^ = S / n = SE(S) = sqrt(1000 * p^ * (1-p^)) = sqrt(1000 * 0.54 * (1-0.54)) = 15.76. Then the confidence interval for the true value of p is obtained by solving for p in the following:

    The confidence interval is (0.50848, 0.57152) or (50.8%, 57.2%).

 

Tests of Hypotheses

 

Project 4

 

The Central Limit Theorem

 

Practice Problems

  1. Use the CLT to estimate the following probabilities for the number of heads obtained from a fair coin:

    1. Obtaining 13 to 16 heads out of 25 tosses. (Because the normal table is continuous, use 12.5 to 16.5 tosses.)

      Ans: SES = √np(1-p) = √25(0.5)(1-0.5) = 2.5. Then z1 = (S - np) / SES = (12.5 - 25(0.5)) / 2.5 = 0, z2 = (S - np) / SES = (16.5 - 25(0.5)) / 2.5 = 1.6; the area under the normal curve for the bin [0,1.6] is 0.9452 - 0.5000 - 0.4452 = 44.5%.

    2. Obtaining between 60 to 75 heads out of 100 tosses. (Use 59.5 to 75.5 tosses.)

      Ans: SES = √np(1-p) = √100(0.5)(1-0.5) = 5. Then z1 = (S - np) / SES = (59.5 - 100(0.5)) / 5 = 1.9, 1.9, 1.9, 1.9, 1.9, 1.9, 1.9, 1.9, 1.9, z2 = (S - np) / SES = (75.5 - 100(0.5)) / 5 = 5.1; the area under the normal curve for the bin [1.9,5.1] is 1 - 0.9713 = 0.0287

    3. Obtaining exactly 30 heads out of 60 tosses. (Use 29.5 to 30.5 tosses.)

      Ans: SES = √np(1-p) = √60(0.5)(1-0.5) = 3.87. Then z1 = (S - np) / SES = (29.5 - 60(0.5)) / 3.87 = -0.129, z2 = (S - np) / SES = (30.5 - 60(0.5)) / 3.87 = 0.129; the area under the normal curve for the bin [-0.129, 0.129] is 0.5517 - 0.4880 = 0.0637 = 6%

  • To test if a four-faced die (shape of a tetrahedron) is fair, such a die is rolled 1600 times. 421 aces are obtained. Perform a test of hypothesis to test whether the die is fair.

    Here are the steps to perform the test of hypothesis:

    1. Write down the null hypothesis:    H0: p = 1/4

    2. Compute the text statistic:    z = (S - E(S)) / SES

    3. Write down a 95% confidence interval:    I = (-1.96, 1.96)

    4. If z ∈ I, accept the null hypothesis that the die is fair. If z ∉ I reject the null hypothesis.

    Ans:
    1. H0: p = 1/4.
    2. S = 421   n = 1,600   SE(S) = sqrt(1,600 * (1/4) * (1-1/4)) = 17.32
      z = (S - np) / SE(S) = (421 - 1,600 (1/4)) / 17.32 = 1.212
    3. The 95% confidence interval using the standard normal table is (-1.96, 1.96)
    4. 1.212 ∈ (-1.96, 1.96), so accept the null hypothesis that the die is fair; we don't have enough evidence to reject the null hypothesis.

    5. Use SPSS to simulate the following situations. Use the random number generator Rv.Binomial(n, p).

      1. 500 outcomes of a Bernouilli random variable with p = 0.5.

      A binomial random variable outputs the sum of n outcomes of a Bernouilli random variable with probability p.

    6. For each of the cases in Problem 9, test whether the random number generator is fair for the given probability.

     

    Misuses of Statistics