To Lecture Notes

IT 223 -- 2/16/11

 

Review Questions

  1. What is the regression fallacy (also called regression to the mean)?

    Ans: Regression to the mean says that, given a pre-test/post-test situation, someone that does above average on the pre-test will do worse on the post-test; someone that does below average on the pre-test will do worse on the post-test.

  2. What is the RMSE and how do you compute it?

    Ans: The root mean squared error (RMSE) is the standard deviation of the residuals. It can be computed as RMSE = SDy √(1 - r2).

  3. The heights and weights of a population of female models are recorded and descriptive statistics are computed:

    The heights and weights are bivariate normal.

    1. About what percentage of the models have weights over 58kg?

      Ans: Compute z = (y - y) / SDy = (58 - 50) / 5 = 1.6. Look up -1.6 in the standard normal table: 0.0548 = 5.5%.

    2. Find the regression line for predicting weight from height.

      Ans: y = 80x - 86.

    3. Find the RMSE.

      Ans: RMSE = SDy √(1 - r2)i 5 √(1 - 0.82) = 5 * 0.6 = 3.

    4. Of those models that have a height close to 1.8m, what percentage of them have weights over 58kg.

      Ans: z = (y - y^) / RMSE = (58 - 58) / 5 = 0. 50% of the models have a weight over z = 0.

  4. Define these terms:

    1. Sample Space   Ans: The set of possible outcomes of a probability experiment.

    2. Probability Function   Ans: A function that assigns a number between 0 and 1 to each outcome in the sample space. The sum of the probabilities for the entire sample space must be 1.

    3. Random Variable   Ans: The process of choosing a random number. The more technical definition of a random variable is a probability function where all of the outcomes are real numbers.

  5. What is wrong with this argument? Either the Cubs will win the World series or they won't, so the probability that they will win the world series is 50%.

    Ans: Every event has a probability assigned to it. Likely events have high probabilities; unlikely events have low probabilities. Just because you don't know the probability of an event does not mean that it is 50%.

  6. Find as many errors as you can with this probability distribution?

    Outcome Probability
    3 60%
    5 30%
    6 0%
    7 -10%
    9 110%

    Ans: Probabilities cannot be negative; probabilties cannot be more than 100%; the probabilities in the table must sum to 100%.

  7. Is a "double down after a loss" a good idea in gambling? This means that every time you loose, you double your bet; when you win, you recover your losses. Then you start over with your normal bet after you win.

    Ans: This looks good, except for thing. If you lose too many times in a row, you will either run out of money or will not be able to double because you have reached the betting limit of the casino. This can easily wipe out all of your winnings in one play.

  8. What is a Bernoulli random variable?

    Ans: A random variable whose only possible outcomes are 0 (with probability 1-p) and 1 (probability p). This can be expressed in a table like this:

    x P(x)
    0 1-p
    1 p

  9. What are three ways to obtain probabilities?

    Ans: Theoretical or a priori, empirical, subjective.

  10. A bookie offers 15 to 1 odds for the Cubs to win the World Series in 2009. If this is a fair bet, what is the probability that the Cubs will win the World Series?

    Ans: 15p + (-1)(1 - p) = 0; 15p - 1 + p = 0; 16p = 1; p = 1/16 = 0.0625 = 6.25%.

 

Expected Value

 

The Multiplication Rule for Independent Events