Hypothesis Testing ( my )

 

Hypothesis testing is sometimes referred to as a "Test of Significance". We will tend to use the former term but you may encounter the latter in readings or on a quiz or the final so it is important that you know that both terms refer to the same thing.

You should also bear in mind that hypothesis testing is based on sampling distribution theory. You should also note that hypothesis testing is similar to "proof by contradiction" in that you will assume your null hypothesis true and then see if your sample provides enough evidence to refute it.

 

Definition

Given any hypothesis testing problem you will always identify two hypotheses:

  1. Null hypothesis: denoted H0 and, for this class, will be of the following form:

    H0: my = m

  2. Alternative hypothesis: denoted Ha or Hr or H1 and sometimes referred to as the research hypothesis. For this class the alternative hypothesis will be one sided, that is, it will be of the following form:

    Ha: my > m or Ha: my < m

 

Hypothesis Testing Procedure (my )

To conduct a hypothesis test for my you will follow the following four step procedure:

  1. Examine the problem and identify and state the null and alternative hypotheses.

    e.g. Let us say that after examination of a problem statement you identify the null hypothesis to be my = m and you think that my > m, then you would state this as follows:

    H0: my = m

    Ha: my > m

  2. Examine your sample and do the following:
    1. Determine the sample size n and the statistics ybar and sy
    2. Ensure that ybar is consistent with your alternative hypothesis (i.e. if my > m then ensure that ybar > m and if my < m then ensure that ybar < m).

    Note: An inconsistent ybar means that your sample does not support your alternative and so you cannot proceed.

  3. If ybar is consistent then do the following:
    1. Assume H0 true.
    2. Given that H0 is assumed true, determine the p-value.

      Note: The p-value is the proportion of samples of size n that would result in a ybar more extreme than the one observed if H0 is true. To do this you must consider the sampling distribution of ybar. Remember that the sample size determines the sampling distribution:

      1. n large: Since we assume H0 true then mybar =my=m. However, sy is not known but we can estimate it with sy and so we estimate sybar by sybar=sy/sqrt(n). We can then compute z=(ybar - mybar)/sybar and determine the desired proportion (i.e. the p-value).
      2. n small: Again, since we assume H0 true then mybar =my=m. However, sy is not known but we can estimate it with sy and so we estimate sybar by sybar=sy/sqrt(n-1). If y is normally distributed then we can compute t=(ybar - mybar)/sybar and determine the desired proportion (i.e. the p-value).

      Note: The z and t values computed for hypothesis testing problems are sometimes referred to as test statistics.
      Note: Remember that when n is small ybar is only "t" distributed if y is normally distributed. This means that for n small you must ensure that ybar is consistent and y is normally distributed before determining your p-value.

  4. Apply the following decision rule to your p-value.
    1. If p-value is <= 1% then the p-value is highly significant and so you reject H0.
    2. If p-value is <= 5% then the p-value is significant and so you reject H0.
    3. If p-value is > 5% then the p-value is non-significant and so you have insufficient evidence to reject H0.

 

Problem:

The registrar claims that the mean starting salary of CTI graduates (class of 1999) is $45K. You disagree and believe that CTI graduates got better starting salaries and so the mean is higher than claimed. You select a sample of twenty six from the graduating class and determine that the mean starting salary is $47K with a standard deviation of $4K. Conduct a test of hypotheses.

Solution:

Applying the four step procedure:

  1. Identify and state the null and alternative hypotheses.

    H0: my = $45K

    Ha: my > $45K

  2. Examining the sample:
    1. Determine the sample size n and the statistics ybar and sy

      n=26; ybar=$47K and sy=$4K

    2. Ensure that ybar is consistent with alternative hypothesis:

      Since ybar>$45K then ybar is consistent and we may proceed.

  3. If ybar is consistent then:
    1. Assume H0 true.
    2. Given that H0 is true, determine the p-value.

      n small: mybar =my=45; sybar=sy/sqrt(n-1)=4/sqrt(25)=0.8; t=(47 - 45)/0.8=2.5 hence the t test statistic is 2.5 and from the t-table with df=25 the desired proportion (i.e. the p-value) is between 0.5% and 1%.

  4. Apply the decision rule to your p-value.

    Since the p-value is <= 1% then the p-value is highly significant and so we reject H0 and conclude that the mean is higher than claimed (i.e. my > $45K).