CSC423/324 - Data Analysis
Quiz #5

There are two questions for a total of 45 points. Plan to spend about 30 minutes in total.

You may use your notes to answer the questions. Please submit written answers to me at the start of class on Thursday, 3/6.

  1. A software training consultant believes that a short training session can dramatically improve the efficiency of casual Microsoft Word users. She decides to conduct an experiment to investigate this issue and randomly selects two groups of casual Microsoft Word users from her organization. She decides to use one group as a Control group and the other group as a Treatment group. That is, the Control group will not receive training but each member in the Treatment group will receive a short training session on various Microsoft Word features. She then assigns a suite of tasks to each individual in each group and records the time required to complete the suite. (20pts)

    1. Given the problem statement above, identify and state the null and alternative hypotheses (i.e. the primary hypotheses).
    2. Examine this SAS output. Given the SAS output and your hypotheses, conduct a test of hypotheses. Remember to address the following issues:
      1. What can you say about normality for each population?
      2. Comment on the significance of the p-value for your primary hypotheses and on the consultant's point of view.

  2. You are interested in comparing two different techniques (say, Method A and Method B) for conducting software inspections, and you would like to determine whether one is better than the other. To evaluate the techniques, you randomly select two groups of programmers from your software development team. You train each group on one of the techniques and give them a few days to get comfortable with it. Each programmer is then given a suite of modules to inspect, and the number of errors detected is recorded. (25pts)

    1. Given the problem statement above, identify and state the null and alternative hypotheses (i.e. the primary hypotheses).
    2. Examine this SAS output. Given the SAS output and your hypotheses, conduct a thorough analysis. That is, address the following issues:
      1. Normality: Do you need to establish normality for each population? If so, state the normality hypotheses and provide a p-value to address them.
      2. Equal Variance: Do you need to establish this requirement? Can you establish it by way of hypotheses? If so, state the hypotheses and provide a p-value to address them. If not, state the mechanism you will use to establish this requirement and comment on whether it is met.
      3. Comment on the significance of the p-value for the primary hypotheses.
    3. Given your findings above, comment on the inspection techniques. Can you say which is better? If so, identify the better technique; if not, explain why not.
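
For reference, the quantities behind a two-sample comparison of this kind can be computed by hand. The following is a minimal sketch in Python, not the SAS procedure used for the quiz output. The function name `two_sample_summary` and the sample values are hypothetical and exist only for illustration. It computes the variance ratio (larger over smaller) used when checking equal variances, the pooled t statistic (which assumes equal variances), and the Welch/Satterthwaite t statistic with its approximate degrees of freedom (which does not). It does not compute p-values or test normality.

```python
from math import sqrt
from statistics import mean, variance


def two_sample_summary(a, b):
    """Return test statistics for comparing the means of two independent samples."""
    n1, n2 = len(a), len(b)
    m1, m2 = mean(a), mean(b)
    v1, v2 = variance(a), variance(b)  # sample variances (n - 1 denominator)

    # Variance ratio for the equal-variance check: larger variance over smaller.
    f_ratio = max(v1, v2) / min(v1, v2)

    # Pooled t statistic: assumes both populations share one variance.
    pooled_var = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
    t_pooled = (m1 - m2) / sqrt(pooled_var * (1 / n1 + 1 / n2))

    # Welch t statistic: does not assume equal variances.
    se_sq = v1 / n1 + v2 / n2
    t_welch = (m1 - m2) / sqrt(se_sq)

    # Satterthwaite approximation to the degrees of freedom for the Welch test.
    df_welch = se_sq ** 2 / (
        (v1 / n1) ** 2 / (n1 - 1) + (v2 / n2) ** 2 / (n2 - 1)
    )

    return {
        "f_ratio": f_ratio,
        "t_pooled": t_pooled,
        "df_pooled": n1 + n2 - 2,
        "t_welch": t_welch,
        "df_welch": df_welch,
    }


if __name__ == "__main__":
    # Hypothetical error counts for two groups; NOT data from the quiz.
    group_a = [12, 15, 11, 14, 13, 16]
    group_b = [9, 14, 10, 12, 8, 11]
    for name, value in two_sample_summary(group_a, group_b).items():
        print(f"{name}: {value:.4f}")
```

Which t statistic to rely on depends on the outcome of the equal-variance check, which is why the sketch reports both rather than choosing one.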