CSC423/324 - Data Analysis
Quiz #5
There are two questions for a total of 45 points. Plan on spending about
30 minutes on all questions.
You may use your notes to answer the questions. Please submit
written answers to me at the start of class on Thursday, 3/6.
- A software training consultant believes that a short training
session can dramatically improve the efficiency of casual
Microsoft Word users. She decides to conduct an
experiment to investigate this issue and randomly selects two
groups of casual Microsoft Word
users from her organization. She decides to use one
group as a Control group and
the other group as a Treatment group.
That is, the Control group will not
receive training but
each member in the Treatment group will receive a short training
session on various Microsoft Word features. She then assigns a
suite of tasks to each individual in each group and records the time
required to complete the suite. (20pts)
- Given the problem statement above, identify and state the null
and alternative hypotheses (i.e. the primary hypotheses).
- Examine
this
SAS output. Given the SAS output and your hypotheses,
conduct a test of hypotheses. Remember to address the following issues:
- What can you say about normality for each population.
- Comment on the significance of the p-value for your
primary hypotheses and comment on the consultants point of view.
- You are interested in comparing two different techniques
(say method a and method b) that
may be used to conduct software inspections.
You would like to determine if one is better than the other.
To evaluate the methodologies you
decide to randomly select
two groups of programmers from your software development team. You
train each group on one of
the techniques and then give them a few days to get comfortable with
the technique. Each programmer is then given a
suite of modules to inspect and the number of errors detected
is recorded. (25pts)
- Given the problem statement above, identify and state the null
and alternative hypotheses (i.e. the primary hypotheses).
- Examine
this
SAS output. Given the SAS output and your hypotheses,
conduct a thorough analysis. That is, address the following issues:
- Normality: Do you need to establish normality for each population?
If so, state the normality hypotheses and provide a p-value to address
them.
- Equal Variance: Do you need to establish this requirement? Can
you establish it by way of hypotheses? If so, state the hypotheses
and provide a p-value to address them or state the mechanism you will
use to establish this requirement and comment on the requirement.
- Comment on the significance of the p-value for the primary
hypotheses.
- Given your findings above, comment on the inspection techniques.
Can you say which is better? If so, identify the better technique and,
if not, explain why not.