Programming Assignment 1
CSC 323 - Data Analysis and Statistical Software
Due: 2/7/2002
The new director of software development at a large local company has decided to investigate software quality. The director is particularly interested in the C++ portfolio. She decides to assess quality by analyzing several quality characteristics for each program in a randomly selected sample of programs taken from the portfolio. The CIO is prepared to support and fund a quality assurance program if software quality is unacceptably low.
The director has recruited you to help with data analysis. You have been asked to analyze failure rate. The director explains that failure rate is the number of failures per unit of time. Failure rate is a well-known quality measure that is appropriate for this software portfolio.
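As a worked illustration of the definition above, failure rate can be written as a simple ratio. The example figures are those quoted for program pjprsuit later in this assignment; treating the hour as the unit of time is an assumption based on the Execution Time (Hours) field.

```latex
% Failure rate as failures per hour of execution (unit of time assumed to be hours)
\[
  \text{failure rate} = \frac{\text{number of failures}}{\text{execution time (hours)}}
\]
% Example with the pjprsuit figures: 279 failures in 9000 hours
\[
  \frac{279}{9000} = 0.031 \ \text{failures per hour}
\]
```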
The following details are available for each program in the sample:
- Program Name; columns 1-6
- Execution Time (Hours); columns 7-10
- Number of Failures; columns 11-13
You have been asked to help with the analysis of these data.
The requirements for your analysis are detailed below:
- Write a SAS program to analyze these data. In particular, your program should analyze failure rate. (50%)
Your program should accomplish the following:
- Access your data from an external file.
- Compute failure rate (if necessary, see "DATA step statements", point 6, SAS Review).
- Execute the PRINT and MEANS procedures with appropriate options.
- For PROC PRINT, be sure to use labels for column headings (e.g. Execution Time, Number of Failures, Program Name, Failure Rate). Use meaningful names. You should also generate an appropriate title for your output.
Note: If necessary, see the SAS program used for the in-class SAS demonstration.
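The program requirements above could be sketched roughly as follows. This is only an outline under stated assumptions, not the in-class demonstration program: the file name programs.dat, the data set name quality, the variable names, and the title text are all hypothetical, and the column positions follow the field layout listed earlier.

```sas
/* Sketch only: the file name, data set name, and variable names are assumptions. */
data quality;
    infile 'programs.dat';               /* hypothetical external data file    */
    input name     $ 1-6                 /* Program Name, columns 1-6          */
          exectime   7-10                /* Execution Time (Hours), cols 7-10  */
          failures   11-13;              /* Number of Failures, cols 11-13     */
    failrate = failures / exectime;      /* failures per hour of execution     */
    label name     = 'Program Name'
          exectime = 'Execution Time (Hours)'
          failures = 'Number of Failures'
          failrate = 'Failure Rate (Failures per Hour)';
run;

title 'Failure Rate Analysis of the C++ Portfolio';

/* LABEL makes PROC PRINT use the labels as column headings */
proc print data=quality label;
    var name exectime failures failrate;
run;

/* MAXDEC=3 reports the statistics to three decimal places */
proc means data=quality n mean std min max maxdec=3;
    var failrate;
run;
```

The MEAN and STD statistics requested from PROC MEANS are the sample values that would serve as the estimates called for in the report.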
- Write a short report (no more than a couple of paragraphs) discussing your findings. (50%)
Your report should address the following:
- Provide an estimate of the population mean (to 3 decimal places). That is, the mean failure rate for the portfolio.
- Provide an estimate of the population standard deviation (to 3 decimal places). That is, the standard deviation of failure rate for the portfolio.
- The CIO has asked you to address the following in your report:
- Program pjprsuit is considered to be problematic by several staff members. That is, they contend that it fails more frequently than the average program in the portfolio. Program pjprsuit has failed 279 times in 9000 hours of execution. Determine its percentile rank with respect to failure rate and comment on this viewpoint.
- Determine the failure rate that distinguishes the least reliable programs from others in the portfolio. Assume that least reliable refers to the top 30% of the portfolio with respect to failure rate.
Note: To address these questions, assume that failure rate is normally distributed. Also, use your estimates of the relevant population parameters.
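Under the normality assumption in the note, both CIO questions reduce to standard normal calculations. The sketch below uses assumed notation: x is a program's failure rate, \bar{x} and s are the sample mean and standard deviation used as estimates of the population parameters, and \Phi is the standard normal cumulative distribution function. No numeric values for \bar{x} or s are given here because they depend on the sample data.

```latex
% Percentile rank of a program with failure rate x
\[
  z = \frac{x - \bar{x}}{s}, \qquad
  \text{percentile rank} = 100\,\Phi(z)
\]
% Cutoff separating the top 30% (least reliable) from the rest:
% the 70th percentile of the fitted normal distribution
\[
  x_{0.70} = \bar{x} + z_{0.70}\,s, \qquad
  z_{0.70} = \Phi^{-1}(0.70) \approx 0.524
\]
```

In SAS, the PROBNORM function evaluates \Phi and the PROBIT function evaluates \Phi^{-1}; a standard normal table gives the same values.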