Programming Assignment 2
CSC 323 Data Analysis and Statistical Software
Due: 2/25/99
Use simple linear regression methods to conduct a thorough analysis of the following dataset.
Benchmark Dataset:
Each observation consists of the following variables:
- CPU Type
- Architecture
- Benchmark Score - time in microseconds to make 25 "Tower of Hanoi" moves
- Clock Rating - MHz
Consider Benchmark Score to be the dependent variable and Clock Rating the independent variable.
- Your program should accomplish the following:
- Execute the PRINT procedure.
- Produce a scatterplot of the dependent variable vs. the independent variable.
- Generate estimates of your slope and intercept using the REG procedure.
- Generate residuals using the REG procedure.
- Produce a residual plot.
- Execute PROC UNIVARIATE for the appropriate variable and with the appropriate options.
Note: For PROC PRINT, be sure to use labels rather than variable names for the column headings. Use meaningful names for datasets and variables. Give the output of each procedure an appropriate title.
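The steps above can be sketched as a program skeleton. This is only a rough outline under stated assumptions, not a model solution: the file name benchmark.dat, the dataset names benchmark and bench_resid, and all variable names are hypothetical, the input layout (blank-delimited, no embedded blanks in the character fields) is assumed, and the choice of the residuals as the variable for PROC UNIVARIATE is one plausible reading of "the appropriate variable".

```sas
/* Sketch only: file name, dataset names, and variable names are hypothetical. */
data benchmark;
    infile 'benchmark.dat';                 /* assumed blank-delimited text file */
    input cpu_type $ architecture $ score clock_mhz;
    label cpu_type     = 'CPU Type'
          architecture = 'Architecture'
          score        = 'Benchmark Score (microseconds)'
          clock_mhz    = 'Clock Rating (MHz)';
run;

title 'Benchmark Dataset';
proc print data=benchmark label;            /* LABEL prints labels as column headings */
run;

title 'Benchmark Score vs. Clock Rating';
proc plot data=benchmark;
    plot score*clock_mhz;                   /* vertical*horizontal */
run;

title 'Simple Linear Regression of Score on Clock Rating';
proc reg data=benchmark;
    model score = clock_mhz;                /* slope and intercept estimates */
    output out=bench_resid p=predicted r=residual;   /* save fitted values and residuals */
run;
quit;

title 'Residuals vs. Predicted Values';
proc plot data=bench_resid;
    plot residual*predicted;
run;

title 'Distribution of Residuals';
proc univariate data=bench_resid normal plot;   /* normality tests and text plots */
    var residual;
run;
```

PROC PLOT produces line-printer plots; PROC GPLOT could be substituted where SAS/GRAPH is available.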
- Write a short analysis (no more than 2 pages) of the output of your SAS program. Your analysis should (at least) address the following:
- State the regression model including symbols for all of its parameters.
- Give the estimates of the model parameters.
- State the regression equation.
- Interpret the coefficients of the regression equation.
- Interpret the r-square value from the 'Analysis of Variance' section of your output.
- Determine the correlation between your dependent and independent variables.
- Comment on the model assumptions.
Note: A sample analysis is available.
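As a reminder of the quantities the analysis refers to, the simple linear regression model can be written as below. The notation is an assumption (the course text may use different symbols): Y is Benchmark Score, X is Clock Rating, and n is the number of observations.

```latex
\begin{align*}
Y_i &= \beta_0 + \beta_1 X_i + \varepsilon_i, \qquad i = 1, \dots, n
  && \text{(model; errors } \varepsilon_i \text{ independent, mean 0, constant variance)} \\
\hat{\beta}_1 &= \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (X_i - \bar{X})^2},
\qquad \hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X}
  && \text{(least-squares estimates)} \\
\hat{Y} &= \hat{\beta}_0 + \hat{\beta}_1 X
  && \text{(regression equation)} \\
r &= \operatorname{sign}(\hat{\beta}_1)\,\sqrt{R^2}
  && \text{(correlation between } X \text{ and } Y\text{)}
\end{align*}
```

The last line holds only for simple linear regression, where R-square equals the squared correlation between the two variables and the correlation takes the sign of the estimated slope.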