Programming Assignment 3
CSC 323 Data Analysis and Statistical Software
Due: 3/12/98
Use simple linear regression methods to conduct a thorough
analysis of the following dataset.
- Benchmark Dataset:
Each observation consists of the following variables:
- CPU Type
- Architecture
- Benchmark Score - # of "Tower of Hanoi" moves in 25 microseconds
- Clock Rating - MHz
Consider Benchmark Score to be the dependent variable and Clock Rating the independent
variable.
Note: Follow this
link for other datasets. You may analyze any of these datasets. You
may also analyze any other dataset that is of interest to you but you must see me first.
Your analysis should address the following points. See the
sample analysis
for an example.
- Identify the regression model including symbols for all of its parameters.
- Conduct a residual analysis. You must state the hypotheses and p-value for the normality
test.
- Give the estimates of the model parameters, state the regression equation and interpret the
coefficients of the equation.
- Interpret the r-square value from the 'Analysis of Variance' section of your output.
- Determine the correlation between your dependent and independent variables.
- Summarize the results of your analysis.
Your program should accomplish the following:
- Generate estimates of your slope and intercept using the REG procedure.
- Produce a scatterplot of the dependent variable vs. the independent variable.
- Produce a residual plot.
- Execute PROC UNIVARIATE with the NORMAL and PLOT options for the residuals.
- Produce a normal plot of the residuals using the RANK and PLOT procedures.