A review of the statistical principles of data analysis using computerized statistical analysis procedures provided by the Statistical Analysis System (SAS). Statistical methods reviewed and applied include graphical displays (density estimation), univariate analyses, multiple regression, collinearity diagnostics, influence diagnostics, data-dependent model biases, analysis of contingency tables and categorical data, logistic regression for qualitative responses, analysis of variance and covariance, and the general linear model. Each week a statistical method is reviewed and sample analyses are presented in SAS listings. Each week a data analysis project is also assigned, requesting that specific statistical analyses be performed and that the results be presented and interpreted in a typed statistical report. Each student is also required to complete an independent data analysis project. Prerequisites: 1) Stat 118, and 2) either Stat 157 or 201, and 3) Stat 183 or equivalent or proficiency with SAS.
Below are two SAS programs and their corresponding log and listing outputs, one for "st210a_d.sas" and the other for "st210a_u.sas". The latter uses the qqplot.sas macro that is included in the section below.
Below are the SAS file, SAS log, and listing files for lecture 4.

Below are the SAS files, SAS logs, and listing files for lecture 5. There are two sets of files.
The current versions of these files were uploaded on 2/23/04, the day of the lecture. They include a new macro, "resplot", that generates the variance plots within deciles of the predicted values.


A_R5 is a continuation of the A_R4 listing that was placed on the website for lecture 4.
A_M is a new listing for a multiple regression analysis. It will be used in lectures 5 and 6.
The following is an unpublished manuscript on identifying synergism and antagonism in regression models. Please bring the tables to class, since I will use them at the beginning of lecture 6. You can read the paper later if you wish.
A_c is a new listing showing the use of dummy variables for a categorical variable in a regression model.
The following listings for A_I.sas will be used in lecture 7 to describe collinearity and influence diagnostics for a regression model.
The following listings for stat21B.sas will be used in lecture 8 to describe stepwise regression and cross validation.
The following listings for sH_a.sas will be used to provide an introduction to logistic regression models.
The following listings describe value added plots in logistic regression, stepwise models, and cross validation.
The following listings describe binomial regression and interaction models.
The following listings describe analysis of variance.
The following listings describe unbalanced analysis of means (unbalanced ANOVA).
Here is exercise 9 and its SAS file. You will have to change the data set path before running the SAS program.
The following listings describe analysis of covariance.
The following listings describe analyses of repeated measures.
