Assignment 1: SPSS Discriminant Analysis

While regression analysis evaluates the ability of multiple predictor variables to predict values on a single continuous variable, discriminant analysis evaluates the ability of multiple predictor variables to predict classification on a single categorical variable. Discriminant analysis can also be viewed as the reverse of a MANOVA: In MANOVA, the IVs are the groups and the DVs are the predictors. In DA, the IVs are the predictors and the DVs are the groups. (In order to avoid semantic confusion, it’s easier to refer to IVs as the predictors—or discriminating variables—and to DVs as the grouping variables.)

The emphases of MANOVA and DA are different. While MANOVA seeks to find a linear combination of variables that will maximize the test statistic, DA is used to establish the linear combination of predictor variables that maximally discriminates among groups. DA is used to predict membership in naturally occurring groups and to determine whether a combination of variables can reliably predict group membership. Several variables are included in a study to see which ones contribute most to the discrimination between groups.

As with factor analysis, discriminant functions are identified through the analysis, but it remains for the researcher to provide a meaningful interpretation and labeling of these.

Open the Statistical Package for the Social Sciences (SPSS) data file created in M6: Assignment 1. Use the following variables:

Create a Grouping Criterion Variable: Transform the continuous data for Number of Previous Hospitalizations into categorical data by assigning patients to groups. Add a new variable to your data file that indicates which group each case falls into. For example, use the descriptive statistics and frequency information for the number of hospitalizations to decide the cut-off scores that define each group, and then assign each patient to Group 1 (lower number of hospitalizations), Group 2 (medium number of hospitalizations), or Group 3 (higher number of hospitalizations). Alternatively, you may consider quartiles, a median split, or division by standard deviation units. Justify your method.
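
The logic of a three-group split can be sketched outside SPSS. The short Python example below is only an illustration: the hospitalization counts are made-up values, the tertile cut points are one possible choice among those listed above, and the name assign_group is hypothetical. In SPSS itself the same result would be produced with a recode or visual binning step.

```python
from statistics import quantiles

# Hypothetical counts of previous hospitalizations for 12 patients
# (illustrative values only, not taken from the course data file).
hospitalizations = [0, 1, 1, 2, 2, 3, 3, 4, 5, 6, 8, 10]

# Two cut points that split the distribution into three parts (tertiles).
low_cut, high_cut = quantiles(hospitalizations, n=3, method="inclusive")


def assign_group(count):
    """Return 1 (lower), 2 (medium), or 3 (higher) for one patient."""
    if count <= low_cut:
        return 1
    if count <= high_cut:
        return 2
    return 3


# The new grouping variable, one value per case.
groups = [assign_group(c) for c in hospitalizations]

# Group sizes are worth checking: tied scores at a cut point
# can leave the groups unequal in size.
group_sizes = {g: groups.count(g) for g in (1, 2, 3)}
print("cut points:", low_cut, high_cut)
print("group sizes:", group_sizes)
```

Checking the group sizes after the split is part of justifying the method, because very small groups weaken the later classification results.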

Select five (or more, if justified) continuous variables to use as predictor variables for your analysis. Briefly justify your choices.

Conduct a discriminant analysis of these data. Use the same methods and choices found in the textbook’s sample study. Include tests for homogeneity of group variances.
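
As part of this analysis, SPSS reports a test of equality of group means for each predictor, with a univariate Wilks' lambda and an F ratio. The Python sketch below shows how those two values relate to the within-group and total sums of squares. The scores are invented for illustration and the function name univariate_wilks_lambda is hypothetical; it is a hand calculation for one predictor, not a replacement for the SPSS output.

```python
from statistics import mean

# Hypothetical scores on one predictor, keyed by group (illustrative only).
scores_by_group = {
    1: [10.0, 12.0, 11.0, 13.0],
    2: [14.0, 15.0, 13.0, 16.0],
    3: [18.0, 17.0, 19.0, 20.0],
}


def univariate_wilks_lambda(groups):
    """Return (lambda, F, df1, df2) for one predictor across groups."""
    all_scores = [x for scores in groups.values() for x in scores]
    grand_mean = mean(all_scores)

    # Total variability around the grand mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

    # Variability of each score around its own group mean.
    ss_within = 0.0
    for scores in groups.values():
        group_mean = mean(scores)
        ss_within += sum((x - group_mean) ** 2 for x in scores)

    k = len(groups)      # number of groups
    n = len(all_scores)  # total number of cases

    # Lambda is the share of variability NOT explained by group membership,
    # so smaller values indicate better separation of the groups.
    wilks = ss_within / ss_total
    f_ratio = ((1 - wilks) / wilks) * ((n - k) / (k - 1))
    return wilks, f_ratio, k - 1, n - k


wilks, f_ratio, df1, df2 = univariate_wilks_lambda(scores_by_group)
print(f"Wilks' lambda = {wilks:.3f}, F({df1}, {df2}) = {f_ratio:.2f}")
```

A lambda near 0 means the group means differ strongly on that predictor, while a lambda near 1 means the predictor contributes little to separating the groups.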

Save the SPSS file as R7034_M7_A1_LastName_FirstInitial.sav.

Prepare a two- to three-page (plus Appendix for tables) response, which presents a summary report of the following information:

• State a research question that could be studied using the specified variables for a discriminant analysis.
• Report the results of prescreens for missing data, multivariate outliers (Mahalanobis distance), univariate normality, and linearity (bivariate scatter plots). Indicate if any transformations or other decisions are required.
• In your Appendix, report group descriptive statistics, analysis of variance (ANOVA) summary tables, summary of steps, eigenvalues, Wilks’ lambda table, standardized discriminant function coefficients, canonical correlation or structure matrix, classification results, and discriminant function means.
• Summarize the results of the discriminant analysis, including an interpretation of discriminant functions. Compare the outcomes in terms of the research question.
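
The multivariate outlier prescreen listed above rests on the Mahalanobis distance of each case from the centroid of the predictors. The Python sketch below works through the two-predictor case by hand so the calculation is visible. The data are invented, the function name mahalanobis_squared is hypothetical, and the cut-off shown follows a common convention (chi-square critical value at p = .001 with degrees of freedom equal to the number of predictors), which should be checked against the textbook's sample study.

```python
from statistics import mean

# Hypothetical cases measured on two predictors (illustrative values only).
cases = [
    (2.0, 10.0), (3.0, 12.0), (4.0, 13.0), (5.0, 15.0),
    (6.0, 18.0), (7.0, 19.0), (8.0, 22.0), (3.0, 25.0),
]


def mahalanobis_squared(points):
    """Return each point's squared Mahalanobis distance from the centroid."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    n = len(points)
    mx, my = mean(xs), mean(ys)

    # Sample variances and covariance (n - 1 in the denominator).
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)

    det = sxx * syy - sxy ** 2
    if det == 0:
        raise ValueError("covariance matrix is singular")

    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # d' S^-1 d written out for the 2 x 2 covariance matrix.
        d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
        distances.append(d2)
    return distances


# Chi-square critical value for df = 2 at p = .001 (assumed convention).
CHI2_CRITICAL_DF2_P001 = 13.816

distances = mahalanobis_squared(cases)
flagged = [i for i, d2 in enumerate(distances) if d2 > CHI2_CRITICAL_DF2_P001]
print("squared distances:", [round(d2, 2) for d2 in distances])
print("flagged case indexes:", flagged)
```

With five or more predictors, as the assignment requires, the same idea applies with a larger covariance matrix and a critical value for the matching degrees of freedom. SPSS can save the distance for each case so it can be compared with that value.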

