Professional Documents
Culture Documents
Crosstabulation A Tutorial by Russell K. Schutt To Accompany
Crosstabulation A Tutorial by Russell K. Schutt To Accompany
CROSSTABULATION BASICS
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = ABANY
COLUMNS = SEX
CELLS = Observed [the default]
Examine the crosstabulation you have just created. Note the following
terms:
The BODY of the table is the part enclosed in the 2 columns and
2 rows corresponding to male/female and yes/no.
Crosstabulation Tutorial Page 2
The CELLS of the table are the little squares in the box created
by the unique combinations of the values (4 in this case).
Each cell represents cases with a unique combination of values
of the two variables, corresponding to that particular row and
column. The MARGINS of the table are on the right (the ROW
MARGINALS) and underneath (the COLUMN MARGINALS). The
marginal distributions are just the frequency distributions for the
two variables, considered separately. The LABELS and TITLE of
the table are necessary, and always must be included.
Exercises:
2) Percentage the above table on the column totals. Read the table
by comparing the distributions of column percents within the body of
the table. Does support for abortion vary by gender?
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = ABANY
COLUMNS = SEX
CELLS = Observed, Percentages (Column)
Crosstabulation Tutorial Page 3
3) Now redo the table, percentaging the table on the rows. Read
the tables by comparing each distribution of row percents within the
body of the table. Has anything changed from (2)?
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = SEX
COLUMNS = ABANY
CELLS = Observed, Row Percentages
CONDENSED CROSSTABS
DESCRIBING ASSOCIATION
Exercises:
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = HAPPY LIFE
COLUMNS = REGION RACE SEX DEGREE
CELLS = Observed, Column Percentages
Exercise:
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = VOTE08
COLUMNS = PARTYID
CELLS = Column Percentages
STATISTICS = Lamda
How would you describe the association between voting in the 2008
presidential election and party affiliation based on the percentage
table? What does the lambda statistic (use the value when Vote08 is
dependent) tell you about variation in likelihood of voting by political
party affiliation?
The PRE logic of gamma will not be explained here, but the
calculation of gamma is simple:
Exercise:
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = HAPPY, LIFE
COLUMNS = DEGREE
CELLS = Column Percentages
STATISTICS = Gamma
Chi-square
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = LIFE
COLUMNS = EDUCR
CELLS = Column Percentages
STATISTICS = Chi Square
Now request the table with intermediate results that are used to calculate chi-
square and explain what these figures mean.
ANALYZE-DESCRIPTIVE STATISTICS-CROSSTABS
ROWS = LIFE
COLUMNS = EDUCR
CELLS = Column Percentages, Expected Count, Observed Count,
Unstandardized residuals
STATISTICS = Chi Square
Do the cells with high and low residual values make sense to
you? The high residuals indicate cells in which there are more cases
than would be expected on the basis of chance, while the negative
residuals indicate cells in which there are fewer cases than would be
expected on the basis of chance.
table overall, the value of chi-square and the number of rows and
columns in the table (leading to the value of the degrees of freedom,
or “df”) can be used to estimate the probability that the association
was due to change. The statistical convention is that the association
in a table is unlikely to have been due to chance if p < .05 (i.e., if the
probability that it is due to chance is less than 5 out of 100). Is the
probability in this table less than .05?