Professional Documents
Culture Documents
Regresion Logistica en Ed Superior
Regresion Logistica en Ed Superior
Author(s): Chao-Ying Joanne Peng, Tak-Shing Harry So, Frances K. Stage and Edward P.
St. John
Source: Research in Higher Education, Vol. 43, No. 3 (Jun., 2002), pp. 259-293
Published by: Springer
Stable URL: http://www.jstor.org/stable/40196455
Accessed: 13-08-2016 13:13 UTC
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
http://about.jstor.org/terms
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted
digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about
JSTOR, please contact support@jstor.org.
Springer is collaborating with JSTOR to digitize, preserve and extend access to Research in Higher
Education
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
Research in Higher Education, Vol. 43, No. 3, June 2002 (© 2002)
This article examines the use and interpretation of logistic regression in three leading
higher education research journals from 1988 to 1999. The journals were selected
because of their emphasis on research, relevance to higher education issues, broad
coverage of research topics, and reputable editorial policies. The term "logistic re-
gression" encompasses logit modeling, probit modeling, and tobit modeling and the
significance tests of their estimates. A total of 52 articles were identified as using
logistic regression. Our review uncovered an increasingly sophisticated use of logistic
regression for a wide range of topics. At the same time, there continues to be confu-
sion over terminology. The sample sizes used did not always achieve a desired level
of stability in the parameters estimated. Discussion of results in terms of delta-Fs
and marginal probabilities was not always cautionary, according to definitions. The
review is concluded with recommendations for journal editors and researchers in
formulating appropriate editorial policies and practice for applying the versatile logis-
tic regression technique and in communicating its results with readers of higher edu-
cation research.
KEY WORDS: logistic regression; logit; probit; tobit; delta-P; marginal probability; odds
ratio; higher education research; multivariate statistics.
Since 1988, research using logistic regression has been published with increasing
frequency in three leading higher education journals: Research in Higher Educa-
tion, The Review of Higher Education, and The Journal of Higher Education.
Yet, there is great variation in the presentation and interpretation of results in
these publications, which can make it difficult for readers to understand and
compare the results across articles. A systematic review of articles that have
used logistic regression not only promotes the learning about this method, but
Chao-Ying Peng and Tak-Shing Harry So, Indiana University-Bloomington. Frances K. Stage,
New York University. Edward P. St. John, Indiana University-Bloomington.
Address all correspondence to: Chao-Ying Joanne Peng, Department of Counseling and Educa-
tional Psychology, School of Education, Room 4050, 201 N. Rose Ave., Indiana University,
Bloomington, IN 47405-1006; peng@indiana.edu.
259
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
260 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 261
where y is the estimated outcome variable value for the true K, b0 is the consta
of the equation, bh . . . , bp are estimated parameters corresponding to predicto
values xu . . . , xp, b0 is alternatively called the K-intercept; bx, . . . , bp are slopes
regression coefficients, or regression weights.
One method used by statisticians to estimate parameters is the least square
method. The values obtained under the least squares method are called lea
squares estimates. In the case of categorical outcome variables (such as whethe
a high school graduate matriculated into college), the linear regression model is
inadequate. For one thing, the plot of such data appears to fall on parallel lines
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
262 PENG, SO, STAGE, AND ST. JOHN
/ p \
In
Hence,
ea+(k
Probability (Y = outcome of interest | X = x) = P =
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 263
1.0-- ^ -
0.9 - - /^^ ^
0.8 - ■ /
0.7 ■ - /
P °6 - /
I P °'5
°- 0.4- /
0.3 - /
0.2 ■ ■ j/
0.1 - - ^^^^
0.0
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
264 PENG, SO, STAGE, AND ST. JOHN
1. Unless P = 0, the binary logistic regression maps the regression line onto the
interval (0,1) which is compatible with the logical range of probabilities.
2. The regression line is monotonically increasing if p > 0, and monotonically
decreasing if P < 0.
3. The function takes the value of 0.5 at x = -a/p and is symmetric to the point
of (-a/p, 0.5).
Therefore,
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 265
P, = Prob(K = 1 | X),
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
266 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 267
[o]nce we have ascertained that each of the continuous variables is in the correct scale,
we begin to check for interaction in the model
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
268 PENG, SO, STAGE, AND ST. JOHN
Diagnostic Statistics
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 269
Delta-P statistic, marginal probability, and odds ratio are commonly used to
express logistic regression results. These three distinct concepts are not inter-
changeable. The delta-P statistic was popularized by an article written by Pe-
terson (1985) in which the concept of delta-P was mathematically defined. Ac-
cording to Peterson, the delta-P means the change in probability as a result of
a unit change in an independent predictor, say Xy. Thus, delta-P (AP) equals
The quantity L, represents the logit after a 1-unit change in Xy, whereas Lq
represents the logit before the 1-unit change in Xy. Both L, and Lq are computed
from Equation 4 as a result of a one-unit change in Xy while holding constants
for all other predictors. It is evident from Equation 8 that delta-P is a function
of both L, and Lq. In other words, the magnitude of delta-P is not a constant but
rather a variable for the entire range of Xj. If X; is continuous, the delta-P
changes as a function of both ft (the regression coefficient of Xj) and Lq. If X,
is categorical, the magnitude of delta-P depends on ft and the reference category
of Xj. Equation 5 shows that the relationship is not linear between the probability
of observing the outcome of interest and the predictors. Thus, it is incorrect to
convert a regression coefficient, ft, to delta-P and then interpret the delta-P
value without regard to the value of Lq (in the case of a continuous predictor)
or the reference category (in the case of a categorical predictor).
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
270 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 271
1.000 -• ^s
(Marginal Pr
0.925 -■ I
0.700 - | |
--I
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
272 PENG, SO, STAGE, AND ST. JOHN
4. What modeling technique was used to derive the model? What indexes/coef-
ficients were used in assessing the model? What statistical package was used
in deriving the model and its related statistics?
5. Was any possible interaction or confounding examined?
6. Was diagnostic analysis of outliers performed? If so, what type of data was
used?
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 273
Among the 52 articles reviewed, only one by Dey and Astin (1993) perfor
both the logit and probit analyses. Tobit modeling was used by Bivin
Rooney (1999); it was mentioned in Lindahl and Winship (1994), thoug
used. The remaining 50 articles applied the logistic (or logit) model to dat
these, three implemented polytomous logistic modeling.
Guided by the eight questions posited earlier, we reviewed each article
pendently. Results for each question are as follows.
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
274 PENG, SO, STAGE, AND ST. JOHN
/-I ^^
.2
fell? 3? ^ II
I 5 1| - If8f ^ «S
£ o -S 8 °°- n <» -i <n ** ^ °. 2 P5 < b s 5 ^ 'S g § 2
On Qs
On On
2P ^ c
On -8 * I «« 85 I ^O
I ^ ~-S r j I 5 1 p| ^
g i n = i -a i I 1 iS £|
O< < <0Q pQ pQ PQ {J U (J QQ Q£
^^ (N cn-<t «n vd r^ 06 on o ^cn cnrt
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 275
T I ^ ^ f I f I f I ? I ? I ? I T I
oo^ c^ONm^»n^:»n»Ort^ooo6 m *CJ
^ in
oo - T
in
^ « 'a, "a. ^ os
^ 00 Tt ^
- oo »n
(N »n
\D in
oo
w ON
§
8-g
*ft-
1 Si
G^ ^ ^ -^ ^ jT
i I 2 2^|i
~ « •§■ g I % I
!~ ~ .1 i S *
g I 15 I 8 i§ I ~ 5 S I 1 1
iEE O
1 OOl O I IX
I I I iDC
X i 1!
3 DC 3 3SI S
3 J I JZI O
S I S Z s j O
^2£ Di222i^?i?i?S^jqSSi S
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
276 PENG, SO, STAGE, AND ST. JOHN
^'3<n£
O °3
°°. ^ w
CN
S ivo
5 i co
£ i I£I i II
SiSTSi
III
•S O ^ °3 CN d - w vo
•| '•§ S ^ 5 ^ ^ ® co^-
s I
i
s 2; i On § c« i C/5
2; On c« C/5
1 1 1 i ^
i 8 § |'a s ^ ^ | =5
00 •> N*^/ ^""^ ^_^ \^^ ^ Qq
j J s S 1 I I? 5 ? ? 5 5
^on d^cM'cn^-wSvdr^oooNd
<.c* cncnmcnmcnencncnen^
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 277
"~" oC ^ .2
od <*\ 5
2
« 1
!k « oc £ 8* "° ^ ^X n £ 2 I
* I 1
3R33S 2£5iS»2ilS6I3isElas| ||
s"a "it5 "s B sss f ;«
"s i-li
* If? §
1 £ I -s *
~ 3 o 1 e a
s; § * t | .ill
1 - i i * ^ ©tf -pis
=«2^ ur ^ _ C C « 1. b S E
sis < § 2 •» § js S||i
*1
* ^ 8*
E 85*
« | .sI.s .s
s2 1-1
(j ! a ^3
I I I I 111 1 1 II I Jiu
^rjro^t invdr^ oo onO- -r4 ►§"§££
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
278 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 279
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
280 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 281
The results of logistic regression were presented in five different yet related
formats: the logistic regression equation (the most popular format by 36 articles,
or 69.23%), the delta-P statistic by 20 articles (38.46%), predicted probabilities
or marginal probability by 9 articles (17.31%), odds ratio by 5 articles (9.62%),
and logit by 3 articles (5.77%). None used relative risk or the kappa index that
are more popular in other fields of study, such as health-related fields. Most
articles used more than one expression to report their findings.
The logistic regression equation reported is similar to Equation 4 in which
logits of the dependent variable were linearly related to a set of predictors. Six out
of 36 articles did not include standard errors of slope coefficients. Two out of
the six articles plus another four, by different authors, did not include the
F-intercept for the unstandardized logistic regression equation.1 The absence of
the standard error for the parameter estimate means that readers will not be able
to assess the stability of these parameter estimates. The instability of parameter
estimates can be caused by the low observation to predictor ratio, as we ad-
dressed previously. Missing this piece of information undermined the utility of
the logistic regression results.
Similarly, by not including the K-intercept in an unstandardized equation, the
authors of six articles offered incomplete information. Consequently, readers
will not be able to revalidate the result with another sample or at another time/
place. This, too, undermined the utility of logistic regression findings.
The delta-P statistic was the second most used statistic in reporting logistic
regression results. As Equation 5 demonstrates, the relationship is not linear
between the probability of observing the outcome of interest and the predictors.
It is therefore incorrect to convert a regression coefficient, Py, to delta-P and
then interpret the delta-P value without regard to the value of Lq (in the case of
a continuous predictor) or the reference category (in the case of a categorical
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
282 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 283
Among the multivariate textbooks, three references were cited most fre-
quently: Aldrich and Nelson (1984), Fienberg (1983 or earlier editions) and
Hanushek and Jackson (1977). These highly appraised textbooks provide reputa-
ble information on logistic regression. Five papers published by higher education
researchers (Cabrera, 1994, St. John, Kirshstein, and Noell, 1991; Stage, 1990;
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
284 PENG, SO, STAGE, AND ST. JOHN
1. Logistic regression was conducted in several studies with sample sizes that
may not have achieved a reasonable level of stability. Although differences
in opinions by multivariate statisticians do not allow one to state a precise
standard for the observation/predictor ratio, seven studies (13.46%) had low
observation to predictor ratios - lower than standards recommended by at
least four textbooks.
2. Seven studies explored the need to transform scales of predictors in order to
improve the fit of a model. In several studies, the construct underlying a
predictor was measured by a single item on a 5-point Likert scale. Such
measurements lack the variability to relate a predictor to the outcome vari-
able, whether the latter is expressed in probabilities or logits.
3. Categorical predictors were typically recoded into a set of dummy variables
before they entered logistic regression models. All studies but one trans-
formed categorical predictors correctly. A majority of these studies labeled
the reference category or group clearly so that readers may correctly interpret
regression coefficients of the dummy variables.
4. More than half of the studies applied direct modeling to derive logistic mod-
els. Less than one eighth of studies performed stepwise logistic modeling - a
procedure notorious for capitalizing on random errors in data and for which
larger samples are required.
5. All studies but one assessed the adequacy of logistic regression models in
terms of statistical tests of individual parameters, of the overall model, of the
goodness-of-fit statistic, or validities of predicted probabilities. Most studies
provided multiple statistics as the evidence of models' fit. However, several
of these statistics were calculated incorrectly by commercially available sta-
tistical packages. Information on statistical packages was disclosed only in a
handful of studies. One study crossvalidated the logistic regression result;
none performed the diagnostic analysis.
6. Interactions were typically handled by subgroup analyses when, in fact, they
could have been directly specified in the logistic regression model by multi-
plicative terms.
7. Several studies employed data that contained extremely large or small base-
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 285
In light of our review regarding the use and interpretation of logistic regres-
sion, leading research journals in higher education are in a position to help guide
researchers in using logistic regression techniques. With the wide availability of
sophisticated statistical software installed on high-speed computers, the antici-
pated use of this technique appears to be increasing. That potential expanded
usage can move higher education research toward the elimination of future mis-
uses by bringing clarity to the reporting of results and by avoiding the pitfalls
that have been identified. The following recommendations emerged from our
review:
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
286 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 287
1. Adams, J. L., and Becker, W. E. (1990). Course withdrawals: A probit model and
policy recommendations. Research in Higher Education 31(6): 519-538.
2. Andrieu, S. C, and St. John, E. P. (1993). The influence of prices on graduate
student persistence. Research in Higher Education 34: 399-419.
3. Antony, J. S. (1998). Personality-career fit and freshman medical career aspirations:
A test of Holland's theory. Research in Higher Education 39(6): 679-698.
4. Belfield, C. R., Bullock, A. D., and Fielding, A. (1999). Graduates' views on the
contribution of their higher education to their general development: A retrospective
evaluation for the United Kingdom. Research in Higher Education 40(4): 409-438.
5. Bers, T. (1994). English proficiency, course patterns, and academic achievements of
limited-English-proficient community college students. Research in Higher Educa-
tion 35(2): 209-234.
6. Bivin, D., and Rooney, P. M. (1999). Forecasting credit hour. Research in Higher
Education 40(5): 613-632.
7. Bruggink, T. H., and Gambhir, V. (1996). Statistical models for college admission
and enrollment: A case study for a selective liberal arts college. Research in Higher
Education 37: 221-240.
8. Cabrera, A. F., Stampen, J. O., and Hansen, W. L. (1990). Exploring the effects of
ability to pay on persistence in college. Review of Higher Education 13(3): 303-306.
9. Cleveland-Innes, M. (1994). Adult student drop-out at post-secondary institutions.
The Review of Higher Education 17: 423-445.
10. Cooksey, E. C, and Rindfuss, R. R. (1994). Prior activities and progress in MBA
programs. Research in Higher Education 35: 647-668.
11. Delucchi, M. (1997). "Liberal Arts" colleges and the myth of uniqueness. The Jour-
nal of Higher Education 68(4): 414-426.
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
288 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 289
36. St. John, E. P. (1990b). Price response in persistence decisions: An analysis of the
high school and beyond senior cohort. Research in Higher Education 31: 387-403.
37. St. John, E. P. (1991). What really influences minority attendance? Sequential analy-
sis of the high school and beyond sophomore cohort. Research in Higher Education
32: 141-158.
38. St. John, E. P. (1999). Evaluating state students grant programs: a case study of
Washington's grant program. Research in Higher Education 40(2): 149-169.
39. St. John. E.P., Andrieu, S., Oescher, J., and Starkey, J. B. (1994). The influence of
student aid on within-year persistence by traditional college-age students in four-
year colleges. Research in Higher Education 35(4): 455-480.
40. St. John, E. P., and Noell, J. (1989). The effects of student financial aid on access
to higher education: An analysis of progress with special consideration of minority
enrollment. Research in Higher Education 30: 563-581.
41. St. John, E. P., Kirshstein, R. J., and Noell, J. (1991). The effects of student financia
aid on persistence: A sequential analysis. The Review of Higher Education 14(3):
383-406.
42. St. John, E. P., and Starkey, J. B. (1995). An alternative to net price: Assessing the
influence of prices and subsidies on within-year persistence. The Journal of Higher
Education 66(2): 156-186.
43. St. John, E. P., Paulsen, M. B., and Starkey, J. B. (1996). The nexus between college
choice and persistence. Research in Higher Education 37: 175-220.
44. Stage, F. K. (1988). University Attrition: LISREL with logistic regression for the
persistence criterion. Research in Higher Education 28(4): 343-357.
45. Strenta, A. C, Elliott, R., Adair, R., Matier, M, and Scott, J. (1994). Choosing and
leaving science in highly selective institutions. Research in Higher Education 35:
513-547.
46. Tinto, V. (1997). Classrooms as communities - Exploring the educational character
of student persistence. The Journal of Higher Education 68(6): 509-623.
47. Tornquist, K. M, and Hoenack, S. A. (1996). Firm utilization of university scientific
research. Research in Higher Education 37: 509-534.
48. Toutkoushian, R. K. (1994). Using citations to measure sex discrimination in faculty
salaries. The Review of Higher Education 18(1): 61-82.
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
290 PENG, SO, STAGE, AND ST. JOHN
ENDNOTES
1 . Specific evaluative information on each article may be obtained from the first author.
2. Concerning the use of delta-P, Long (1997) rightly points out that 'The amount of discret
change in the probability for a change in X, [i.e., delta-P] depends on (1) the amount of change
in Xjy (2) the starting value of X;, and (3) the values of all other predictors in the model. F
example, if we have two predictors, X] and X2, the change in Probability (K= 1 | X,, X2) wh
X, changes from 1 to 2 does not necessarily equal the change [in probability] when Xx goes fro
2 to 3
does not necessarily equal the change [in probability] when X2 = 2. Thu
is choosing which values of the variables to consider and how much to
3. One illustration of real cases was found in Stage (1989) in which tw
of ethnicity, academic integration score, social integration score, and
were first presented. Their predicted probabilities (which should h
predicted using the logistic regression equation formulated in the stud
come (persisting versus dropping out from the university) was com
probabilities. Another illustration of the predicted probabilities for hy
found in Leppel (1993). These hypothetical individuals were suppos
points on SAT and lived either within 10 miles, 10-50 miles, or mor
university which they had been accepted into. Their predicted probabil
at that university were calculated from a logistic regression model
related to the distance from that school, according to a gravity model
4. According to Sedaie (1998, p. 352, footnote a of Table 6), "For each
dummy variables, marginal probabilities were calculated as the effect
increase in the regressor above its sample mean while keeping all other
tive sample averages. For the dummy variables, marginal effects we
in the probability of each outcome when the dummy variable takes ea
keeping all other variables at their respective sample means." By Okun
(1997, p. 208, footnote b of Table 2), marginal probability is "compute
tion density function value (evaluated at the means of the regressor v
parameter estimate."
REFERENCES
Afifi, A. A., and Clark, V. (1990). Computer- Aided Multivariate Analysis, 2nd ed
York: Van Nostrand Reinhold.
Agresti, A. (1990). Categorical Data Analysis. New York: Wiley.
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 291
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
292 PENG, SO, STAGE, AND ST. JOHN
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms
USE AND INTERPRETATION OF LOGISTIC REGRESSION 293
Press, S. J., and Wilson, S. (1978). Choosing between logistic regression and discrimi-
nant analysis. Journal of the American Statistical Association 73: 699-705.
Pugh, R. C, and Hu, Y. L. (1991). Use and interpretation of canonical correlation analy-
ses in Journal of Educational Research articles: 1978-1989. Journal of Educational
Research 84(3): 147-152.
Ryan, T. P. (1997). Modern Regression Methods. New York: John Wiley and Son, Inc.
SAS Institute Inc. (1999). SAS/STAT User's Guide, Version 8, Volume 2. Cary, NC:
SAS Institute Inc.
Scott, K. G., Mason, C. A., and Chapman, D. A. (1999). The use of epidemiological
methodology as a means of influencing public policy. Child Development 70(5):
1263-1272.
Silverman, R. J. (1985). Higher education as a maturing field? Evidence from referencing
practices. Research in Higher Education 23(2): 150-183.
Soderstrom, I. R., and Leitner, D. W. (October 1997). The effects of base rate, selection
ratio, sample size, and reliability of predictors on predictive efficiency indices associ-
ated with logistic regression models. Paper presented at the annual meeting of the
Mid- Western Educational Research Association, Chicago, II.
SPSS Inc. (1999). SYSTAT® 9.0 Statistics I. Chicago, IL: SPSS Inc.
Stage, F. K. (1990). LISREL: An introduction and applications in higher education. In
J. C. Smart (ed.), Higher Education: Handbook of Theory and Research, Vol. VI, pp.
427-466. New York: Agathon Press.
St. John, E. P., Kirshstein, R. J., and Noell, J. (1991). The effects of student financial
aid on persistence: A sequential analysis. The Review of Higher Education 14(3):
383-406.
Tabachnick, B. G., and Fidell, L. S. (1983). Using Multivariate Statistics. New York:
Harper Collins.
Tabachnick, B. G., and Fidell, L. S. (1996). Using Multivariate Statistics, 3rd ed. New
York: Harper Collins.
Thorndike, R. M. (1978). Correlational Procedures for Research. New York: Gardner
Press.
Tobin, J. (1958). Estimation of relationships for limited dependent variables. Economet-
rica 26(1): 64-85.
Weiler, W. C. (1987). An application of nested multinomial logit model to enrollment
choice behavior. Research in Higher Education 27: 273-282.
Weiler, W. C. (1989). A flexible approach to modeling enrollment choice behavior. Eco-
nomics of Education Reviews 8(3): 277-283.
This content downloaded from 128.143.23.241 on Sat, 13 Aug 2016 13:13:13 UTC
All use subject to http://about.jstor.org/terms