Professional Documents
Culture Documents
Ados 1989
Ados 1989
2, 1989
Lynn Mawhood
MRC Child Psychiatry Unit, University ofLondon
E r i c Schopler
University of North Carolina
~This research was funded in part by grants from the Social Sciences and Humanities Research
Council of Canada, the Alberta Heritage Foundation for Medical Research, and the Medical
Sciences Institute to the first author. Susan Goode was supported by a grant from the Bethlem-
Maudsley Research Fund. Lynn Mawhood was supported by a postgraduate studentship funded
by the John D. and Catherine T. MacArthur Foundation. We acknowledge the work of Joyce
Magill, Deborah Dewey, and numerous members of the Department of Child and Adolescent
Psychiatry at the Institute of Psychiatry, Division TEACCH, and Glenrose Rehabilitation Hospi-
tal for their help in the development of this scale.
2Address all correspondence to Catherine Lord, Department of Psychology, Glenrose Rehabili-
tation Hospital, 10230-111 Avenue, Edmonton, Alberta T5G 0B7, Canada.
185
0162-32.57/89/0(~0,-0185506.00/0 9 1989 Plenum Publishing Corporation
186 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawhood, and Schopler
the quality o f behaviors. Interrater reliability for five raters exceeded weighted
kappas o f .55 f o r each item and each pair o f raters f o r matched samples o f
15 to 40 autistic and nonautistic, mildly mentally handicapped children (M
IQ = 59) between the ages o f 6 and 18 years. Test-retest reliability was ade-
quate. Further analyses compared these groups to two additional samples
o f autistic and nonautistic subjects with normal intelligence (M IQ = 95),
matched f o r sex and chronological age. Analyses yielded clear diagnostic
differences in general ratings o f social behavior, specific aspects o f commu-
nication, and restricted or stereotypic behaviors and interests. Clinical guide-
lines f o r the diagnosis o f autism in the draft version o f ICD-IO were
operationalized in terms o f abnormalities on specific A D O S items. An al-
gorithm based on these items was shown to have high reliability and dis-
criminant validity.
& Wolf, 1986; Volkmar et al., 1988). In general, discrimination has been
most clear when samples of autistic children with quite severe mental retarda-
tion have been compared to normally developing children (Wenar et al., 1986)
or to other nonautistic mentally handicapped children without specific
matching for intellectual level (Teal & Wiebe, 1986; Volkmar et al., 1986).
Because items associated not only with autism but with developmental ages
of less than 1 or 2 years were included in most of these scales (Wenar et al.,
1986), these schedules are less effective in identifying higher functioning au-
tistic children and adolescents than autistic children who are severely han-
dicapped (Krug et al.. 1980).
The ADOS differs from other scales in a number of ways. One of the
underlying purposes of the ADOS was to facilitate observation of social and
communicative features specific to autism rather than those accounted for
or exacerbated by severe mental retardation. The development of the ADOS
began with the goal of discriminating autistic children with mild or no men-
tal handicap across the age span of 6 years to young adulthood from age-
and IQ-matched normally developing and nonautistic mildly retarded in-
dividuals. Moreover, the focus of the ADOS is on social and communica-
tive behaviors. It is less comprehensive in its attention to specific autistic-type
movements, behavior difficulties, and sensory interests than many scales
(Krug et al., 1980; Siegel et al., 1986).
Two aspects of this social and communicative focus are most novel.
First, the ADOS is an interactive schedule. What is standardized in the ADOS
are the contexts that provide the background for all observations and, more
specifically, the behaviors of the examiner, not the sample. Social behavior
and communication generally involve more than one person. We wanted to
be able to take into account the examiner's actions in the objective evalua-
tion of the subject's behavior. In the ADOS, the examiner is considered a
participant/observer or confederate in a social experiment. He or she fol-
lows a set protocol that provides not just social but also contextual presses
for social behavior. "Press" is a term borrowed from Murray (1938). We use
it here without pejorative connotations to refer to aspects of the immediate
environment that have direct implications for the subject's behavior.
One critical issue was how to create dear presses for s o m e sort of social
or communicative behaviors to occur, without overtly structuring the social
aspects of the task so much that the quality of the behavior was predeter-
mined by the setting. The general format of the schedule is to encourage an
interaction that appears natural, during which preplanned "occasions" for
certain behaviors arise, with the imposed structure as invisible to the subject
as possible. However, in reality, this structure has been carefully determined
in terms of social tasks that are defined in detail by variations in cognitive
demands, in the type of materials, and in the behavior of the interviewer.
These standard situations thus provide comparable social stimuli for all
subjects.
188 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawhood, and Schopler
D E S C R I P T I O N OF T H E I N S T R U M E N T
whether the subject can engage in reciprocal play and whether he or she takes
some initiative in extending the interaction. Third, turn-taking and joint object
use are observed in a structured drawing game. Fourth, the subject is asked
to demonstrate through mime and gesture, as well as language, how to carry
out a familiar series of actions (e.g., how to brush one's teeth).
Along with the demonstration task, the remaining tasks are intended
to provide standardized contexts for the collection o f a language sample as
well as an assessment o f nonverbal behaviors. The fifth task involves the
description of a poster depicting a variety o f familiar activities (e.g., a shop-
ping center with people in and out of stores). In the sixth task, subjects
are presented with a book that portrays a simple story without any text. The
requirement is for the subject to tell a story, indicating in some way an un-
derstanding of events as they occur within a story fine. For the seventh task,
the examiner seeks to engage the subject in conversation about topics that
have arisen in the course of the interview. The eighth task consists of specific
questions about emotions and social relationships, and is designed to assess
the subject's ability to describe social and emotional situations and concepts
(Wolff & Barlow, 1979).
Behaviors targeted for observation in each task are coded as the inter-
view proceeds, with general ratings made immediately after the interview.
General ratings are provided for four areas (a) reciprocal social interaction,
Co) communication/language, (c) stereotyped/restricted behaviors, and (d)
mood and nonspecific abnormal behaviors. Finally, an overall autism rat-
ing is included to facilitate comparisons with the CARS (Schopler et al., 1986).
In most cases, general ratings are made on a 3-point ordinal scale, from
0 = within normal limits, to 1 = infrequent or possible abnormality, to 2
= definite abnormality. For a few items requiring categorical scoring, an
additional rating of 7 is used to indicate behavior in the same category that
is abnormal in a way not encompassed by the codings (e.g., pronoun errors
other than those involving first-person references, such as "they" for "it").
Scores are not strictly intended to judge severity but to allow room for some
uncertainty on the part o f the examiner. The same aspect o f the same be-
havior cannot be coded as abnormal more than once in different ratings,
although different aspects of the same behavior can result in an abnormal
code for more than one item. Ratings are described in terms of-principles,
supported by examples. For example, the scores for quality o f social over-
tures are as follows: 0 = integrates appropriate facial expression, gesture,
and vocaliTation to communicate social intentions; 1 = slightly odd quality
o f social overtures. Overtures may often be for personal demands or related
to own interests, but there is some attempt to involve the examiner in that
interest; 2 = inappropriate overtures that lack social quality and integration
of own and other's behavior. Includes subject bringing up preoccupations
Autism Diagnostic Observation Schedule 191
Method
Subjects
As shown in Table II, the subjects for the assessment of interrater relia-
bility were 20 autistic children and adolescents and 20 mentally han-
dicapped/lower IQ children and adolescents matched individually for
chronological age, verbal IQ (on WISC-R or WAIS-R; Wechsler, 1974; 1981)
and sex (12 male, 8 female). Verbal IQ was used for matching as an approxi-
mate index of the complexity of each subject's productive and receptive lan-
guage skills. All subjects were between 6 and 18 years of chronological age,
and between 50 and 80 full-scale IQ on the appropriate intelligence test.
Autistic subjects were recruited from three sources: 10 subjects from
Division TEACCH (North Carolina), 5 subjects from the Department of
Child and Adolescent Psychiatry, Institute of Psychiatry (London, England),
and 5 subjects from the Child and Family Psychiatric Unit, Glenrose Re-
habilitation Hospital (Edmonton, Canada). Prior to inclusion in the study,
all subjects were judged to meet Rutter (1978) and DSM III-R (APA, 1987)
criteria for autism by one of the authors (C.L. or M.R.) on the basis of per-
sonal contact. In addition, all subjects from North Carolina and Canada had
CARS scores over 30 (Schopler et al., 1986).
192 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawbood, and Schopler
Procedures
Except for the live interrater reliability assessment (where there were
two examiners), subjects were videotaped during the ADOS while alone in
a room with the examiner. The assessments took place in a variety of set-
tings including schools, homes, and clinics in all three countries. The sub-
jects and examiner sat next to or diagonally across from each other at a table,
with sufficient distance from the camera that both were in the picture. An
external microphone was placed on the table. The ADOS generally took 20-30
min to complete. The examiners were not usually blind to the diagnosis of
Autism Diagnostic Observation Schedule 193
the subject because subjects were often seen in residential and school place-
ments. However, all except live reliability statistics were computed using cod-
ings of videotapes made by raters who were blind to diagnosis.
During the live interrater reliability assessments, two examiners sat at
the table with the subject, with only the examiner currently interacting with
the child appearing on cmnera. Midway through the ADOS, the examiners
exchanged places, and the second examiner administered the second half of
the schedule. Both examiners scored the tasks whether or not they carried
them out, and both made notes throughout the interview, as well as scoring
the general ratings immediately after the scale's completion. In this case, be-
cause all consecutive referrals were seen (though not used in the study), the
examiners were blind to diagnosis. This procedure was decided upon when
agreement for practice ratings carried out live with one examiner administer-
ing the entire ADOS and the other examiner watching from inside the same
room was found to be closely similar to earlier ratings made from videotapes.
Test-retest subjects were seen by two different examiners in the same
location on two occasions, separated by 3 to 9 months. Alternate sets of
materials were used in the tasks for which they were available. Scores from
live ratings during both assessments were used.
Raters
Five persons (two from London, three from Edmonton) served as ex-
aminers and raters. Before this study was undertaken, all examiners had
worked together for over a year to devise the scale, modify its procedures
and codings, and standardize its administration.
A balanced incomplete block design was used to assess interrater relia-
bility (Fleiss, 1986). Four of the raters (all except C.L.) were each randomly
assigned 16-20 videotapes to rate, with the following constraints: Each rater
coded at least one subject in every cell, defined by diagnosis (autistic/mildly
retarded vs. mentally handicapped) and continent (UK vs. USA/Canada);
each pair rated 6-8 subjects in common; and no rater rated videotapes of
herself. Because the assignment of the examiners to subjects was not ran-
dom due to varying availability of children of different diagnoses in differ-
ent locations, assignment of raters to subjects could not be perfectly balanced.
An additional rater (C.L.) scored four tapes with each other rater, balanced
across diagnostic group and continent. Thus, the videotape of each subject's
ADOS was coded twice and, in some cases, three times (with C.L. always
as the third rater). Live ratings and test-retest administration and scoring
were carried out only by the three Edmonton raters.
194 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawhood, and $chopler
Results
Interrater Reliability
for intelligibility and unusual eye contact were made dichotomous by eliminat-
ing the 1 code completely. Codes of 1 for social distance and inappropriate
questions and statements were combined with 2's because of consistent overlap
between the two scores.
Reliability statistics for final versions o f items retained in the scale are
portrayed in Table III (tasks) and IV (general ratings). Overall, weighted kap-
pas for task items ranged from .61 to .92; weighted kappas for general rat-
ings ranged from .58 to .87. For the 10 pairs of raters, mean weighted kappas
across items ranged from .68 to .75 for each subject, with a grand mean of
.72 (SD = 0.01). Although the distribution of scores from 0 to 2 varied across
items and diagnostic groups, o f greatest diagnostic significance are disagree-
ments between 0 (i.e., normal) and 2 (i.e., definitely abnormal). On the whole,
such disagreements were very rare. They are reported for each item with the
reliability statistics in Table IV.
Test-Retest Reliability
Weighted kappas for test-retest reliabilities were adequate for all task
items (range .57-.84) and general ratings (range .58-.92). Mean weighted kap-
pas, combined across items for individual subjects, ranged from .58 to .92,
with no consistent differences across diagnostic groups.
Rater Bias
subjects. Using the final data sets reported in Tables IIr and IV, no signifi-
cant results were found.
VALIDITY STUDIES
Methods
Subjects
Results
m,
z~
^^ I I I I I I I I iX I XI I
<<
^ XXX XXXXXX I X I XX XX
.<
ii
o
rO
.<
A
<
,-]
I XX XXXXX I ~ I XXX Xk.
8
I:::
0
o
8 ..~ n
N
",~ Z ..~E!
~ ~
~ ~o l l ~ ~ j ~
o o
o o o ~ ~ r ~ o = Ou o ~=- ~
, ~ ' ~ ~,,.. ~ ~ o o
0 r 0"~ ~'~ ~'~ ~ ~"~ ~-'~
~'U'~.~ ~'~ ~ ~. ,~ = ~ o
Autism Diagnostic Observation Schedule 199
"a
E
II I I I I I I I ,'!
II
r-L
;=..,
E
II
:=
:E
X XX IX ~ I I I I I I
I=
'13 0
.u E
.9 o
~<
.=~
"a~
0
0
O.J
~v o
=
E
I
o~
~.= ~ ~.=~ ~
u 0
--,,, ;~, ~ ~ ~ .=
~..,~ ~ o,..~
9~ o . ~ ~o ~ ~,_~'~_, . . .
all items for which significant differences occurred between autistic samples,
the mildly retarded autistic sample was rated as more abnormal than the non-
retarded autistic group. Four general ratings showed significant differences
between autistic and nonautistic groups, but not between the autistic sam-
ples. These items were unusual use of eye contact, nonverbal communica-
tion linked with language, amount of overtures, and unusual preoccupations.
Two other ratings had significant differences within IQ ranges (i.e., between
normally developing and nonretarded autistic groups and between mentally
handicapped and autistic/mildly retarded groups) but not between the non-
retarded autistic group and the mentally handicapped subjects. These rat-
ings were level of nonechoed language and social distance. General ratings
for which no significant group differences were found were overactivity, at-
tention, negativism, overall distress, anxiety, and inappropriate cheerfulness.
Ratings of Tasks. Out of 12 task items, 3, describing a poster, asking
for help, and giving help to interviewer, yielded no significant specific group
comparisons. For two items, demonstration/mime and turn-taking, the only
group that differed significantly from all others was the autistic/mildly retard-
ed group. Two task ratings, reciprocal play and telling a story, discriminat-
ed autistic from nonautistic groups within the same IQ range, but did not
differentiate the nonretarded autistic group from the mentally handicapped
sample. These results are summarized in Table VI. Symbolic play, questions
about emotions, and questions about marriage each yielded significant differ-
ences between autistic and nonautistic groups, but not between the two autis-
tic samples. Finally, for two other socioemotional questions concerning
concept of friendship and quality of person, significant differences were iden-
tified between all groups.
Algorithm
Z
A XIIIIXIII
,- 0
~"
IXl I ~_o_.o
i~ I I XXXX g ~
O
I IX IXX IX IXX I
i
~r
O
.=.
I ='o" u
e~
~ o ~ ~.~
9- = ,, "~ .~,'EI ~'~o'
9~ c ~
....
~.~
o'~ o .~ ~ . ~ I ~' ~
I
a~
~0 ~ ~ 0 ,.-*,. ~-~1
"~ I ~ " ~ ' ~ ~ ~ ~ = = ~1"~ ~v v
202 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawhood, and Schopler
Validity of Algorithm
The discriminant validity of the algorithm was quite good; both the
social criteria and the communication criteria were indeed successful in
differentiating autistic subjects. Kruskal-Wallis ANOVAs yielded highly sig-
nificant differences, X2(3, N -- 80) -- 57.40 for social, 53.12 for communi-
cation, p < .0001. Paired comparisons 6O < .01) indicated significant
differences for scores for the social guideline, with the normally developing
group scoring lower than the mentally handicapped group, who scored low-
er than both autistic groups. Paired comparisons 6o < .01) for scores for
the communication/language guideline revealed significant differences for
all pairs of groups, including between the two autistic groups. These sum-
mary scores are depicted according to each guideline in Figures 1 and 2.
On the other hand, though criterion-related validity was highly signifi-
cant for the two guidelines considered together, Kruskal Wallis x2(3, N =
80) = 63.47, p < .0001, when cutoffs of 6 points for the social guideline
and 4 points for communication were used, not all individual subjects were
appropriately classified as autistic or nonautistic. As shown in Table VIII,
three nonretarded autistic subjects failed to meet the communica-
tion/language criteria for autism. For two of these subjects, summary scores
were 1 point below the cutoff. All three of these subjects had verbal IQs and
full-scale scores above 100, and scores on language tests (such as the Pea-
body Picture Vocabulary Test-Revised Version; Dunn & Dunn, 1981) above
a 7-year age equivalent.
In addition, two mentally handicapped subjects met criteria for autism
in both social and communication areas. These two children were quite differ-
ent from each other. One was an exceptionally active 12-year-old girl with mild
mental retardation who was very disinhibited. Her eye contact was unusual,
and she showed inappropriate social distance, very odd intonation, and repeti-
Autism Diagnostic Observation Schedule 203
20. [] Normal
9 Autistic - High
9 Autistic - Low
15
[] Mentally Handicapped
~-j 1084
"6
0 O0!i ojo
~s
20-
[] Normal
9 Autistic - High
t 9 Autistic - Low
15- [] Mentally Handicapped
O9
-a
z 5
ol is o
o-2 3"5 e'e 8-~2
Total Scores
Fig. 2. ADOS algorithm for communication/language:
distribution of scores by diagnostic group.
204 Lord, Rutter, Goode, Heemsbergen, Jordan, Mawhood, and Sehopler
Table VII. ADOS items for Algorithm Derived From Draft of ICD-10 Criteria
Criterion ADOS item
~Q
z oo
~5 0 0
.<
o~
8 ~
U
"d
t~
~ ~ "~ ~ ' ~ ,
~. "~ ~ ' ~
206 Lord, Rutter, Goode, Heemshergen, Jordan, Mawhood, and Sehopler
20, [] Normal
A u t i s t i c - High
9 Autistic - Low
15
Mentally Hsndicapped
o
@
~-3 lO 84
"6
JD
E
0 9
o
IJd I
Total Scores
:~
0
five language. The other subject was a painfully shy 14-year-old boy with mild
mental retardation. He was very restricted in his movements and nonverbal com-
munication. He had a particularly odd way of speaking. The rhythm and volume
of his speech were unusual and, in this situation, he had virtually no reciprocal
conversation. However, neither child was judged to be autistic on the overall
rating by any of three raters and, clinically, neither child would have even been
considered autistic-like in their social behavior. Thus, while statistical analyses
indicated significantly different distributions across groups and between each
pair of groups (Tukey, ps < .01) for combined scores of social and communi-
cation criteria, the assignment of individuals to diagnostic groups did not per-
fectly reflect clinical judgments.
Identifying restricted and stereotypic behaviors for some subjects was
difficult within the context of the ADOS, even though, once again, distribu-
tions across diagnostic groups were significantly different, Kruskal-Wallis x20,
N =80) = 48.11 p < .0001. In this case, the two nonautistic groups did not
differ from each other, but all other comparisons were significant. As shown
in Figure 3, for the autistic/mildiy retarded children, it was possible to observe,
during the ADOS, clear examples of sensory anomalies, preoccupations, or
autistic-type mannerisms (yielding a total score of 2 in 12 of 20 children. Seven
other autistic/mildly retarded children scored I, with only 1 autistic/mildly
retarded child showing no evidence of this type of behavior. On the other hand,
it was relatively unusual to observe (or at least recognize) compulsions or ritu-
als during the ADOS, even in the autistic/mildly retarded children.
Autism Diagnostic Observation Schedule 207
Restricted and stereotypic behaviors, and interests were seen with suffi-
cient clarity (i.e., received a total score of 2) in only 7 of 20 of the nonretarded
autistic subjects,~ with 6 nonretarded autistic subjects receiving a 1, and 7 more
receiving a 0, even though all of these children and adults were reported by
parents as regularly showing such behaviors in other contexts. Behaviors fall-
hag in this category occurred very rarely in the mildly mentally handicapped
and normally developing subjects.
The distribution of scoressummed across all three guidelines was also com-
pared across diagnostic groups, as portrayed in Figure 3, Kruskal-Wallis • (3,
N = 80) = 57.30, p < .0001. Again, all pairs of groups differed from each
other, with the distribution for the two autistic groups becoming slightly more
skewed.
Reliability of Algorithm
DISCUSSION
REFERENCES
American Psychiatric Association. (1987). Diagnostic and statistical manual of mental disord-
ers (3rd ed. rev.) Washington: Author.
Cicchetti, D. V., Lee, C. Fontana, A. F., & Dowds, B. N. (1978). A computer program for
assessing specific category rater agreement for qualitative data. Educational and Psy-
chological Measurement, 38, 805-813.
Cicchetti, D. V., & Sparrow, S. A. (1981). Developing criteria for establishing interrater relia-
bility of specific items: Applications to assessment of adaptive behavior. American Journal
of Mental Deficiency, 86, 127-137.
Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagree-
ment of partial credit. Psychological Bulletin, 70, 213-220.
Cohen, D. J., Caparulo, B. K., Gold, J. R., Waldo, M. C., Shaywitz, B., Ruttenberg, B. A.,
& Rirrdand, B. (1978). Agreement in diagnosis: Clinical assessment and behavior rating
scales for pervasively disturbed children. Journal of the American Academy of Child
Psychiatry, 17, 589-603.
Conger, A. J. (1980). Integration and generalization of kappa for multiple raters. Psychologi-
cal Bulletin, 88, 322-328.
Dunn, L. M., & Dunn, L. M. (1981). Peabody Picture Vocabulary Test-Revised: Manual for
Forms L and M. Circle Pines, MN: American Guidance Service.
FarreU, A. D., Mariotto, M. J., Conger, A. J., Curran, J. P., & WaUander, J. L. (1979). Self-
ratings and judges' ratings of heterosexual social anxiety and skill: A generaliTabilitystudy.
Journal of Consulting and Clinical Psychology, 47, 164-175.
Fleiss, J. (I986). The design and analysis of clinical experiments. New York; Wiley.
Freeman, B. J., Ritvo, E. R., Guthrie, D., Schroth, P., & Bail, J. (1978). The behavior obser-
vation scale for autism: Initial methodology, data analysis, and preliminary findings on
89 children. Journal of the American Academy of Child Psychiatry, 17, 576-588.
Freeman, B. J., Ritvo, E. R., & Schroth, P. C. (1984). Behavior assessment of the syndrome
of autism: Behavior Observation System. Journal of American Academy of Child Psy-
chiatry, 23, 588-594.
Hubert, L. (1977)~ Kappa revisited. Psychological Bulletin, 84, 289-297.
Hubert, L. J. (1978). A general formula for the variance of Cohen's weighted kappa. Psycho-
logical Bulletin, 85, 183-184.
Autism Diagnostic .Observation Schedule 211
Krug, D. A., Arick, J., & Almond, P. (1980). Behavior checklist for identifying severely han-
dicapped individuals with high levels of autistic behavior. Journal of Child Psychology
and Psychiatry, 21, 221-229.
LeCouteur, A., Rutter, M., Lord, C., Rios, P., Robertson, S., Holdgrafer,,M., & McLennan,
J. D. 0989). Autism diagnOSticinterview. Journalof Autism and DevelopmentalDisorders, 19.
Lord, C., & Magill, J. (1989). Observing_social behavior in an asocial population: Methodo-
logical and clinical issues. In G. Dawson (Ed.), Autism: New perspectives on diagnosis,
nature and treatment. New York: Guilford Press.
Magill, J., & Lord, C. (1988). An observational study of greetings of autistic, behavior-disordered
and normally developing children. Manuscript submitted for publication.
Marascuilo, L. A., & McSweeney, M. (1977). Nonparametric and distribution-free methods
for the social sciences. Monterey, CA: Brooks/Cole Publishing Co.
Mundy, P., Sigman, M., Ungerer, J., & Sherman, T. (1986). Defining the social deficits of
autism: The contribution of nonverbal communication measures. Journal of ChildPsy-
chology and Psychiatry, 27, 657-669.
Murray, H. A. (1938). Explorations in personality. New York: Oxford.
Parks, S. L. (1983). The assessment of autistic children: A selective review of available instru-
ments. Journal of Autism and Developmental Disorders, 13, 225-268.
Pettit, G. S., McClaskey, C. L., Brown, M. M., & Dodge, K. A. (1987). The generalizability
of laboratory assessments of children's socially competent behavior in specific situations.
Behavioral Assessments, 9, 81-96.
Rice, J. P., McDonald-Scott, P., Endicott, J., Coryell, W., Grove, W. M., Keller, M. B., &
Altis, D. (1986). The stability of diagnosis with an application to bipolar II disorder.
Psychiatry Research, 19, 285-296.
Ruttenberg, B. A., Kalish, B. I., Wenar, C. & Wolf, E. G. (1977). Behavioral rating instru-
ment for autistic and other atypical children. Philadelphia: The Developmental Center
for Autistic Children.
Rutter, M. (1978). Diagnosis and definition of childhood autism. Journal of Autism and De-
velopmental Disorders, 8, 139-161.
Rutter, M. (1985). Infantile autism. In D. Shaffer, A. Erhardt, & L. Greenhill (Eds.). A clini-
cal guide to child psychiatry (pp. 48-78) . New York: Free Press.
Rutter, M., Le Couteur, A., Lord, C., MacDonald, H., Rios, P., & Folstein, S. (1988).
Diagnosis and subclassification of autism: Concepts and instrument development. In E.
Schopler & G. B. Mesibov (Eds.), Diagnosticand assessment issuesin autism (pp. 239-260).
New York: Plenum Press.
Rutter, M., & Schopler, E. (1987)~. Autism and pervasive developmental disorders: Concepts
and diagnostic issues. Journal of Autism and Developmental Disorders, 17, 159-186.
Schopler, E., Reichler, R. J., & Renner, B. R. (1986). The ChildhoodAutism Rating Scale (CARS)
for diagnostic screening and classification o f autism. New York: Irvington Publishers.
Siegel, B., Anders, T. F., Ciaranello, R. D., Beinenstock, B., & Kraemer, H. C. (1986). Empir-
ically derived classification of the autistic syndrome. Journal of Autism and Develop-
mental Disorders, 16, 275-293.
Teal, M. B., & Wiebe, M. J. (1986). A validity analysis of selected instruments used to assess
autism. Journal of Autism and Developmental Disorders, 16, 485-494.
Volkmar, F. R., Ciccbetti, D. V., Dykens, E., Sparrow, S. S., Leckman, J. F., & Cohen, D.
J. (1988). An evaluation of the Autism Behavior Checklist. Journal of Autism andDe-
velopmental Disorders, 18, 81-98.
Voikmar, F. R., Sparrow, S. S., Goudreau, D., Ciccbetti, D. V., Paul, R., & Cohen, D. J.
(1987). Social deficits in autism: An operational approach using the Vineland Adaptive
Behavior Scales. Journal of the American Academy of Child and Adolescent Psychiatry,
26, 156-161.
Wechsler, D. (1974). WechslerIntelligence Scalefor Children-Revised. New York: Psychologi-
cal Corp.
Wechsler, D. ( 1981). WechslerIntelligence Scalefor Children-Revised. New York: Psychologi-
cal Corp.
212 Lord, Rutter, Goode, Heemshergen, Jordan, Mawhood, and Schopler
Wenar, C., Ruttenberg, R. A~, Kalish-Weiss, B.,& Wolf, E. G. (1986), The development of
normal and antistic children: A comparativesmdy. Journal of Autism and Developmental
Disorders, 16, 317-334.
Wolff, S., & Barlow, A. 0979). Schizoid ,personality in childhood: A comparative study of
schizoid, autistic and normal children, Journal of Child Psychology and Psychiatry, 20,
29-46. 9
World Health Organization. (1987). ICD-I O1986 draft o f chapter 5 categories FOO-F99.Men-
tal, behavioral and developmental disorders. Geneva: Author.