Professional Documents
Culture Documents
Accounting For Individual Differences When Comparing The Effectiveness of Remedial Language Teaching Methods
Accounting For Individual Differences When Comparing The Effectiveness of Remedial Language Teaching Methods
Accounting For Individual Differences When Comparing The Effectiveness of Remedial Language Teaching Methods
ABSTRACT
Two language intervention methods were compared on their effectiveness at improving syntactic
structures as measured by gain scores on the Developmental Sentence Scoring (DSS) procedure.
Forty-one expressive language delayed children were assigned to either an Interactive approach
{n = 25) or a Programmed approach (n = 16), following a series of 13 pre-training diagnostic
tests. An analysis of group mean DSS gain scores did not indicate a significant difference
between the two teaching methods. However, when individual differences on the pre-training
tests were incorporated into further group comparisons, disordinal Aptitude x Treatment
interactions occurred for measures of intelligence, initial syntax level, and visual-motor integra-
tion. The Interactive approach optimized syntax improvement for children with relatively high
pre-training scores on each of the three measures, whereas the Programmed approach resulted
in superior syntax performance for low-scoring children.
Design
Two rehabilitative teaching programs were developed for use with small
groups of language-delayed children at the Northwestern University Speech
and Language Clinics. Twenty-five children were assigned to Interactive
Language Development Teaching (Lee, Koenigsknecht, & Mulhern, 1975)
Applied Psycholinguistics 1:2 154
Friedman & Friedman: Individual reactions to language training
Criterion measure
RESULTS
Pre-training analyses
Means and standard deviations by group for each of the 13 pre-training
measures are shown in Table 1. A series of two-tailed t tests (Ferguson,
1971) was computed to examine whether children within the two training
groups were comparable on these attribute measures prior to intervention.
Results of these analyses, also shown in Table 1, did not reveal significant
differences on any of these measures. Of particular importance was the
finding that groups were comparable in mean syntactic development as
Applied Psycholinguistics 1:2 160
Friedman & Friedman: Individual reactions to language training
Test X SD X SD i"
measured on the pre-training DSS. The average syntax delay estimated for
those in the Interactive groups was 21 months and in the Programmed
groups, 17 months. These age-score delays were determined from norms
developed by Lee (1974).
Deficiencies in pre-training performance were apparent on two additional
expressive language tasks that placed some demand on the children's syntac-
tic skills (see Table 1). At the beginning of the treatment period the mean
expressive score on the NSST was more than two standard deviations below
expectation for a 3-year-old. The average Spencer-McGrady Sentence repe-
tition score was also substantially below reported norms. Two additional
measures indicating a general impairment in this group's language skills
were the Receptive section of the NSST and the Northwestern University
Articulation Proficiency Test. On the Receptive portion of the NSST, a
screening device for sampling comprehension of specific syntactic structures,
the mean age equivalent score for all subjects was more than one year below
the group's chronological age. On the general articulation proficiency test
the children obtained a mean of only 32.5 correct single-consonant produc-
tions out of a possible 67 items. Similarly, the mean pre-treatment perform-
Applied Psycholinguistics 1:2 161
Friedman & Friedman: Individual reactions to language training
ance on the Memory Span for Digits was less than the three-digit span
normally obtained by 3-year-olds.
It is also important to consider those measures on which these children
averaged at or above their age-level expectations as a group. Mean scores
reported in Table 1 for the Stanford-Binet intelligence scale, the total ITPA,
and the Preschool Attainment Record indicated that the general level of
functioning for these children was commensurate with their chronological
ages. However, individual subtests on the ITPA did show specific problems
in language learning and usage. Pre-treatment results from two receptive
language measures were also normal. These were the Pea body Picture
Vocabulary Test, a measure of receptive vocabulary for single words, and the
Oral Commissions subtest of the DTLA, a measure of comprehension of
sequenced basic commissions. Finally, the children scored within normal
limits on the two visual-motor skill measures: the Developmental Test of
Visual-Motor Integration and the Draw-a-Man test.
Both training groups did, on the average, show significant gains in overall
DSS scores during the training period, ?(24) = 5.59, p < .01 for the
Interactive group, and t{l5) = 2.51,/? < .05 for the Programmed group. The
mean gain per child was approximately 10.8 months over the eight-month
training period. However, a comparison of these average DSS gain scores for
the two groups did not indicate a significant effect of teaching type, f (39) =
1.30, p > .10 (two-tailed test). This main effect comparison ignored any
individual child differences that may have interacted with the two teaching
procedures. Further analyses incorporated information on the 13 pre-training
variables to assess the relative effectiveness of the two teaching approaches
at improving child syntax.
Table 2. Within-group correlations and tests of parallel slopes for DSS gain crite-
rion score regressed on 13 separate pre-training measures
Figure 1 shows that the interaction between the IQ measure and the two
teaching procedures was disordinal. That is, the two within-group regression
lines crossed over within the measured range of IQ scores. The Interactive
approach resulted in greater DSS improvement than did the Programmed
approach for children with IQs toward the top of the normal range.
Conversely, the Programmed method resulted in greater gains on the DSS
for those children with lower scores. The within-group Pearson product-
moment correlations, reported in Table 2, between IQ and DSS measures
also demonstrated the differential regression. A significant (p < .05) direct
relationship was observed for children within the Interactive group (r = .50),
and an inverse correlation within the Programmed group (r = -.26, p >
.05).
To further analyze this disordinal interaction the Johnson-Neyman tech-
nique was employed (Johnson & Neyman, 1936). This test defines a region
of homogeneity around the crossover point of two nonparallel regression
lines. In other words, a range of scores on the aptitude variable was specified
over which neither of the two teaching methods produced significantly more
improvement on the DSS criterion variable. Results of an application of this
Johnson-Neyman technique to the IQ data revealed a region of homogeneity
between 58 and 112 on the Stanford-Binet (p = .05). Although there were
nine children (22.0% of the entire sample) observed above this region of
Applied Psycholinguistics 1:2 163
Friedman & Friedman: Individual reactions to language training
3.6
3.0
S 2.4
S 1.8
a
1.2
0.6
I I I I I
70 80 90 100 110 120 1.0 2.0 3.0 4.0 5.0 6.0
Figure 1. Significant disordinal interactions for DSS gain scores regressed on three
pre-treatment variables.
DISCUSSION
The findings of this study indicated that research efforts examining the
relative merits of different remedial language training procedures may be
enhanced by the inclusion of individual difference variables in their designs.
Rather than accounting for a child's characteristics, which may differen-
tially interact with specific training procedures, most language training
comparison studies have simply discussed them as error variance. This
unexplained variability within groups could serve only to statistically mask
treatment differences that might otherwise have proven important. Perhaps
there are very few main effects to be found, and the relative effectiveness of
different intervention procedures will be observed only when successful
attempts are made to account for individual differences.
Although some experimenters of remedial language training procedures
have incorporated individual characteristic variables into their research, they
have done so by simply treating these variables as additional factors in an
analysis of variance design. For example, to examine the relationship
between intelligence and DSS gain scores, or its interaction with the two
language training procedures, the entire subject pool might have initially
been stratified into high- and low-IQ groups. Children within each intelli-
gence group would then be randomly assigned to the two language interven-
tion techniques. The resulting Treatments x IQ Levels factorial analysis of
variance would have been a less powerful analysis procedure than a compari-
son of within-group regression lines. Creating artificial levels of a continuous
individual difference variable, such as IQ, results in the loss of considerable
information. By lumping high intelligent children into a single group, any
individual difference effects within that group have been lost. There is also
some difficulty in deciding on the exact point of dichotomy between groups
of high and low intelligent children. Should the data be examined for a
natural split in the score distribution, or should a measure of central
tendency be used as a cutoff point? No matter how a group division is
determined, it will inevitably lead to placing subjects who differ by only a
few points on some attribute measure into different classification groups. In
Applied Psycholinguistics 1:2 165
Friedman & Friedman: Individual reactions to language training
addition, when starting out with typically small treatment samples, dividing
subjects into even smaller subgroups makes it more difficult to satisfy the
underlying distribution assumptions for appropriately using most parametric
statistics.
A variety of previous methods studies have demonstrated that global,
multifactor measures are less likely to show significant Aptitude x Treat-
ment interactions than unifactor measures of specific subject characteristics
(see Bracht, 1970). Therefore, the significant disordinal interaction observed
between intelligence scores and the language procedures was not expected.
However, for children who scored high within this group on the intelligence
measure, the Interactive approach does, on the surface, seem to be more
effective than the Programmed approach. It is certainly not as repetitive or
drill-like, and appears less likely to bore higher IQ children. In addition, it is
possible that to be effective the Interactive story procedure requires children
to come into training with a higher level of conceptual skill. Some of the
lower IQ children may have been confused by the relatively complicated
discussion procedures of the Interactive approach.
A major finding of this study was the inverse relationship for both training
groups between initial syntax level and amount of syntactic growth ^s
measured by the DSS. That is, children who scored lowest on the DSS
procedure prior to either teaching approach showed the largest gains during
training. The statistical artifact of regression toward the mean, based on
imperfect test-retest correlations, might at least partially explain these
negative coefficients. One criterion for determining that a child was delayed
in expressive language and in need of remedial training was an abnormally
low score on the DSS screening task. Since the DSS is not a perfectly reliable
instrument, any errors of measurement during this initial screening would
tend to be consistently biased against those who scored poorly on the test.
This was particularly true for children measuring at the very lowest levels,
who were then selected for remedial training. Their scores would have the
greatest tendency to regress upward when retested with the same instrument.
It is also possible that children who initially scored at the lowest syntax
levels simply demonstrated a spontaneous spurt in language development
that happened to coincide with the training period. This is the well-known
phenomenon of the "late bloomer." However, since children were arbitrarily
assigned to the two teaching procedures, such a spontaneous language jump
would not explain the stronger inverse relationship observed within the
Programmed group.
It seems more likely that the nature of the DSS instrument itself may have
prevented the more advanced children from demonstrating syntactic growth
that had actually occurred. The hierarchical maturity scales developed for
each of the DSS syntax categories may be interpreted only as ordinal
measures. That is, a jump in score from 1 to 3 on any category might not
reflect the same magnitude of syntactic development as a jump from 5 to 6.
It is entirely possible that numerical DSS gains at the higher category stages
were considerably more difficult to make than gains at lower levels. This
Applied Psycholinguistics 1:2 166
Friedman & Friedman: Individual reactions to language training
would allow children who were beginning their training at less mature stages
of grammatical development to more easily show quantitative improvement
than those at a more mature stage. In addition, if one intervention procedure
was more effective at high levels of syntax construction, and the other at
lower levels, this would be reflected in differential correlation coefficients.
Such confounding effects as regression artifacts and maturational gains
could be more easily explained if a control group of untrained language-
disordered children were administered the pre- and post-tests. However, in
the area of child language disabilities, clinical researchers attempting to
establish the effectiveness of their procedures face an important limitation:
They cannot ethically use control groups. It would be unconscionable to
withhold clinical training for any period of time from a child who needs it
and might gain from it. This preoludes designs in which one group of
children receives treatment while treatment is withheld from a comparable
group in order to show that the method produced results.
That a visual-motor aptitude measure significantly interacted with the
teaching methods was not at all surprising. The Beery-Buktenica, through a
copying task, measures the child's ability to actively demonstrate visual-
motor skills. It must be pointed out that the Interactive approach lessons
were designed to employ a combination of media to get the training message
across. In fact, the use of visual-motor skills was specifically incorporated at
the end of each lesson. Children with more advanced visual-motor skills
might have looked forward to this portion of the lesson, and were, therefore,
more motivated to participate in the preceding story telling. On the other
hand, the Programmed approach required a passive reaction to verbal and
pictorial stimuli, rather than an active demonstration of motor skills. Chil-
dren who were inclined to more active visual-motor behaviors may have lost
interest in this procedure, resulting in an inverse correlation between visual-
motor integration scores and DSS gains.
Although several interpretations have been presented for the three signifi-
cant disordinal interactions, some discussion seems necessary of why so many
of the individual difference measures did not interact with the treatments.
This is particularly critical for the other measures of language abilities. Why
wouldn't high scorers on the NSST Expressive or Receptive, or any of the
other developmental language measures, behave similarly to high scorers on
the DSS? It must be emphasized that significantly different training-group
slopes were obtained for only 3 of the 13 pre-treatment variables.
One explanation is that, when the two contrasting types of language
training were designed, not enough thought was given to analyzing the data
for Aptitude x Treatment interactions. An effort should have been made to
avoid overlapping procedures in the contrasting methods. For example,
although teacher reinforcements were not carefully controlled during the
Interactive training, they were extensively used to stimulate "natural"
interaction. It was also inefficient first to identify alternative training
approaches and then through trial and error to find individual difference
variables to interact with these teaching methods. The analysis of an
Applied Psycholinguistics 1:2 167
Friedman & Friedman: Individual reactions to language training
ACKNOWLEDGMENTS
Some of the observed training groups were part of the Modified Language Input
project supported by Grant #OEG-0-71-0949 from the Office of Education. The
authors wish to thank Laura L. Lee and Roy A. Koenigsknecht for suggestions about
the manuscript, and Susan T. Mulhern for aid in collection of the data. Requests for
reprints should be sent to Philip Friedman, Department of Psychoeducational Stud-
ies, Howard University, Washington, D.C. 20059.
REFERENCES
Baker, H. J., & Leland, B. Detroit tests of learning aptitude. Indianapolis, Ind.: Bobbs-Merrill,
1967.
Bates, E. Pragmatics and sociolinguistics in child language. In D. Morehead and A. Morehead
(Eds.), Normal and deficient child language. Baltimore: University Park Press, 1976.
Beery, K. E., & Buktenica, N. A. Developmental test of visual-motor integration. Chicago:
Follett, 1967.
Bloom, L. Language development: Form and function in emerging grammars. Cambridge,
Mass.: MIT Press, 1970.
Bracht, G. H. Experimental factors related to aptitude-treatment interactions. Review of
Educational Research, 1970,40, 627-645.
Bricker, W., & Bricker, D. Early language training strategy. In R. Schiefelbusch and L. Lloyd
Applied Psycholinguistics 1:2 168
Friedman & Friedman: Individual reactions to language training
Klima, E. S., & Bellugi, U. Syntactic regularities in the speech of children. In J. Tyons and R. J.
Wales, (Eds.), Psycholinguistics papers. Edinburgh University Press, 1966.
Koenigsknecht, R. A., & Friedman, P. Syntax development in boys and girls. Child Develop-
ment, 1916,4, 1109-1115.
Koenigsknecht, R. A., Friedman, P., & Hill, D. Developmental sentence scoring procedures
used in repeated measurement of spontaneous syntax usage of preschool children: A
longitudinal study. Paper presented at a meeting of the American Speech and Hearing
Association, Detroit, 1973.
Koenigsknecht, R. A., & Lee, L. L. Validity and reliability of developmental sentence scoring:
A method for measuring syntactic development in children's spontaneous speech. Paper
presented at a meeting of the American Speech and Hearing Association, Chicago, 1971.
The assessment of grammatical development in children and the clinical presentation of
grammatical structure to children with language problems. United States Office of
Education final report. Evanston, 111.: Northwestern University Press, 1974.
Lee, L. L. The Northwestern syntax screening test. Evanston, 111.: Northwestern University
Press, 1969, 1971.
A screening test for syntax development. Journal of Speech and Hearing Disorders, 1970,
55,103-112.
Developmental sentence analysis: A grammatical assessment procedure for speech and
language clinicians. Evanston, 111.: Northwestern University Press, 1974.
Lee, L. L., & Canter, S. M. Developmental sentence scoring: A clinical procedure for
estimating syntactic development in children's spontaneous speech. Journal of Speech
and Hearing Disorders, 1971, 36. 315-340.
Lee, L. L., Koenigsknecht, R. A., & Mulhern, S. Interactive language development teaching.
Evanston, III.: Northwestern University Press, 1975.
Liebert, R. M., Odom, R. D., Hill, J. H., & Hun", R. L. Effects of age and rule familiarity on
the production of modeled language constructions. Developmental Psychology, 1969, /,
108-112.
Lovaas, O. I., Berberich, J. P., Perloff, B. F., & Schaeffer, B. Acquisition of imitative speech by
schizophrenic children. Science, 1966, 151, 705-707.
MacDonald, J., & Blott, J. Environmental language intervention: The rationale for a diagnostic
and training strategy through rules, context, and generalization. Journal of Speech and
Hearing Disorders, 1974, 39, 244-256.
Martin, R., & Haroldson, S. The effects of two treatment procedures on stuttering. Journal of
Communication Disorders, 1969, 2, 115-125.
McGrady, H. Verbal and nonverbal functions of school children with speech and language
disorders. Ph.D. dissertation, Northwestern University, 1964.
McNeill, D. The acquisition of language: The study of developmental psycholinguistics. New
York: Harper & Row, 1970.
The capacity for the ontogenesis of grammar. In D. Slobin (Ed.), The ontogenesis of
grammar. New York: Academic Press, 1971.
Miller, J., & Yoder, D. An ontogenetic language teaching strategy for retarded children. In R.
Schiefelbusch and L. Lloyd (Eds.), Language perspectives - acquisition, retardation,
and intervention. Baltimore: University Park Press, 1974.
Miller, W., & Ervin, S. The development of grammar in child language. Child Development
Monographs, 1964, 29, 9-34.
Monsees, E. Structured language. Washington: Children's Speech and Hearing Center, 1970.
Muma, J. The communication game: Dump and play. Journal of Speech and Hearing
Disorders. 1975, 40, 296-300.
Peins, M., McGough, W. E., & Lee, B. S. Evaluation of a taperecorded method of stuttering
therapy: Improvement in speaking task. Journal of Speech and Hearing Disorders, 1972,
15. 364-371.
Risley, T., & Wolf, M. Establishing functional speech in echolalic children. Behavioral
Research and Therapy, 1967, 5, 73-88.
Rosenthal, T. L., and Whitebrook, J. S. Incentives versus instructions in transmitting grammati-
cal parameters with experimenter as model. Behavioral Research and Therapy, 1970, 8,
189-196.
Applied Psycholinguistics 1:2 170
Friedman & Friedman: Individual reactions to language training
Sarno, M. T., Silverman, M., & Sands, E. Speech therapy and language recovery in severe
aphasia. Journal of Speech and Hearing Disorders, 1970, 13, 607-623.
Schlesinger, I. Relational concepts underlying language. In R. Schiefelbusch and L. Lloyd
(Eds.), Language perspectives — acquisition, retardation and intervention. Baltimore:
University Park Press, 1974.
Seashore, H. G., Wesman, A. G., & Doppelt, J. E. The standardization of the Wechsler
Intelligence Scale for Children. Journal of Consulting Psychology, 1950, 14, 99-110.
Shavelson, R. J., Berliner, D. C , Ravitch, M. M., & Loeding, D. Effects of position and type of
question on learning from prose material: Interaction of treatments with individual
differences. Journal of Educational Psychology, 1974, 66, 40-48.
Sommers, R. K. Effectiveness of individual versus group articulation therapy. Paper presented
at a meeting of the American Speech and Hearing Association, San Francisco, 1964.
Spencer, E. M. An investigation of the maturation of various factors of auditory perception in
preschool children. Ph.D. dissertation, Northwestern University, 1958.
Terman, L. M., & Merrill, M. A. Stanford-Binet Intelligence Scale. Boston: Houghton Mifflin,
1960.
Tobias, S. Sequence, familiarity, and attribute by treatment interaction in programmed instruc-
tion. Journal of Educational Psychology, 1973, 64, 133-141.
Walker, H. M., & Lev, J. Statistical inference. New York: Holt, 1953.
Winer, B. J. Statistical principles in experimental design. New York: McGraw-Hill, 1971.
Zwitman, D. H., & Sonderman, J. C. A syntax program design to present base linguistic
structures to language-disordered children. Journal of Communication Disorders, 1979,
12. 323-335.