Professional Documents
Culture Documents
The Narrative Language Performance of Three Types of At-Risk First-Grade Readers
The Narrative Language Performance of Three Types of At-Risk First-Grade Readers
net/publication/221735881
CITATIONS READS
16 588
3 authors, including:
Teresa Ukrainetz
Utah State University
42 PUBLICATIONS 1,408 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Teresa Ukrainetz on 07 March 2019.
Article
Purpose: This study investigated the narrative language per- of different words, and number of communication units. The
formance of 3 types of readers who had been identified as being narratives of early resolvers and good responders differed sig-
at risk through code-based response-to-intervention (RTI) nificantly on the productivity index, number of coordinating
procedures. conjunctions, and number of episodic elements. There were no
Method: In a retrospective group comparison, 32 at-risk other significant differences.
1st-grade readers were identified: children who resolved with- Conclusion: Types of learners distinguished by a code-based
out intervention (early resolvers, n = 11), children who met RTI model showed differences in their narrative language. First
criterion following 4 weeks of intervention (good responders, graders who responded well to code-based reading interven-
n = 8), and children who failed to meet criterion following tion retold stories that contained more language and better story
4 weeks of intervention (poor responders, n = 13). A narrative grammar than first graders who did not respond well to inter-
retell and a norm-referenced language test were obtained vention. These results indicate the need to evaluate narrative
before intervention. language performance within RTI, especially for early resolvers.
Results: There were no significant differences between the
3 learner types on the language test. However, the narratives
of the good responders were significantly higher than the nar- Key Words: response to intervention, reading, narratives,
ratives of the other 2 groups on total number of words, number language
he identification of children who are at risk for aca- difficulties. This study examined the narrative language per-
LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012 * American Speech-Language-Hearing Association 205
RTI in the schools is still in the early stages, with many Decoding skills and reading comprehension come together in
variations in administration and implementation challenges fluent, skilled reading. However, there are important differ-
(Fuchs, 2003; Marston, 2005; Mellard, McKnight, & Woods, ences between the two domains.
2009; Ukrainetz, 2006a). Decoding means “breaking the code” and deals with rec-
Children who are struggling with reading are identified ognizing the printed word. It involves the print-specific skills
early in their schooling as having a trajectory that will not of alphabet knowledge, letter-sound correspondence, and
allow them to meet year-end benchmarks. In the RTI process, spelling conventions. Decoding requires a degree of lan-
educators intervene by teaching particular aspects of reading guage facility. Reading a printed word involves the language
and continuously monitoring a child’s response to this in- processes of holding phonological representations in mem-
struction (Fuchs & Fuchs, 2006; Fuchs, Fuchs, & Compton, ory, separating and manipulating phonemes, and retrieving
2004; Fuchs, 2003). Although instruction can be individual- the meaning attached to a phonological representation (Adams,
ized and optimized based on learner response, akin to lan- 1990). Deciding how to read a single printed word taps the
guage treatment, it is more typically delivered for set periods linguistic skills of vocabulary, syntax, and discourse (e.g.,
of time in a standardized manner with high instructional deciding whether to read the word read as /rid/ or /rEd/).
fidelity (Troia, 2005). The multitier RTI process begins by Reading comprehension involves putting the recognized
evaluating the performance of all children within a grade words together to gain sentential and textual understand-
level. In Tier I, at-risk readers initially receive classroom ing. Reading comprehension has cognitive, attentional, and
instruction with regular evaluation. Progress for some of the situational processes involved, but it is essentially language
children accelerates so they meet benchmarks by the next (Kamhi, 2009). Language is a huge domain that is diffi-
evaluation. These children are considered to have been solely cult to delimit; most educational learning involves written
experientially deficient and continue with only regular class- or spoken language (American Speech-Language-Hearing
room instruction. The children still not meeting expecta- Association [ASHA], 2001; Ukrainetz & Fresquez, 2003).
tions then receive supplementary instruction in Tier II. This Language-in-print differs in some significant ways from
instruction may be provided as differentiated small-group spoken language, such as in patterns of syntactic complexity
instruction with more intense monitoring. Children not re- and decontextualization (Scott, 1995; Westby, 1985), but
sponding sufficiently to the second tier of instruction may essentially language is still language: a complex, dynamic,
be provided with the even more intense Tier III intervention. multidimensional, rule-governed, conventional representa-
Children who do not respond satisfactorily to intensive tional system that crosses representational modalities.
supplementary support are considered for identification as The two aspects of reading, decoding and comprehension,
reading disabled and eligibility for long-term special edu- differ in their length and nature of acquisition. Decoding and
cational support (Swanson & Howard, 2005). fluency are constrained skills in that they are learned fairly
Valid decisions about learning responses in RTI can only quickly and thoroughly (Paris, 2005). Early code-related
be made if quality instruction has been provided. Quality skills are relatively easy to teach, and children can show large
instruction involves research-validated curriculum and gains in letter names, letter sounds, and phonemic awareness.
practices. Tier I instruction addresses all five “pillars” of By third grade, typical achievers have generally acquired
instruction: phonemic awareness, phonics, vocabulary, fluent decoding skills.
comprehension, and fluency (National Institute of Child In contrast, the language skills and world knowledge that
Health and Human Development [NICHD] National Read- comprise reading comprehension begin in infancy with con-
ing Panel, 2000). Tier II (and III) intervention is primarily versational interactions, grow in the school years with oral
code based. In kindergarten, instruction focuses on letter- and print experiences, and, depending on life experiences
sound correspondence, phonemic awareness, and basic word and learner inclinations, continue to improve through life.
decoding skills. In first and second grade, more advanced These skills are called unconstrained because of the lack of
word decoding skills and rapid, accurate oral reading are a specified mastery criterion: More is usually better (Paris,
addressed. This selective instructional focus is needed to 2005). In addition to a much longer learning trajectory, lan-
provide the intensity and therapeutic attention needed for guage is different from decoding in its modifiability. Specific
at-risk readers (Ukrainetz, 2006a). However, it may also vocabulary may be acquired easily through lived experi-
result in overlooking children who may be at risk for other ence (e.g., acquiring a dog leads to an array of canine-related
important reading skills, for example, comprehension. vocabulary). However, the rest of language is more resis-
tant to change, particularly through school lessons (see
Reading and Language in a Code-Based Weaver, 1996, for the many fruitless efforts over decades to
improve students’ written grammar).
RTI Model
Based on the differences between decoding and com-
To read fluently and understand what is read, one must prehension, RTI models concentrated on one domain may
have competency in two broad domains: decoding and not predict performance on the other domain, thus missing
comprehension (Gough & Tunmer, 1986; Perfetti, 1985). important information from half of the reading equation. The
206 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
only study located that examined this is Al Otaiba and Fuchs allow quick and reliable administration and scoring. Global
(2006). Al Otaiba and Fuchs examined children’s respon- norm-referenced tests often provide separate receptive and
siveness to code-based instruction and investigated the learner expressive scores, but expressive skills are usually limited
characteristics associated with responsiveness. Measures of to single word and sentence responses.
fluency, letter sounds, and phoneme segmentation in kinder- An alternative approach to measuring children’s language
garten and of oral reading in first grade were used to iden- skills is to use discourse tasks that evaluate these compo-
tify responsiveness by considering growth from pretreatment nents in integrated, functional wholes. Narratives are an
to posttreatment. Children were identified as always respon- important genre of discourse that can tap multiple features
sive, meeting the end-of-year criterion for kindergarten and levels of language simultaneously (Hoffman, 2009a;
and first grade; never responsive, never meeting the end-of- Ukrainetz, 2006b). A variety of narrative analyses have dif-
year criterion for kindergarten or first grade; or sometimes ferentiated children with typically developing (TD) language
responsive, meeting the end-of-year criterion for kinder- from those with language impairment (LI) (e.g., Fey, Catts,
garten or first grade, but not both. Receptive vocabulary Proctor-Williams, Tomblin, & Zhang, 2004; Gillam &
and sentence imitation, along with problem behavior, naming Johnston, 1992; Liles, 1985; Newman & McGregor, 2006;
speed, and amount of intervention, predicted reading fluency Scott & Windsor, 2000). Narrative performance is typically
responsiveness after 2 years of intervention. The extended evaluated through language sampling procedures that may
period of intervention time within the Al Otaiba and Fuchs be standardized and may even have age comparison infor-
study differed from the RTI goal of short-term identification mation (e.g., Miller & Chapman, 2004; Strong, 1998), but
and amelioration; however, it suggests that language may the open-ended responses and the detailed descriptive results
be predictive of reading performance within an RTI model. are not typically amenable to the strict psychometric proce-
An important consideration is that the language measures dures of norm-referenced testing. An exception to this is
and scoring systems used in the study provided a very limited the Test of Narrative Language (TNL; Gillam & Pearson,
view of language performance. 2004), which samples whole narratives, but performance is
Reading competence involves both a set of decoding still reduced to overall receptive and expressive scores.
skills and the large domain of comprehension. Because of One type of narrative task, a retell of a previously pre-
the differing natures of these two domains, code-based RTI sented story, has both comprehension and production demands.
models may be missing the linguistic side of reading. The model narrative presents the events, plot structure, and
words that the narrator is to retell, which allows more reliable
scoring than a generated story that can go in many directions.
Language and Reading
Retelling requires language, attention, comprehension, and
Young or poor readers may try to bypass decoding diffi- memory for the original story. Despite the support of the
culties by tapping into the linguistic side of reading through source story, as narrators retell the story, they filter the
the spoken modality. There are strong positive relations events through their own linguistic and cognitive resources
between oral language and both decoding and reading com- (Mandler & Johnson, 1977; Slobin & Welsh, 1973). Thus,
prehension (Bishop & Adams, 1990; Catts, Fey, Tomblin, & narrative retells may be a particularly revealing elicitation
Zhang, 1999, 2002; Catts, Fey, Zhang, & Tomblin, 2001; task for investigating relations between children’s reading
Nation, Clarke, Marshall, & Durand, 2004; Roth, Speece, & and their oral language.
Cooper, 2002; Snowling, 2005): Vocabulary has been con- Microstructure analysis. Narrative retells can be evalu-
sistently implicated in reading performance (NICHD National ated at microstructural and macrostructural levels. Micro-
Reading Panel, 2000); isolated measures of syntax have been structure is the sentence-internal structure of a narrative
found to predict reading difficulties (Badian, McAnulty, (Justice et al., 2006). It involves measures of vocabulary
Duffy, & Als, 1990; Catts et al., 2001; Vellutino et al., 1996); (e.g., quantity and variety of words), morphological structure
and a measure of grammatical complexity obtained during (e.g., inflections and derivations), and syntax (e.g., con-
conversational samples was found to contribute unique vari- junctions, noun phrases, and dependent clauses). Micro-
ance to later reading skills (Scarborough, 1990). In addition, structural measures that have differentiated children with
children with and without reading difficulties have been language difficulties from children with TD language include
differentiated on narrative episodic structure (e.g., Feagans total number of words (TNW), number of different words
& Short, 1984; Ripich & Griffith, 1988; Roth & Spekman, (NDW), number of utterances, mean length of utterance
1986). (MLU), and grammatical accuracy (e.g., Fey et al., 2004;
Children’s language skills can be measured in many Newman & McGregor, 2006; Scott & Windsor, 2000).
ways. Norm-referenced tests typically take a discrete skill These individual measures have shown some utility in de-
approach that elicits components of language in isolation scribing language and distinguishing groups; nonetheless,
and combines the results into global age-referenced scores, grouping outcomes from two or more measures may produce
thus evaluating children’s general language skills. Norm- more robust group differentiation (e.g., Puranik, Lombardino,
referenced tests involve strictly standardized procedures that & Altmann, 2008). A common grouping is by the language
208 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
poor start, possibly from poor test-taking behavior or mini- the norm-referenced test to potentially subtle differences in
mal experiential deficits. In April, the narrative language linguistic competence of the three learner types.
performance and general language skills of all children The purpose of this study was to examine the narrative
participating in the study were assessed before code-based language performance of first graders who had been iden-
Tier II intervention was initiated, which also occurred in tified as being at risk for reading difficulties. The specific
April. Once the intervention was completed, good respond- research questions were:
ers and poor responders were identified. & Do these three learner groups differ on their total,
Good responders and poor responders, who had more receptive, or expressive language as measured by a
persistent deficits than early resolvers, responded differently norm-referenced test of oral language?
to Tier II intervention. Good responders accelerated their & Do these three learner groups differ on individual
growth trajectories and met criterion growth expectations. measures of microstructure obtained during a narrative
These children were considered experientially deficient but retell?
good learners who were not expected to require further sup-
& Do these three learner groups differ on indices of
plementary reading instruction. Poor responders consisted
productivity and complexity obtained during a narrative
of the children who failed to meet criterion growth expecta-
retell?
tions following intervention. These children were expected
to have continued reading deficits and possibly inherent & Do these three learner groups differ on macrostructural
learning disabilities. measures of episodic complexity obtained during a
There were several possible outcomes for this study, de- narrative retell?
pending on the relationship between decoding and compre-
hension, the latter being essentially language. If decoding
and language are parallel in the early grades, so a child who
is low in decoding will be low in language, then children METHOD
identified by code-based RTI processes should also have low
language. As such, poor responders would show the weakest
Design
narrative language performance; early resolvers, who do A retrospective group comparison design was employed.
not require supplemental decoding attention, would show the The data were obtained in 2001–2002 as part of a larger
highest narrative language performance; and good respon- project (Allen, Nippold, & Simmons, 2011), with the current
ders would fall between these two groups with their initial narrative analyses designed and applied following data col-
deficits but a good capacity for learning given supplementary lection. All first graders in the district had been assessed
instruction. on reading at the beginning of the school year in September.
However, if decoding and language travel different paths, Study participation was solicited for children who were iden-
then any of the three groups may demonstrate language def- tified as being below expectations in January of first grade and
icits regardless of their performance in decoding and read- reading was reassessed in March. Participants were admin-
ing fluency (Bishop, McDonald, Bird, & Hayiou-Thomas, istered an assessment battery in April and then underwent
2009). If so, additional identification practices need to be 4 weeks of reading intervention and progress monitoring. Only
instituted along with code-based RTI. The outcome of this the measures relevant to the current study are reported here.
study matters most for early resolvers. If these children are
discovered to have low narrative language performance, then
Participants
they are at particular peril because early resolvers are con-
sidered capable learners who, despite an initial misstep, do The participants were selected from a suburban school
not need any attention beyond regular classroom instruction. district encompassing six elementary schools in the Pacific
We employed two measures of language in our study: Northwest. Five of the elementary schools qualified for
a norm-referenced test to measure children’s general lan- Title I services. In this school district, 43% of the children
guage skills and a narrative retell to investigate their narra- qualified for free and reduced-price meals. The majority of
tive language performance. The norm-referenced expressive the children were White (87.2%); of the remaining children,
score was only minimally expressive, with no tasks required 6.8% were Latino, 2.2% were Asian, 2.1% were African
beyond single sentence production. Narrative language American, and 1.7% was Native American.
performance was measured in terms of eight microstructural School district personnel completed early literacy bench-
measures, INMIS productivity and complexity indices, and mark testing for every child in the district three times across
two measures of episodic structure for a narrative retell. Use of the academic year using the Dynamic Indicators of Basic
this general language test allowed determination of whether Early Literacy Skills (DIBELS; Good & Kaminski, 1996)
a narrative retell taps language skills other than those obtained and the Test of Reading Fluency (TORF; Deno, Deno, Marston,
with conventional norm-referenced measures. It was ex- & Marston, 1987). District personnel assessed an original
pected that the narrative retell would be more sensitive than pool of 398 children in kindergarten, who were then followed
210 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
story. In our study, each child listened to an audiotaped story (Hunt 1965). However, from the instructions and examples,
and looked at the pictures from the wordless picture book, along with communication with the authors, it was apparent
A Boy, a Dog, and a Frog (Mayer, 1972). The child turned that C-units were actually used.
the page when he or she heard a specific sound on the record- The hand-coded measures were identified following
ing (i.e., a frog’s croak). An examiner sat with the child Justice et al. (2006). COORD consisted of any of seven coor-
during the story to aid in page turning. After listening, the dinating conjunctions (e.g., and, for, but, so, nor, or, yet) that
child retold the story without the book. To create a reason for were used to connect two clauses in a C-unit when a sub-
the retelling, the child was asked to retell the story into a tape ject was not repeated. For example, “but” was counted in
recorder so that the teacher could listen later (this is an The dog ran but tripped, but not in The dog ran/but the
alternative procedure that is allowed in the SNAP manual). cat tripped. The participants frequently used and, and then,
and then as cohesive tools at the beginning of a C-unit, so
these were not included in the COORD count. SUBORD
Procedure
consisted of any of the 26 subordinating conjunctions pro-
Measurement timeline. The pretest TORF, TOLD–P:3, vided by Justice et al. (see Appendix B in Justice et al., 2006,
and SNAP were administered in April of first grade as part for a complete list). COMPLEX was coded when a C-unit
of a larger assessment battery. Testing was completed in contained one or more dependent clauses. All C-units previ-
three, 45-min sessions in each child’s home school. The ously coded with a subordinating conjunction as well as all
assessment battery was administered to the children in the other C-units with subordinate clauses, including relative
same order. The TORF was also administered once per week clauses (e.g., The girl who has red hair is my friend), nomi-
during the 4-week intervention to the children who received nal clauses (e.g., The frog said, “I am sad.”), and infini-
treatment. The TORF was only administered at pretest and tive clauses (e.g., I want to go), were coded as complex
posttest for the early resolvers. utterances. PROPCOMPLEX was the numerical value for
Narrative analyses. The audiotaped narrative transcripts COMPLEX divided by LENGTH.
were transcribed, segmented, and analyzed at microstructural The INMIS formulae from Justice et al. (2006) were used
and macrostructural levels. to calculate the productivity and complexity of the narratives
Transcription and segmentation. The narrative language produced by each participant. The formulae used to assess
samples were transcribed and segmented by the third author the microstructure of the oral narratives were as follows:
according to guidelines of the Systematic Analysis of Lan-
Productivity ¼ 1:60 þ ð0:001 MLC-WÞ þ ð0:21
guage Transcripts (SALT; Miller & Chapman, 2004). The PROPCOMPLEXÞ þ ð0:017 NDWÞ
narratives were segmented using communication units þ ð0:00054 TNWÞ þ ð0:014 COORDÞ
(C-units; Loban, 1963, 1976). A C-unit is “each independent þ ð0:0072 SUBORDÞ þ ð0:0094 LENGTHÞ
clause with its modifiers” (Loban, 1976, p. 9) or an elliptical þ ð0:068 COMPLEXÞ
response. Elliptical responses could be either an answer to
a question whether it only lacked part of the sentence ele- Complexity ¼ 2:84 þ ð0:27 MLC-WÞ þ ð0:85
ments in the question or each yes or no (Loban, 1976). For PROPCOMPLEXÞ þ ð0:012 NDWÞ
the purposes of this study, each yes or no response that was in þ ð0:0027 TNWÞ þ ð0:028 COORDÞ
response to glossing by the examiner (E: You said the frog þ ð0:026 SUBORDÞ þ ð0:085 LENGTHÞ
þ ð0:14 COMPLEXÞ:
jumped? C: Yes) was omitted from the transcript. Any utter-
ance that did not contribute to the narrative was omitted to Macrostructure analysis. Episodic structure was analyzed
avoid an inflated word count (e.g., Are we done?). using guidelines from the SNAP (Strong, 1998). In the SNAP,
Microstructure analysis. Each transcribed language sam- a complete episode consists of a complication (e.g., He
ple was individually coded and was run through the SALT wanted a pet frog), an attempt to resolve the complication
program. The automated SALT analysis provided TNW, (e.g., He went to find a frog), and a consequence of the
NDW, number of C-units (LENGTH), and mean length of attempt (e.g., He tripped over a tree and fell into the water).
C-units in words (MLC-W). Four measures were hand coded To be considered a complete episode, the three elements had
by the third author: total number of coordinating conjunctions to be presented in that order. The source story consisted of
(COORD), total number of subordinating conjunctions five complete episodes. Each complete episode from the
(WUBORD), total number of complex C-units (COMPLEX), story was scored as 1 point. In addition, the number of epi-
and proportion of complex C-units (PROPCOMPLEX). sodic elements was scored. One point was assigned for each
The INMIS formulae from Justice et al. (2006) were used statement of setting, initiating event, internal response, at-
to further analyze the microstructure. It should be noted tempt, consequence, plan, or reaction.
that Justice et al. reported using minimal terminable units Narrative scoring reliability. A trained graduate student
(T-units) to segment utterances, resulting in a mean length of in speech-language pathology independently transcribed
T-units in words (MLT-W). A T-unit is similar to but more and coded one (33%) of the narratives. Word-to-word
restricted than a C-unit, excluding all nonpredicated utterances match transcription was 91%. Point-to-point agreement
212 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
receptive, F(2, 29) = 0.76, p = .48; and expressive language, across the eight measures. Four measures showed signifi-
F(2, 29) = 0.35, p = .71. The standard deviations were ob- cant differences: TNW, F(2, 29) = 7.17, p = .003, h2 = 0.33;
served to be noticeably larger for the good responders, so NDW, F(2, 29) = 5.06, p = .013, h2 = 0.26; LENGTH,
Levene’s Test of Equality of Error Variances was calculated. F(2, 29) = 6.81, p = .004, h2 = 0.32; and COORD, F(2, 29) =
Results showed that there were no significant differences 5.65, p = .008, h2 = 0.28. No significant differences were
in variance between the groups (total p = .27; receptive obtained for the other four measures: MLC-W, F(2, 29) =
p = .09; expressive p = .50). 0.07, p = .930, h2 = 0.01; SUBORD: F(2, 29) = 0.89, p = .422,
An additional post hoc test was conducted on the observed h2 = 0.06; COMPLEX: F(2, 29) = 1.15, p = .331, h2 = 0.07;
difference between the receptive and expressive language and PROPCOMPLEX: F(2, 29) = 2.06, p = .146, h2 = 0.12.
means. A paired-samples t test indicated that the scores for Follow-up pairwise comparisons were conducted on the
receptive language were significantly greater than the scores four significant measures (Table 5). Good responders per-
for expressive language for all three learner groups: early formed significantly better than early resolvers and poor
resolvers, t(20) = 3.89, p = .0009; good responders, t(14) = responders for TNW, NDW, and LENGTH. Good respond-
3.05, p = .009; and poor responders, t(24) = 2.26, p = .03. ers performed significantly better than early resolvers for
These results indicated that the three learner types did not COORD. All of these comparisons showed large effect sizes.
differ significantly on norm-referenced language test scores, The results indicated that good responders had relatively
but all three learner types showed significantly lower expres- strong narrative microstructure compared to the similarly
sive than receptive results. low performance of early resolvers and poor responders.
214 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
For COORD, or clausal coordination, all three groups, on difference between the episodic elements of good respond-
average, produced fewer than one clausal coordinator. The ers and early resolvers was significant, but the overall
source story only provided five examples of clausal coordi- pattern suggested a relative strength for the good responders.
nation using and within a C-unit. For the most part, the The episodic structure analysis scored only episodes that
children did not produce a modified utterance but simply were retold from the source story, not from any original epi-
omitted the compound sentence from their retell. The lack of sodes. Examination of the narratives showed that the chil-
utterances containing and in the participant retells contrasts dren produced only modeled episodes. Good responders
with its early developmental appearance (Owens, 2008; were also distinguished on how many participants attempted
Retherford, 2000). However, this may be due to the clausal all five episodes modeled (5 of 8 children), compared to the
coordinator scoring system. C-unit segmentation requires other two groups (early resolvers = 0 of 11 children; poor
an utterance division at the clausal coordinator if a second responders = 3 of 13 children).
subject is present (Loban, 1976), resulting in an utterance- Most children attempted at least three episodes but often
initial and for the second utterance. This use of and was failed to complete the episodes. When participants addressed
considered a discourse component, as a continuer and marker an episode, they tended to omit the complication, initiating
of the main story line (Peterson & McCabe, 1991), not an their retell with an attempt to solve the complication (e.g.,
intrasentential connector. Additionally, the COORD variable so he followed the footprints) followed by a consequence.
was scored for the coordination of verbs (e.g., running and These results are similar to those for third- and fifth-grade
jumping) but not nouns (e.g., frog and friend ) that served children who retold stories that included more attempts and
to conjoin clauses within a C-unit. These restrictions lowered outcomes than complications (Goldman & Varnhagen, 1986)
the number of coordinating uses of and that were scored. and for 6-, 7-, and 8-year-olds who also tended to omit com-
But was also used to initiate a C-unit but rarely appeared plications in generated narratives (Peterson & McCabe, 1983).
despite its presence three times in the source story. In sum, in our study, microstructural narrative produc-
Other than COORD, the syntactic measures are different tivity measures and the macrostructural measure of num-
but overlapping aspects of subordination. The source story ber of episodic elements were shown to distinguish among
used 22 tokens (only 1 token per C-unit could be counted, three types of at-risk learners. Where significant differences
resulting in 20 as the maximum for COMPLEX) that contrib- were found, good responders consistently outperformed the
uted to COMPLEX (6 subordinating conjunctions, 2 nominal other two learner types.
dependent clauses, and 14 infinitive clauses). The children
reproduced few of these subordinating structures, with mean Language Performance Compared
performance across the three groups at È3.5 tokens. Infini- to Typical Learners
tive clauses, which are early appearing structures (Owens,
2008; Retherford, 2000) and are not counted as measures of The research questions in this study concerned patterns of
complexity by some authors (e.g., Hughes, McGillivray, & narrative language performance across RTI-identified learner
Schmidek, 1997; Justice & Ezell, 2002), were rarely used types. However, the use of a norm-referenced test and a
by the participants. Across the three groups, the early resolv- standardized narrative sampling procedure that has age com-
ers provided a mean of only 1.6 (SD = 1.6) infinitives, the parison data allowed for consideration of how the language
good responders provided a mean of 3.6 (SD = 2.7) infinitives, of these at-risk learners compared to that of typical learners.
and the poor responders provided a mean of 2.5 (SD = 1.8). The norm-referenced TOLD–P:3 showed marginal over-
It should be noted that the INMIS indices may be of all language ability for all three learner groups in regard to
unexpected magnitudes, and care should be taken in inter- mean standard score performance of 87.03. Sixty-three per-
preting them. The good responders exemplified this: Their cent of the children (n = 20) showed performance in the
complexity score was the lowest of the three groups, al- lower quartile (standard score of ≤90) that RTI is intended
though their performance on the eight contributing measures to flag. Thirty-one percent (n = 10) scored <85, or less than
was as high or higher than those of the other two groups. the 16th percentile, which warrants referral for a special
Their lower complexity score was due to the good respond- education evaluation.
ers having produced more narrative language than the other The narrative analysis showed similar low performance for
two groups, but with similar complexity. It appears that all three learner types. On average, the three groups of children
the expectation of the INMIS complexity index is that there were missing most of the story in their retells. There were
should be more evidence of complex narrative language almost 400 words in the source story (Appendix). TNW was
within the longer narratives. 137.9 for the good responders, 60.0 for the early resolvers, and
Macrostructure. The narratives were also evaluated for 77.2 for the poor responders. This was still only approximately
macrostructural episodic organization. Good responders one-third of the source story vocabulary. Appendix J1 of
provided a mean of 1.0 complete episodes and 18 episodic
elements, compared to <0.25 complete episodes and 1
Two versions of SNAP were published in 1998. One version contains
<12 episodic elements for the other two groups. Only the 198 pages and the other contains 206 pages with Appendix J.
216 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
group’s on-track status for code-based skills overshadows differences between TD children and children with LI or
their current linguistic needs. The risk for future difficul- reading impairment, it provides relatively low power for a
ties appears high for this group due to the absence of a three-way comparison among more subtly different groups
“language-track” RTI model (Ukrainetz, 2006a). Their of children. This would have been difficult to remedy; an
weaknesses will not be identified until reading and academic entire school district was needed to locate this number of
performances are affected sufficiently for a special education children. Only 10.6% of the children in the district qualified
referral. The nonconsideration of linguistic skills puts this as being at risk for reading difficulties and moved into
group of children into a “wait to fail” model. Tier II instruction. A large-scale multiyear evaluation of
The central implication of this study is that RTI models schoolwide universal screening showed 15% of children
should consider language in addition to decoding. A sub- advancing to Tier II instruction (VanDerHeyden, Witt, &
group of at-risk readers, the early resolvers, met the bench- Gilbertson, 2007). The lower percentage of Tier II chil-
mark criterion for accurately and fluently decoding connected dren in this district contrasts with a report suggesting the
text. If narrative performance had been used to determine opposite: that children are being overidentified with the
responsiveness, then these children may have continued to high expectations of the current DIBELS (Catts, Petscher,
be identified as at risk and may have been provided supple- Schatschneider, Bridges, & Mendoza, 2008). The reasons
mentary support. These findings suggest that a sensitive for the lower number identified may be due to the strength
measure of language performance should be added to evalu- of the core curriculum provided within the school district,
ations of early risk for reading difficulties. but no matter the reason, this did result in an undesirably
small sample.
The INMIS scores obtained in this study were not com-
Study Limitations and Future Directions
pared to those of Justice et al. (2006) because of the differ-
There were several limitations of this study. A major ences in the narrative elicitation procedures. Age-referenced
limitation was in the use of extant data, which limited the language sample information is often used without refer-
study design choices. A comparison group of typical readers ence to elicitation context (e.g., SALT reports separately for
from the same district would have been more informative conversation and narrative but collapses across data col-
and procedurally robust than the norm-referenced scores and lected using diverse procedures). However, elicitation proce-
SNAP age reference information. dures do have an impact on narrative language. Merritt and
Language performance following intervention would Liles (1987) found that children with and without language
have been very helpful in understanding the implications of disorders produced narratives of a similar length with story
the narrative language patterns obtained and the effects of generation tasks but were differentiated by length for a story
code-based intervention on language performance. It is pos- retell task. In regard to macrostructure, Schneider and Dube
sible that the therapeutic attention that the children received (2005) found that children produced more story grammar
during the intervention, even though it did not explicitly units for a story retell task than a story generation task sup-
target narrative skills, may have indirectly and positively ported by pictures. Thus, although the INMIS was helpful
impacted their language skills. The effects of active controls in making within-study comparisons, it could not be used
in recent studies suggest the power of repeated opportunities across different elicitation procedures. To better understand
for motivated and attentive language interactions (Gillam and validate the INMIS as a clinical tool, studies should
et al., 2008; Ukrainetz, Ross, & Harm, 2009). Another con- use the same narrative elicitation contexts or ones that sys-
sideration is that the type of intervention provided may have tematically vary from the original.
a differential impact on narrative language performance. Finally, investigators need to be clear on the segmentation
If a second narrative retell had been elicited after the inter- unit and rules used. T-units and C-units were developed more
vention was completed, then one would have been able to than 40 years ago by Hunt (1965) for analyzing writing
investigate the impact of a fluency-based intervention that and by Loban (1963) for analyzing spoken language. T-units
uses a complete discourse unit versus an accuracy-based inter- are limited to grammatically complete sentences, whereas
vention that mostly uses single word and sentence level units. C-units additionally include ellipses and fragments. To ana-
The use of a norm-referenced measure of narrative abil- lyze by T-units, many utterances (e.g., A boy, a dog, and a
ity such as the TNL may have been more informative in frog; Yeah; Me too; Maybe she was; The end ) must be omit-
regard to the performance of the children in comparison to ted from the transcript. Confusion between segmentation
a peer group. The TNL requires full narrative productions units will have a significant effect on conversation, with its
from children, so it might have revealed these subtle group heavy use of elliptical utterances, with less effect on mono-
differences more than the TOLD–P:3, but that possibility logic narration (Allen, Ukrainetz, & Petersen, 2010). How-
remains speculative. At the time of the data collection, the ever, even for narrative studies, if inconsistent application
TNL had not been published. occurs, such as if the SALT program is used to automatically
A significant concern involved the sample size. Although count all of the transcribed utterances (i.e., C-units) for “total
a sample of 32 participants may be sufficient to detect number of utterances” and hand coding is used to count
218 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
and normally achieving children. Child Development, 55, Ihnot, C. (1999). Read naturally. Saint Paul, MN: Read Naturally.
1727–1736. Individuals with Disabilities Education Improvement Act of 2004,
Fey, M. E., Catts, H. W., Proctor-Williams, K., Tomblin, J. B., Pub. L. No. 108-446, 118 Stat. 2647 (2005).
& Zhang, X. (2004). Oral and written story composition skills Justice, L. M., Bowles, R. P., Kaderavek, J. N., Ukrainetz,
of children with language impairment. Journal of Speech, T. A., Eisenberg, S. L., & Gillam, R. B. (2006). The index
Language, and Hearing Research, 47, 1301–1318. of narrative microstructure: A clinical tool for analyzing school-
Fuchs, D., & Fuchs, L. S. (2006). Introduction to response to age children’s narrative performance. American Journal of
intervention: What, why and how valid is it? New Directions Speech-Language Pathology, 15, 177–191.
in Research, 41(1), 93–99. Justice, L. M., & Ezell, H. K. (2002). The syntax handbook:
Fuchs, D., Fuchs, L. S., & Compton, D. L. (2004). Identifying Everything you learned about syntaxI but forgot. Austin, TX:
reading disability by responsiveness-to-instruction: Specifying Pro-Ed.
measures and criteria. Learning Disability Quarterly, 27, Kamhi, A. G. (2009). The case for the narrow view of reading.
216–227. Language, Speech, and Hearing Services in Schools, 40, 174–177.
Fuchs, L. S. (2003). Assessing intervention responsiveness: Con- Leonard, L. B. (1998). Children with specific language impair-
ceptual and technical issues. Learning Disabilities Research and ment. Cambridge, MA: MIT Press.
Practice, 18, 172–186.
Liles, B. Z. (1985). Cohesion in the narratives of normal and
Gersten, R., Compton, D., Connor, C. M., Dimino, J., Santoro, language-disordered children. Journal of Speech and Hearing
L., Linan-Thompson, S., & Tilly, W. D. (2008). Assisting Research, 28, 123–133.
students struggling with reading: Response to intervention and
Liles, B. Z., Duffy, R. J., Merritt, D. D., & Purcell, S. L.
multi-tier intervention for reading in the primary grades. A
(1995). Measurement of narrative discourse ability in children
practice guide (NCEE 2009-4045). Washington, DC: National
with language disorders. Journal of Speech and Hearing
Center for Education Evaluation and Regional Assistance.
Research, 38, 415–425.
Retrieved from http://ied.ed.gov/ncee/wwc/publications/
practiceguides/. Loban, W. (1963). The language of elementary school children.
Champaign, IL: National Council of Teachers of English.
Gillam, R. B., & Johnston, J. R. (1992). Spoken and written
language relationships in language/learning-impaired and nor- Loban, W. (1976). Language development: Kindergarten through
mally achieving school-age children. Journal of Speech and grade twelve. Urbana, IL: National Council of Teachers of
Hearing Research, 35, 1303–1315. English.
Gillam, R. B., Loeb, D. F., Hoffman, L. M., Bohbman, T., Mandler, J. M., & Johnson, N. S. (1977). Remembrance of
Champlin, C. A., Thibodeau, L., . . . Friel-Patti, S. (2008). things parsed: Story structure and recall. Cognitive Psychology,
The efficacy of Fast ForWord language intervention in school- 9, 111–151.
age children with language impairment: A randomized controlled Marston, D. (2005). Tiers of intervention in responsiveness to
trial. Journal of Speech, Language, and Hearing Research, 51, intervention: Prevention outcomes and learning disabilities iden-
97–119. tification patterns. Journal of Learning Disabilities, 38, 539–544.
Gillam, R. B., & Pearson, N. (2004). Test of Narrative Language. Mathes, P. G., Torgesen, J. K., Menchetti, J., Wahl, M., &
Austin, TX: Pro-Ed. Grek, M. (2001). Proactive beginning reading. Unpublished
Goldman, S. R., & Varnhagen, C. K. (1986). Memory for document.
embedded and sequential story structures. Journal of Memory Mayer, M. (1972). A boy, a dog, and a frog. New York, NY: Dial
and Language, 25, 401–418. Books.
Good, R. H., & Kaminski, R. A. (Eds.). (1996). Dynamic Mellard, D. F., McKnight, M., & Woods, K. (2009). Response to
Indicators of Basic Early Literacy Skills (4th ed.). Eugene, OR: intervention screening and progress-monitoring practices in 41 local
Institute for the Development of Educational Achievement. schools. Learning Disabilities Research & Practice, 24, 186–195.
Gough, P. B., & Tunmer, W. E. (1986). Decoding, reading, and Merritt, D. D., & Liles, B. Z. (1987). Story grammar ability in
reading disability. Remedial and Special Education, 7, 6–10. children with and without language disorder: Story generation,
Hoffman, L. M. (2009a). Narrative language intervention inten- story retelling, and story comprehension. Journal of Speech and
sity and dosage: Telling the whole story. Topics in Language Hearing Research, 30, 539–552.
Disorders, 29, 329–343. Miller, J. F., & Chapman, R. S. (2004). Systematic Analysis
Hoffman, L. M. (2009b). The utility of school-age narrative of Language Transcripts (Version 8) [Computer software].
microstructure indices: INMIS and the proportion of restricted Madison: University of Wisconsin–Madison, Waisman Center,
utterances. Language, Speech, and Hearing Services in Schools, Language Analysis Laboratory.
40, 365–375. Nation, K., Clarke, P., Marshall, C., & Durand, M. (2004).
Hughes, D., McGillivray, L., & Schmidek, M. (1997). Guide to Hidden language impairments in children: Parallels between poor
narrative language: Procedures for assessment. Eau Claire, WI: reading comprehension and specific language impairment? Journal
Thinking Publications. of Speech, Language, and Hearing Research, 47, 199–211.
Hunt, K. W. (1965). Grammatical structure written at three grade National Institute of Child Health and Human Development
levels. Champaign, IL: National Council of Teachers of English. National Reading Panel. (2000). Teaching children to read:
220 LANGUAGE, SPEECH, AND HEARING SERVICES IN SCHOOLS • Vol. 43 • 205–221 • April 2012
APPENDIX. OCCURRENCES OF MACROSTRUCTURAL AND
MICROSTRUCTURAL FEATURES IN THE SOURCE STORY
Feature Value
TNW 396.00
NDW 150.00
LENGTH 44.00
MLC 9.00
COORD 5.00
SUBORD 6.00
COMPLEX 20.00
PROPCOMPLEX .45
Complete episodes 5.00
Episodic elements 54.00
Note. TNW = total number of words; NDW = number of different words; LENGTH =
number of C-units; MLC = mean length of utterance in words; COORD = number
of coordinating conjunctions; SUBORD = number of subordinating conjunctions;
COMPLEX = number of C-units containing at least one dependent clause;
PROPCOMPLEX = COMPLEX/LENGTH; Complete episodes = number of
complete episodes; Episodic elements = number of episodic elements.