Professional Documents
Culture Documents
Assessment Reviews
Assessment Reviews
Assessment Reviews
SPED 5140
Assessment Reviews
Assessment Reviews
language ability, and achievement. The following factors were considered: error
of measurement, reliability coefficients, test reliabilities, cluster reliabilities, and
alternate-forms equivalence. The extensive research suggests the WJ IV is
reliable.
o Validity: This is whether the test measures what it purports to measure. The
Technical Manual provides evidence of validity consistent with the Standards for
Educational and Psychological Testing. It provides substantive, internal, and
external validity evidence. In addition, it acknowledges the benefits of the Rasch
model. The extensive research suggests the WJ IV is valid.
Assessment Reviews
o Test 5: Calculation
This test measures the ability to perform mathematical computations.
First, the examinee is asked to write single numbers. Then, the examinee
uses addition, subtraction, multiplication, and division to solve equations.
o Test 6: Writing Samples
This test measures the ability to write a response in response to several
different demands. Spelling and punctuation are not penalized. The
sentences are evaluated for quality of expression.
o Test 7: Word Attack
This test measures the ability to use phonic and structural analysis to
pronounce unfamiliar printed words. First, the examinee must produce the
sounds for single letters. Then, the examinee must read aloud nonsense
words that increase in difficulty.
o Test 8: Oral Reading
This test measures story reading accuracy and prosody. The examinee
must read sentences aloud that increase in difficulty. Accuracy and
fluency are scored.
o Test 9: Sentence Reading Fluency
This test measures reading rate via silently reading simple sentences and
circling yes or no in the Response Booklet. The examinee has 3 minutes to
complete as many as possible.
o Test 10: Math Facts Fluency
This test measures the speed of solving a mixture of simple addition,
subtraction, and multiplication facts. The examinee has 3 minutes to solve
as many math facts as possible.
o Test 11: Sentence Writing Fluency
This test measures the ability to quickly formulate and write simple
sentences. The sentences are supposed to relate to the picture and use all
three words given.
o Test 12: Reading Retell
This test measures reading comprehension. The examinee silently reads a
short story and retells as much information as he or she can remember.
There is a 5 minute time limit on this test.
o Test 13: Number Matrices
This test measures quantitative reasoning. The examinee must identify the
missing number in a matrix.
o Test 14: Editing
This test measures the ability to identify and correct errors in a written
passage. Possible errors to identify include incorrect punctuation,
capitalization, spelling, and word usage.
o Test 15: Word Reading Fluency
Madeline McCormick
SPED 5140
Assessment Reviews
This test measures vocabulary knowledge and semantic fluency. The
examinee is given 3 minutes to pick the two words that go together in each
row. There are two words in each row that are related somehow
(potentially synonyms or antonyms).
Assessment Reviews
o Raw Score: The number of correct responses each receive 1 raw score point
(except in Test 6, 8, and 12 because higher scores are possible)
o W Score: The raw scores are converted using the online scoring program into W
scores, which are adaptations of the Rasch ability scale.
o Grade Equivalent (GE): The grade equivalent reflects performance in regards to
the grade level from the norming sample that has the same average score as the
examinee
o Age Equivalent (AE): The age equivalent reflects performance in regards to the
age level from the norming sample that has the same average score as the
examinee
o W Difference Score: This is the difference between the examinee’s test score and
the average test score from the norming sample (same age or grade)
o Relative Proficiency Index (RPI): This allows predictions to be made about
quality of performance on similar tasks to the ones tested.
o Instructional Zone: The instructional zone is a range along a developmental scale
that targets the examinee’s current level of functioning from their independent
level to frustration level (easy to difficult).
o CALP Levels: This is an optional part of the report that describes the examinee’s
cognitive-academic language proficiency (CALP) which helps indicate their
language proficiency in English.
o Percentile Rank: This describes performance on a scale of 1-99 in comparison to a
portion of the norming sample at a specific age or grade level. The percentile rank
indicates the percentage that scored the same or lower than the examinee.
o Standard Score: This is based on a mean of 100 and a standard deviation of 15.
o Standard Error of Measurement (SEM): The quantity of error that is inherent in a
score is the SEM and it gives professionals an idea of how much confidence to
have in the score.
Assessment Reviews
o Strengths: A thorough amount of research was conducted to prove the WJ IV to
be reliable and valid, which should make the examiner confident when using it. In
addition, the multitude of tests and skills assessed can provide a plethora of
information about a student.
o Weaknesses: This is a lengthy test. I know the Special Education Teacher at our
school has had difficulty fitting the administration of this test into her busy
schedule. In addition, I’ve seen a first grader struggle with the length of the test
(so there is a concern about testing fatigue).
Assessment Reviews
reliability coefficients must be around .80 or above (higher scores are more
desirable). The tables in the Examiner’s Manual show the GORT-5 consistently
scores at a high degree of reliability in all the areas studied. This suggests the
GORT-5 is reliable with little test error.
o Validity: This is whether the test measures what it purports to measure.
Content-Description Validity: There is an explanation of the rationale for
all of the components of the GORT in the Examiner’s Manual.
Criterion-Prediction Validity: The scaled scores were checked against the
index scores of five criterion measures of reading ability to check if they
were strongly related.
Construct-Identification Validity: A three-step process was used to
determine this type of validity:
Identify constructs thought to be responsible for test performance
Develop hypotheses based on the identified constructs
Verify the hypotheses
All of the results suggest the GORT-5 is valid.
Assessment Reviews
Section 2 Record of GORT-5 Scores
The rate, accuracy, fluency, and comprehension score from each
story are listed
Section 3 Performance Summary
The raw scores from section 2 are converted into age equivalent,
grade equivalent, percentile rank, and scaled score using the tables
in the Appendix
Section 4 Descriptive Terms Corresponding to Scaled and Index Scores
This shows the description terms (from very poor to very superior)
associated with the different ranges of scaled scores
Section 5 Worksheet for Recording GORT-5 Miscues (Optional)
Space is provided to do a miscue analysis of the specific types of
errors made while reading aloud. Categorized miscues include:
Meaning similarity
Function similarity
Graphic/Phonemic similarity
Multiple Sources
Self-Correction
Section 6 Summary of Other Reading Behaviors (Optional)
Space is provided to note the following types of deviations from
print analysis:
Substitutions
Omissions
Mispronunciations
Additions
Reversals
Hesitations
Space is provided to note the following observations:
Posture
Word-by-word reading
Poor enunciation
Disregard of punctuation
Head movement
Finger pointing
Loss of place
Nervousness
Poor attitude
Other
Section 7 Prosody (Optional)
Space is provided to rank the following categories:
Expression
Volume
Phrasing
Smoothness
Pacing
Madeline McCormick
SPED 5140
Assessment Reviews
Section 8 Record of Performance
This consists of the stories, comprehension questions, and
conversion table to mark how the student is performing
There is also administration and scoring instructions
Reminders about entry points, basals, and ceilings
Assessment Reviews
Scaled Scores: Norms for Rate, Accuracy, Fluency, and Comprehension
are reported as scaled scores that have a mean of 10 and a standard
deviation of 3.
o The Oral Reading Index (ORI) is given as a standard score and percentile rank.
ORI is a composite score made by combining the sum of the Fluency
(Rate and Accuracy) and Comprehension scaled scores to get a mean of
100 and a standard deviation of 15. The ORI is the most reliable score on
the GORT-5.