Professional Documents
Culture Documents
PsychAss Reviewer
PsychAss Reviewer
Measure of Variability
• Variability – how scores in a distribution
are scattered or dispersed. - An individual score may, for
• Two or more distributions of test scores example, fall AT THE THIRD
can have the same mean even though QUARTILE or IN THE THIRD
differences in the dispersion of scores QUARTER (but not “in” the third
around the mean can be wide. quartile or “at the third quarter)
- Symmetrical distribution; Q1 and
The range
Q3 have equal distance from the
- Equal to the difference between the median.
highest and lowest scores. ➢ Skewness; is the lack of
- Simplest measure but potential use is symmetry.
limited.
The average deviation
➢ One extreme score can radically alter
the value of the range. - Absolute value of the deviation
➢ Resulting description of variations may score, ignoring the positive or
be understated or overstated. negative sign and treating all
- Provides a quick but gross description of deviation scores as positive.
the spread of scores. - All deviation score are then
summed and divided by the total
The interquartile and semi-quartile ranges
number of scores.
- Distributions of test scores can be
The standard deviation
divided into four parts such that
25% of the test scores occur in each - A measure of variability equal to the
quarter. square root of the average squared
- Equal to the difference between Q3 deviations about the mean.
and Q1. Like the media, it is an - Square root of the variance Commented [AD1]: First: find the mean
ORDINAL STATISTICS. ➢ Variance – equal to the arithmetic 2nd: minus each score to the mean (deviation)
3rd: square every deviation
- Q1, Q2 (median), Q3. mean of the squares of the Variance: add all the squared deviation then divided by it to
difference between the score in a the total number of score.
distribution.
SD: squared the variance
∑(𝑥−𝜇)2 - In theory, the distribution of the normal
➢ Formula: 𝜎 2 =
𝑁
curve ranges from negative infinity to
▪ When squared = SD.
positive infinity.
Skewness
The area under the normal curve
- Symmetry is absent.
- 50% of the scores occur above the mean
- Positive skew; most of the scores fall under
and 50% of the scores occur below the
the low end.
mean.
➢ May indicate test was too difficult.
- Approximately 34% of all scores occur
- Negative skew; most of the scores fall
between the mean and 1 SD below and/or
under the high end.
above the mean.
➢ May indicate test was too easy.
- Approximately 68% of all scores occur
Kurtosis between the mean and ±1 SD.
- Approximately 95% of all scores occur
- Steepness of a distribution in its center.
between the mean and ±2 SD.
➢ Platykurtic – relatively flat.
- Approximately 99% of all scores occur
➢ Leptokurtic – relatively peaked
between the mean and ±3 SD.
➢ Mesokurtic – somewhere in the
middle.
- Development:
➢ 18th century: Abraham DeMoivre - Has two tails
➢ Marquis de Laplace - General rule: the larger the sample size
➢ 19th century: Karl Friedrich Gauss and the wider the range of abilities
➢ “Laplace-Gaussian Curve” measured by a particular test, the more the
➢ Karl Pearson “Normal Curve” to be graph of the test scores will approximate
more inclusive to those who the normal curve.
contributed. - In terms of mental ability as
- Theoretically, the normal curve is a bell- operationalized by tests of intelligence,
shapes, smooth, mathematically defined performance that is approximately 2 SD
curve that is highest at its center. (Ideal from the mean (i.e., IQ of 70-75 or lower or
curve.) IQ of 125-130 or higher) is one key element
- From the center it tapers on both sides in identification.
approaching the X-axis asymptotically (it ➢ IQ tests: mean= 100, SD=15
approaches, but never touches, the axis.) ➢ Success at life’s tasks, or its absence,
also plays a defining role, but the
primary classifying feature of both
gifted and retarded groups is intellectual - Score: equal to the mean, Z-score is
deviance. These individuals are out of 0.
sync with more average people, simply o Greater than the mean, Z-
by their difference from what is score is positive.
expected for their age and o Less than the mean, Z-score
circumstances. is negative.
Describing Distributions
Concepts (2)
• Norm-referenced test
- Compares each person with a norm.
These tests do not compare Regression – related technique (to correlation).
students with one another; they Used to make predictions about scores on one
compare each student’s variable from knowledge of scores on another
performance with a criterion or variable.
expected level of performance.
• Criterion-referenced test - Regression line – where the
predictions are obtained from.
➢ Defined as the bestfitting - Use for finding the association
straight line through a set of between two sets of ranks.
points in a scatter diagram. - Easy to calculate and is often used
➢ It is found by using the when the individual in a sample can
principle of least squares, be ranked on two variables but their
which minimized the actual scores are not known or have
squared deviation around a normal distribution.
the regression line. - Popular method for correlating
▪ Mean – point of unvalidated survey, instruments or
least squares for Likert-type survey responses.
any single variable. - NON-PARAMETRIC MEASURE.
This means that the • Biserial correlation
sum of the squared - Relationship between a continuous
deviations around variable and an artificial Commented [AD5]: Variables that can are typically
the mean will be dichotomous variable. obtained by measuring. Variables that can assume any value
within a special range. Ex. Ratio and interval variables.
less than it is - For example: passing or failing the
(Weight & Temperature)
around any value bar examination (artificial
other than the dichotomous variable) and grade
mean. point average in law school
(continuous variable)
Pearson product moment correlation
• Point biserial correlation
coefficient
- Continuous variable and true
- Ratio used to determine the degree dichotomous variable (gender).
of variation in one variable that can - For example: relationship between
be estimated from knowledge gender and GPA.
about variation in the other • Phi coefficient
variable. - Both variables are dichotomous
variables and at least one is “true”.
- For example: relationship between
passing or failing the bar
examination and gender. Commented [AD4]: Main difference between Spearman’s
• Tetrachoric correlation rho and Pearson r
- Both dichotomous variables are Spearman’s rho
artificial. Non-parametric measure.
Evaluates monotonic relationship (one V decreases as the
Terms and Issues in the Use of Correlation other V increases or one V increases as the other V
- The correlation coefficient can take decreases.) [it can be positive (linear correlation) or
• Residual negative correlation]
on any value from -1.0 to 1.0. Evaluates the strength and direction of the correlation.
- The difference between the
- Commonly used; most often we -1 (perfect negative correlation), 1 (perfect positive
predicted and observed values. correlation)
want to find the correlation
- Y-Y’ (R= actual y value – predicted y
between two continuous variables. Pearson r
value)
(height, weight, intelligence.) Parametric measure
➢ Regression equation gives a Assess linear correlation.
- PARAMETRIC MEASURE (PEARSON
predicted value of Y for Assess the strength and direction of the correlation.
R) 1 (positive correlation), -1 (negative linear correlation), 0
each value of X.
• Spearman’s rho (no correlation)
➢ In addition to these - A high coefficient of alienation
predicted values, there are indicates that the two variables
observed values of Y. share very little variance in
- Interpretation: Having a negative common. (non-association)
residual means that the predicted • Shrinkage
value is too high, similarly if you - Amount of decrease observed
have a positive residual, it means when a regression equation is
that the predicted value was too created for one population and then
low. applied to another.
• Standard error of estimate - one problem with regression
- Measure of the accuracy of analysis is that it takes advantage of
prediction. chance relationships within a
- Prediction is most accurate when particular sample of subjects.
the standard of error of estimate is ➢ There is a tendency to
relatively small. overestimate the
- As it become larger, the prediction relationship, particularly if
becomes less accurate. the sample of subjects is
• Coefficient of determination small.
- Correlation coefficient squared. • Cross validation
- This value tells us the proportion of - Best way to ensure that proper
the total variation in scores on Y references are being made.
that we know as a function of ➢ Use the regression equation
information about X. to predict performance in a
- For example, correlation between group of subjects other
SAT score and performance of 1st than to which the equation
year college is .40, then the was applied.
coefficient of determination is .16. ➢ Then a standard error of
➢ .40² = .16. estimate can be obtained
➢ This means that we can for the relationship
explain 16% of the between the values
variation in 1st year college predicted by the equation
performance by knowing and the values actually
SAT scores. observed.
• Coefficient of alienation • Multivariate analysis
- Measure of non-association - Broad term that refers to the
between two variables. analysis of multiple variables
- Computation: subtracting simultaneously.
coefficient of determination to 1. - Multiple variables, can include both
√1 − 𝑟
2 dependent and independent
- This is the proportion of common variables.
variance not shared between the - Considers the relationship among
variables, the unexplained variance combinations of three or more
between the variables. variables.
- For example: the prediction of - Estimate the extent to which an
success in the 1st year of college observed test score deviates from a
from the linear combination of SAT true score.
verbal and math scores. - An INDEX of the extent to which
• Discriminant analysis (multivariate one individual’s scores vary over Commented [AD6]: For example:
method) tests presumed to be parallel. Test battery includes: personality and intelligence test. (IV;
predictor)
- When the task is to find the linear - Formula:
DV: successful and non-successful.
combination of variables that ➢ SD divided by 1 minus the
provides a maximum discrimination reliability coefficient. If there is a small difference between the scores of successful
group and non-successful group in intelligence test, then
between categories. - SEM functions like a SD = predict
intelligence would have a discriminant evidence. This is
- For example: attempts to determine what would happen if an individual because the IQ of the non-successful and successful group
whether set of measures predicts took additional equivalent test does not significantly differs with each other, thus IQ does
not predict if an individual will become successful or not in
success or failure on a particular ➢ 68% = +- 1 SD
their job.
performance evaluation. ➢ 95% = +- 2 SD
• Multiple regression analysis ➢ 99% = +-3 SD
(Multivariate method) ➢ If SD is constant = the
- Specifically deals with multiple smaller the SEM,
independent variables influencing correlation coefficient
a single dependent variable. It increases; more reliable.
quantifies the extent to which each - useful in establishing what is called
independent variable contributes to a confidence interval: a range or
the variability in the dependent band of test scores that is likely to
variable. contain the true score.
Discriminant analysis and multiple
Standard error of the difference
regression analysis find linear
combinations of variables that - aid a test user in determining how
maximize the prediction of some large a difference should be before
criterion. it is considered statistically
Factor analysis is used to study the significant.
interrelationships among a set of - Address this question
variables without reference to a ➢ How did this individual’s
criterion. performance on test 1
Dependent Variable: Also known as the compare with his or her
"criterion" or "outcome" variable. performance on test 2?
Independent Variable: Also known as ➢ How did this individual’s
the "predictor" or "explanatory" performance on test 1
variable. compare with someone
else’s performance on test
Standard error of measurement/ standard
1?
error of score
➢ How did this individual’s
- Measure of the precision of an performance on test 1
observed test score. compare with someone
else’s performance on test
2?
- Formula: appropriate for test-retest
➢ Square root of the sum of evaluation (Rorschach Inkblot Test).
SEM of first test and SEM • Carryover effect
of second test. - Occurs when the 1st session
➢ Or influences scores from the 2nd
➢ square root of the 2 minus session.
reliability coefficient of 1st - When there are carryover effects,
test minus the reliability the test-retest correlation usually
coefficient of 2nd test. overestimates the true reliability.
• Practice effects
Reliability - Type of carryover effects.
• Reliability coefficient - Some skills improve with practice.
- the ratio of the variance of the true • Parallel forms reliability
scores on a test to the variance of - Compares two equivalent forms of a
the observed scores. test that measure the same
attribute.
- Use different items but have the
same level of difficulty.
• Test reliability • Split-half reliability
- Usually estimated in one of three - Estimates the internal consistency.
ways. - Test is given and divided into halves
1. Test-retest method that are scored separately.
➢ Consistency: when - Results of one half is compared with
administered on different the results of the other.
occasions. - If the items get progressively more
2. Parallel forms method difficult, then you might be better
➢ We evaluate the test across advised to use the odd-even
different forms of the test. system.
3. Internal consistency method
➢ We examine how people Test scores gain reliability as the number of
perform on similar subsets items increases.
of items selected from the
An estimate of reliability based on two half-tests
same form of the measure.
would be deflated because each half would be
• Test-retest reliability less reliable than the whole test. Commented [AD7]: Reliability of the WHOLE TEST is much
- Estimates are used to evaluate the higher than the two halves, since whole test has more items.
error associated with administering The correlation between the two halves of the
a test at two different times. test would be a reasonable estimate of the
- This type of analysis is of value only reliability of half test.
when we measure traits or
• Spearman-Brown (Correction) Formula
characteristics that do not change
- To correct for half-length test.
over time.
- Allows you to estimate what the
- Test that measures some constantly
correlation between the 2 halves
changing characteristics are not
would have been if each half had
been the length of the whole test.
- Estimate: Correlation between 2 - Measure of agreement between 2
halves = whole test. judges who each rate a set of
- Using it is not always advisable.
• Cronbach’s coefficient alpha (a) [1951] Commented [AD8]: used when two halves of the test
- When the two halves of a test have have unequal variances and on tests containing non-
dichotomous items, unequal variances
unequal variances.
- This general reliability coefficient
provides the lowest estimate of
reliability that one can expect. Commented [AD9]: For the researchers to know if a
- Estimates the internal consistency. certain instrument is reliable enough or not. It is important
to know how much an instrument is NOT reliable for the
➢ Items are not scored as 0 or objects using NOMINAL SCALES. researchers to do various methods to increase the reliability.
1 (right or wrong). - Best method for assessing the level Much better to know how UNRELIABLE THE INSTRUMENT IS
➢ More general method off of agreement among several rather than OVERESTIMATE ITS RELIABILITY.
finding reliability estimate observers.
through internal - Values:
consistency. ➢ 1 (perfect agreement)
➢ -1 (less agreement that can
be expected on the basis of
of chance alone.)
➢ A value greater than .75
➢ N = number of items.
generally indicates
➢ S = sum of the total scores
“excellent” agreement.
for all items.
➢ A value between .40 and
➢ Si = the sum of the item
.75 indicates “fair to good”
scores for the each item.
(satisfactory) agreement.
➢ A value less than .40
indicates “poor”
agreement.
Wechsler intelligence scale for children - IV wechsler preschool and primary scale of
(WISC-IV) intelligence-iii (WPPSI-III)
- The WISC-IV is a widely used intelligence test - Age Range: The WPPSI-III is designed to assess
designed to assess cognitive abilities in children the intelligence and cognitive abilities of
aged 6 to 16 years old. It measures various children aged 2 years 6 months to 7 years 3
cognitive domains, including verbal months.
comprehension, perceptual reasoning, working
memory, and processing speed.
Purpose: The primary purpose of the WPPSI-III
is to assess cognitive functioning in young
Use: The WISC-IV is used for a variety of children, including their intellectual abilities,
purposes, including educational assessment, strengths, and weaknesses.
clinical evaluation of intellectual abilities,
identification of learning disabilities or
developmental disorders, and research. Use: The WPPSI-III is used for a variety of
purposes, including educational assessment,
clinical evaluation of intellectual abilities,
SB-5 vs WISC-IV identification of learning disabilities or
developmental disorders, and research.
1. both test published in 2003, individual
administration, 1 hour, full scale IQ composite
score based on 10 subtests. Parent and Teacher Rating Scales: In addition to
2. WISC-IV, have 5 supplemental tests (30mins the direct assessment of the child's cognitive
administration), SB-5 have none. abilities, the WPPSI-III includes parent and
teacher rating scales to gather information
about the child's behavior, social skills, and
adaptive functioning in everyday settings.