Professional Documents
Culture Documents
PA Reviewer
PA Reviewer
•
Psychological Testing is the process of measuring
psychology-related variables by means of devices or
procedures designed to obtain a sample of behavior
Assessment vs. Test(Cohen, Swerdlik, & Sturman 2013)
Assessment Testing
Objective To answer referral question, solve a To obtain some gauge, usually numerical
problem, or arrive at a decision in nature, with regard to the ability or
through the use of tools of evaluations. attribute.
Role of The assessor is key to the process of The tester is not key to the process;
Evaluator selecting tests and/or other tools of practically speaking, one tester may be
evaluation as well as drawing substituted for another tester without
conclusion from the entire evaluation appreciably affecting the evaluation
PSYCHOMETRIC APPROACH
–
American in origin
–
Gives numerical estimates of single aspects of performance.
–
It rests on E. Thorndike’s belief “if a thing exists in some amount, and if it exists in some
amount, it can be measured”.
–
Definite and structured
IMPRESSIONISTIC APPROACH
–
German in origin
–
Leads to a comprehensive, descriptive picture of the individual.
–
It looks for significant cues to understanding an individual’s dynamics by all available means
and integrates them into a total picture
–
Gives minimal consideration to “how much” of some characteristic is present
–
It seeks “wholeness” or unity and relies on observation, descriptive data and self-report.
ASSUMPTIONS ABOUT PSYCHOLOGICAL
TESTING AND ASSESSMENT
1. Psychological traits and states exist
States – also distinguish one person from another but are relatively less
enduring (Chaplin et al. 1988)
and measured
Many test developers and researchers, much like all other people, have
different ways of looking and defining the same phenomenon
behaviors
4. Tests and other measurement techniques have
process
unbiased manner
Some of the test fairness related problem are more political than
psychometric
Types of Assessment
Formal vs. Informal Assessment
•
Formal assessments are preplanned, systematic attempts by the teacher to
ascertain what students have learned. The majority of assessments in
educational settings are formal. Typically, formal assessments are used in
combination with goals and objectives set forth at the beginning of a lesson
or the school year. Formal assessments are also different from informal
assessments in that test takers can prepare ahead of time for them.
•
Informal assessments are those assessments that result from spontaneous
day-to-day observations of how an individual behave and perform a
particular task. When informal assessments are conducted, they don't
necessarily have a specific agenda in mind, but are more likely to learn
different things about the person as they proceed through daily activities
naturally. These types of assessments offer important insight into a
person‘s misconceptions and abilities (or inabilities) that might not be
represented accurately through other formal assessments.
Types of Assessment
Paper-Pencil vs. Performance-Based
Assessment. These types of assessment
procedures are sub-divided into the
following:
•
Verbal vs. nonverbal. Verbal tests are not
necessarily spoken but may be written and non-
verbal is involves the ability to understand and
analyses visual information.
•
Speed vs. Power. Only speed of response is
measured by the speed test while power test is
designed to measure the knowledge of the test
taker, regardless of his or her speed of
performance.
Types of Assessment
Paper-Pencil vs. Performance-Based
Assessment. These types of assessment
procedures are sub-divided into the following:
•
Individual vs. Group. A test can be said individual test in the
sense that they can be administered to only one person at a
time while group test involves many people.
•
Objective vs. Subjective. Objective tests are psychological
tests that measure an individual's characteristics in a way that
is independent of rater bias or the individual's own beliefs.
Objective tests tend to be more reliable and valid than
subjective tests which are evaluated by giving an opinion.
Subjective tests are more challenging and expensive to
prepare, administer and evaluate correctly
Types of Assessment
Paper-Pencil vs. Performance-Based
Assessment. These types of assessment
procedures are sub-divided into the following:
•
Cognitive vs. Affective. Cognitive tests attempt to measure
mental ability while affective tests are designed to assess
interests, attitudes, and personal values of an individual.
TYPES OF PSYCHOLOGICAL TESTS
Intelligence tests.
•
These are used to measure intelligence, or your ability to understand your
environment, interact with it and learn from it
Personality tests.
•
These are used to measure personality style and traits. Personality tests
are commonly used in research or to assist with clinical diagnoses.
•
Such as the Likert Scale or the Thurstone Scale, are used to measure how an
individual feels about a particular event, place, person or object.
Achievement tests.
•
These are used to measure how well you understand a particular topic (i.e.,
mathematics achievement tests).
– American College Test (ACT)
Aptitude tests.
•
These are used to measure your abilities in a specific area (i.e. clerical skills).
TYPES OF PSYCHOLOGICAL TESTS
Neuropsychological tests.
•
It is use to find out how damage to your brain is affecting your ability to
reason, concentrate, solve problems, or remember.
– Barcelona Neuropsychological Test (BNT)
Vocational tests.
•
It is the process of determining an individual's interests, abilities and
aptitudes and skills to identify vocational strengths, needs and career
potential.
– Career Occupational Preference Survey (COPES)
TYPES OF PSYCHOLOGICAL TESTS
Direct observation tests.
•
It involves the observation of people as they complete activities. This type
of assessment is usually conducted with families in a laboratory, home or
with children in a classroom.
– The Parent-Child Interaction Assessment-II (PCIA)
•
Screening
– quick survey to located individuals who may need or be eligible
for special treatment
•
Diagnosis
– A detailed analysis of an individuals strength and weaknesses
with the general goal of arriving at a classification decision
PURPOSES PSYCHOLOGICAL
ASSESSMENT (Erford, 2013)
•
Treatment Planning and Goal identification
– Determines what specific concern/problem afflicts a person to
properly define a specific plan of action and to identify specific
objectives that are needed to be achieved
•
Progress / Outcome Evaluation
– Ensuring that a given treatment plan is helpful to a client
Limitations of Psychological Test
Uses of Psychological
Scores can’t reveal how or why the
Test in Various individual obtained a certain score
Infers only from a sample of
Settings behavior
Easily affected by extraneous
variables
Educational Setting
Chance error on individual
interpretation of scores
Industrial/Business
SEM = reasonable limits of scores
Setting and yet maintain its reliability
SEdiff = Difference between two
Clinical/Counseling scores for test of significance
SEest = Margin of error expected in
Setting individual’s predicted criterion
STATISTICAL FOUNDATIONS
SCALES OF MEASUREMENT AND THEIR PROPERTIES
2. Frequency Polygon
1. Skewness – the nature and extent to which symmetry is absent which indicate how
the measurements in a distribution are distributed
i. Positive Skew – relatively few of the scores fall at the high end of the distribution
ii. Negative Skew – relatively few of the scores fall at the low end of the distribution
2. Kurtosis– the steepness (peakedness/flatness) of a distribution in its center
i. Platykurtic– relatively flat
ii. Leptokurtic – relatively peaked
iii. Mesokurtic – somewhere in the middle
DESCRIBING DISTRIBUTIONS
3. Normal Distribution – bell shaped distribution
5. Measures of Variability
2. Interquartile Deviation – measure of variability for curves that are not normally
distributed. The partial variance between upper and lower quartiles.
DESCRIBING DISTRIBUTIONS
6. Standard Scores
A. Individual Score
Percentile Score/Rank – percentage of the norm group to which the examinee scored at or below given score.
B. Linear Standard Score – results from the conversion of the raw score into a number indicating how many standard
deviation units the raw score is below or above the mean of the distribution
C. Normalized Standard Scores – involves “stretching” the skewed curve into a shape of a normal curve and creating a
corresponding scale of standard scores
i. Stanine - score based on dividing the normal distribution into nine parts, each part describes a range of percentile scores.
ii. Sten - score based on dividing the normal distribution into ten parts, each part describes a range of percentile scores.
CORRELATIONAL STATISTICS
A statistic that indicates the degree of relationship between any two
sets of scores obtained from the same group of individuals. The degree of
association is computed and measured through correlation coefficient.
+/- 0.0 to 0.19 Very weak, negligible correlation
+/- 0.20 to 0.39 Weak, low correlation
+/- 0.40 to 0.59 Moderate correlation
+/- 0.60 to 0.79 Strong, high correlation
+/- 0.80 to 1.0 Very strong correlation
CORRELATIONAL TECHNIQUES
Statistical Tool Set of Scores Type of Measure Interpretation
Pearson r 2 sets of scores from same respondents Both are Interval or Ratio (Scale data) -1 to +1 = perfect
.91 to .99 = very high
.71 to .90 = high
.41 to .70 = moderate
.21 to .40 = slight/weak
.00 to .21 = no corr.
Spearman’s rho 2 sets of ranking from same Both are ordinal -1 to +1 = perfect
respondents. .91 to .99 = very high
N is not more than 30 .71 to .90 = high
.41 to .70 = moderate
.21 to .40 = slight/weak
.00 to .21 = no corr.
Kendall’s Tau 2 sets of ranking from same Both are ordinal -1 to +1 = perfect
respondents. .91 to .99 = very high
N can be more than 30 .71 to .90 = high
Tau coefficient is smaller than .41 to .70 = moderate
spearman’s rho, when the tools are .21 to .40 = slight/weak
used. .00 to .21 = no corr.
Kendall’s W More than two sets of ranking All are ordinal, ranking from several 0 to 1
judges or raters 0 = no agreement between judges
1 = perfect agreement between judges
Phi Coefficient 2 or more sets of frequencies All are Nominal Data -1 to +1 = perfect
.91 to .99 = very high
.71 to .90 = high
.41 to .70 = moderate
.21 to .40 = slight/weak
.00 to .21 = no corr.
Multiple correlation 3 or more sets of Pearson All are ratio or Interval (Scale Data) Relationship between one variable and a
combination of two other variables.
Parametric Tests
Z-test is any statistical test for which the distribution of the test
statistic under the null hypothesis can be approximated by a
normal distribution. (NOTE: n ≥ 30; single parameter)
Example: Suppose that in a particular geographic region, the mean and standard
deviation of scores on a reading test are 100 points, and 12 points, respectively. Our
interest is in the scores of 55 students in a particular school who received a mean score
of 96. We can ask whether this mean score is significantly lower than the regional mean
— that is, are the students in this school comparable to a simple random sample of 55
students from the region as a whole, or are their scores surprisingly low?
T-test assesses whether the means of two relatively small groups of
normal distributions are statistically different from each other.
Parametric Tests (continued…)
one-way analysis of variance (ANOVA) is used to determine whether there are any
significant differences between the means of three or more independent (unrelated)
groups.
Example: a researcher wishes to know whether different pacing strategies affect the time to complete
a marathon. The researcher randomly assigns a group of volunteers to either a group that (a) starts slow
and then increases their speed, (b) starts fast and slows down or (c) runs at a steady pace throughout. The
time to complete the marathon is the outcome (dependent) variable.
two-way analysis of variance (ANOVA) compares the mean differences between groups
that have been split on two independent variables (called factors) on the dependent
variable.
Example: you may want to determine whether there is an interaction between physical activity level and
gender on blood cholesterol concentration in children, where physical activity (low/moderate/high) and
gender (male/female) are your independent variables, and cholesterol concentration is your dependent
variable.
Nonparametric Tests
Situations in which nonparametric test are used:
•
The data involve measurements on nominal or ordinal
scales. In these situations, you cannot compute the
means and variances that are essential part of the
parametric tests.
•
The data do not satisfy the assumptions underlying
parametric tests.
•
The data have extremely high variance, which can
undermine the likelihood of significance for a parametric
test. In this case, the scores can be converted to
categories or ranks, and a nonparametric test can be
Nonparametric Tests
Chi Square statistic is used to investigate whether distributions of categorical variables
differ from one another.
chi-square test for independence. The test is applied when you have two categorical variables from a
single population. It is used to determine whether there is a significant association between the two
variables.
Example: in an election survey, voters might be classified by gender (male or female) and voting
preference (Democrat, Republican, or Independent). We could use a chi-square test for
independence to determine whether gender is related to voting preference.
chi-square goodness of fit test. The test is applied when you have one categorical variable from a
single population. It is used to determine whether sample data are consistent with a hypothesized
distribution.
Example: suppose a company printed baseball cards. It claimed that 30% of its cards were rookies;
60%, veterans; and 10%, All-Stars. We could gather a random sample of baseball cards and use a
chi-square goodness of fit test to see whether our sample distribution differed significantly from
the distribution claimed by the company.
Nonparametric Tests (Continued…)
Binomial Test is an exact test of the statistical significance of
deviations from a theoretically expected distribution of
observations into two categories. One common use of the
binomial test is in the case where the null hypothesis is that two
categories are equally likely to occur (such as a coin toss).
Sign-test is the alternative test to the Wilcoxon test for dependent
data. One requirement of the Wilcoxon test is that the data needs
to be at least interval scaled. For the sign test the data needs to be
at least ordinal scaled.
Parametric vs. Nonparametric
Parametric Non-parametric
Assumed distribution Normal Any
Assumed variance Homogeneous Any
Typical data Ratio or Interval Ordinal or Nominal
Data set relationships Independent Any
Usual central measure Mean Median
The formula for calculating the standard error of measurement (SEM) is:
SEM= s√ 1- r
Example: Anne took the Graduate Record Examinations Aptitude Test (GRE), an instrument used in selecting and
admitting students in the graduate program. GRE gives three scores:Verbal (GRE-V), Quantitative (GRE-Q) and Analytical
(GRE-A) Scores range from 200 to 800
The reliability coefficient for the GRE-V is .90 (Educational Testing Service, 1997).
STANDARD ERROR OF MEASUREMENT
5. Ensure Validity
VALIDITY
Face Validity – content of the test reflects the materials it Predictive Validity – Test data are used to estimate criterion
is supposed to measure, according to the test takers. scores in the future; predictor and criterion scores are
obtained at different times.
Example: The RPm Board Exam should reflect the materials
Example: High score in RPm Board Exam should predict high
provided in the table of specification. performance in psychometric work
3. Ambiguity in statements
L. Thurstone primary mental abilities.(1938) 7 interrelated Long-Term Storage and Retrieval (Glr): is the
factors namely: verbal comprehension, word fluency, number ability to store information and fluently retrieve it
facility, perceptual speed, memory, space, and reasoning later in the process of thinking.
Jean Piaget (1954) – a kind of evolving biological adaptation to Visual Processing (Gv): is the ability to perceive,
the outside world. analyze, synthesize, and think with visual patterns,
including the ability to store and recall visual
J. P. Guilford’s Structure of intelligence model (1967). Includes
120 unique intellectual factors that are organized around 3 representations.
dimensions: mental operations, content, and products.
THEORIES OF INTELLIGENCE
Francis Galton (1884) – The most intelligent persons are
Cattell-Horn and Carroll – CHC model (1993):
equipped with the best sensory abilities.
Proposed that Fluid abilities (Gf) are biologically
Alfred Binet (1916) – The tendency to take and maintain a determined while crystallized abilities (Gc) are
definite direction or purpose; the capacity to make adaptations acquired skills and knowledge influenced by
and strategies to achieve a desired end, and the power of cultural, social, and educational experiences
autocriticism (self-criticism)
Auditory Processing (Ga): is the ability to analyze,
Charles Spearman (1927) –Intelligence consists of one general
factor (g) plus a large number of specific factors: Two-Factor synthesize, and discriminate auditory stimuli,
Theory of Intelligence. including the ability to process and discriminate
speech sounds that may be presented under
L. Thurstone primary mental abilities.(1938) 7 interrelated factors distorted conditions.
namely: verbal comprehension, word fluency, number facility,
perceptual speed, memory, space, and reasoning Processing Speed (Gs): is the ability to perform
automatic cognitive tasks, particularly when
Jean Piaget (1954) – a kind of evolving biological adaptation to
measured under pressure to maintain focused
the outside world.
attention.
J. P. Guilford’s Structure of intelligence model (1967). Includes
120 unique intellectual factors that are organized around 3
dimensions: mental operations, content, and products.
THEORIES OF INTELLIGENCE
Short-Term Memory (Gsm): is the ability to apprehend
Lev Vygotsky’s Socio-cultural theory (1978): Suggested a
and hold information in immediate awareness and then developmental theory emphasizing the role of culture and social
use it within a few seconds. interaction.
KN Nonverbal Absurdities
(Knowledge) Verbal Vocabulary
QR Nonverbal Quant. Reasoning
(Quantitative Verbal Verbal QR
Reasoning)
VS Nonverbal Form board
(Visual/Spatial Verbal Positions & Dir
Reasoning)
WM Nonverbal Block pattern
(Working Memory) Verbal Sentence memory
Stanford-Binet Intelligence Scale
IQ Range Category
120-129 Superior
90-109 Average
Vocabulary
Verbal Comprehension Similarities
Information
Picture completion
Perceptual Organization Block Design
Matrix reasoning
Arithmetic
Working Memory Digit span
Letter-number sequencing
Processing Speed Digit symbol
Symbol Search
Wechsler Adult Intelligence Scale
IQ Range Category
130 & above Very Superior
120 – 129 Superior
110 – 119 High Average
90 – 109 Average
80 – 89 Low Average
70 – 79 Borderline
69 & below Extremely low
•
Vineland Adaptive
•
Kaufman Assessment
Battery for Children, 2nd
Behavior Scales, 2nd
ed
ed Scale Name Subtest
Index Subdomain Sequential Scale Number Recall
A tool that is truly Filipino in nature and orientation. Age range: 16 and above; uses
Full Scale IQ
Coverage: Crystallized intelligence → Talasalitaan (Vocabulary), Kakayahan sa mga
bilang (Numerical ability), Ugnayan (Analogy) and Fluid intelligence →
Isinalarawang problema (Figural)
Differential Aptitude Test (DAT, 5th ed. by
Bennett, Seashore & Wesman, 1982, 1990)
General Cognitive Verbal reasoning Measure the ability to learn in either
Abilities an occupational or training setting,
Numerical reasoning
and specifically the ability to learn
from books and manuals, self
instruction, trainers, teachers, or
mentors.
•
Philippine Aptitude Classification Test (PACT by CEM)
Sheldon & Stevens (1942) proposed a type
Raymond Cattell’s Factor-Analytic Trait
theory based on relationship between body theory.
build and temperament such as endo-, Surface traits – more obvious
meso- and ectomorph. Contemporary
research on coronary prone personality aspects of personality
types seem related to the type theory. Source traits – stable and
John Holland’s theory of vocational constant sources of behavior, less
personality. Occupational choice has a visible than surface traits and
great deal to do with one’s personality and more important in accounting for
self-perception of abilities. behavior.
Theories of Personality
Hans Eysenck’s dimensions of
Social Cognitive Theory. Reciprocal
personality = Psychoticism (P), influences between people and their
Extraversion (E), and Neuroticism (N). circumstances, especially concerning
Henry Murray’s Psychogenic Needs and perceptions of control
Environmental Presses which argues that
Humanistic Theory. Theories of how
it is the continuity of functional forms
and forces manifested through sequences people develop, the structure of
of organized regnant processes and overt personality, the nature of mental
behaviors from birth to death health and how to treat problems
based on writings by Rogers and
Sigmund Freud’s unconscious level based Maslow
on Topographical model of Personality
PRINCIPLES OF PERSONALITY TEST
CONSTRUCTION
Response Styles A tendency to….
Personality – Individual’s unique constellation of
psychological traits and states, including aspects of values,
Socially desirable Presenting oneself in a
interests, attitudes, worldview, acculturation, sense of responding favorable light
personal identity, sense of humor, cognitive and
behavioral styles and related characteristics. Acquiescence Agree with whatever is
presented
•
OBSERVATION – This are judgments about
personality based on our own observations.
NORMS, RELIABILITY AND
VALIDITY OF PERSONALITY TESTS
•
The scores are usually interpreted with reference to a set of norms based on the responses of selected
groups of people because the standardization samples are sometimes very small and unrepresentive of the
intended target population, such norms must be interpreted cautiously.
•
Scores and norms for some personality inventories, particularly those consisting of items having a forced-
choice format, are ipsative. When the scoring is ipsative, a person’s score on one scale is affected by his or
her scores on the remaining scales.
•
It is impossible to make all high scores or all low scores because the scores compensate for one another.
This creates problems in comparing the scores of different people on a particular scale or variable.
•
The instability of personality measurements typically results in measures having lower reliabilities than
scores on tests of ability and achievement.
•
Personality inventories have fairly limited validities. Here, the faking and response sets contribute to the
low validities of many inventories used in clinical diagnosis and classification. Another factor affecting the
validity is the susceptibility of users to the fallacy of believing that sets of item scales with similar names
•
NEO Personality Inventory – 16 Personality Factors
•
Typical errors in reproduction are rotation, INTELLECT/OPENNESS Domain SURGENCY/EXTRAVERSION
1. Pagkamatalino Domain
angulation, integration, perseveration, distortion of (Intelligence) 1. Pagkapalakaibigan
2. Pagkamalikhain (Creativity) (Sociability)
shape and disproportion. 2. Pagkamadaldal (Social
curiosity)
AGREEABLENESS Domain 4. Pagkamapagkumbaba
Brannigan and Decker (2003) added seven new items: 1. Pagkamaalalahanin
(Humility)
5. Pagkamaunawain (Capacity
(Thoughtfulness)
4 designs for ages 4 to 7 yo while 3 new items are for 2. Pagkamagalang
for understanding)
6. Pagkamatapat (Honesty)
(Respectfulness)
ages 8 and above. 3. Pagkamatulungin
(Helpfulness)
Panukat ng Ugali at Pagkatao by Virgilio Enriquez & Ma.
Angeles Guanzon-Lapena (1975, 1983, 1989, 1997, 2001)
EXTRAVERSION/SURGENCY AGREAABLENESS
1. Pagkasunud-sunuran (Conformity) 1. Pagkamapunahin (Criticalness)
2. Ambisyon (Ambition) 2. Pagkapalaaway (Belligerence)
3. Pagkamahiyain (Shyness/Timidity) 3. Hirap Kausapin (Difficulty to Deal with)
4. Lakas ng Loob (Guts/Daring) 4. Pagkamapagkumbaba (Humility)
5. Pagkamatulungin (Helpfulness)
6. Pagkamapagbigay (Generosity)
7. Pagkamagalang (Respectfulness)
INTELLECT/OPENNESS TO EXPERIENCE
1. Pagkamausisa (Inquisitive)
2. Pagkamaalalahanin (Thoughtfulness)
3. Pagkamalikhain (Creativity)
Trustworthiness Scale (T Scale)
by Garcia, Hernando, Samson and Abrenica
(1995)
•
Designed to measure the degree to which trustworthiness
behaviors will be manifested by bank employees
•
50 items written in English
•
5-point Likert Scale
•
Reliability (Split-half): .95
•
Validity
– Convergent Validity: r=.343; p value at .05
– Discriminant Validity: r=-.147; p value at .05
Alternative Assessment
•
INTAKE OR INITIAL INTERVIEW •
CHECKLIST
One of the commonly used assessment strategies
1. Requires the observer to note whether a particular
Used to gather information about the range and scope of the concerns, characteristic is present or absent
pertinent details about the current situation, and relevant background
information to the current problems
2. Individuals are instructed to mark the words or
Not always easy to conduct effectively; guidelines and training are needed phrases in the list that apply to them
to obtain accurate and valid information from an interview
Errors in Rating
1. Constant Error – occurs when the assigned ratings are higher (leniency or
generosity error), lower (severity error) or average category (central
tendency error)
Advantages
Observations are relatively easy to do.
The situations in which people or animals are observed may come closer to real life
situations than is the case for most research. Therefore, observational research may be
Observation
Disadvantages
Even in laboratory observation, once the observation is set up, the observer has relatively
little control over the behavior.
The observer may not be aware of all of the factors that are affecting behavior, and may
draw incorrect conclusions.
Observational Methods
Naturalistic Observation - this method involves observing behavior as it naturally
happens. The observer tries to be unobtrusive and does not interfere in any way.
Laboratory Observation - this method involves doing observations in which you have set
up the circumstances in which the behavior will take place. If you interfere in any way to
set up the observation, it becomes a laboratory observation; thus, it does not necessarily
Mental Status Examination
Mental Status Examination (MSE) and client’s history are the
most important diagnostic tools a psychiatrist has to
obtain information to make an accurate diagnosis
A means of assessing the person's current thought processes,
emotions, and interpersonal qualities
Can also provide clues to areas that may need to be
addressed in follow-up sessions or outside referrals
Often conducted as part of the clinical interview
ETHICAL PRACTICE OF PSYCHOLOGICAL
ASSESSMENT (by Groth-Marnat, 2009)
Invasion of Privacy
•
The Office of Science and Technology (1967), in a report
entitled Privacy and Behavioral Research, has defined privacy
as “ the right of the individual to decide for him/herself how
much he will share with others his thoughts, feelings, and
facts of his personal life” (p. 2). This right is considered to be
“essential to insure dignity and freedom of self-
determination” (p. 2).
•
The invasion of privacy issue usually becomes most
controversial with personality tests because items relating to
motivational, emotional, and attitudinal traits are sometimes
disguised.
ETHICAL PRACTICE OF PSYCHOLOGICAL
ASSESSMENT (by Groth-Marnat, 2009)
Inviolacy
•
Whereas concerns about invasion of privacy relate to the
discovery and misuse of information that clients would rather
keep secret, inviolacy involves the actual negative feelings
created when clients are confronted with the test or test
situation.
•
Inviolacy is particularly relevant when clients are asked to
discuss information they would rather not think about.
ETHICAL PRACTICE OF PSYCHOLOGICAL
ASSESSMENT (by Groth-Marnat, 2009)
1.Identify the Problem. One should gather all relevant information and determine whether the problem is an
ethical issue, legal, practice-related, or other issue. If this is an ethical issue, continue with the process.
C. Assessment Tools – judiciously select and administer tests pertinent to referral and purpose of
assessment, norms are directly referable to the population of our clients
ETHICAL DECISION MAKING
(Forester-Miller & Davis, 1996; in Erford, 2013)
D. Obsolete and Outdated Test Results – do not base interpretations, conclusions and
recommendations on outdated test results
E. Interpreting Assessment Results – communicate results in a most understandable way and least
stigmatizing manner
F. Release of Test Data – data not to be used by other persons not involved. We do not release
raw scores, scaled scores, client’s actual test responses and notes or behaviors during exam
unless regulated by the court
G. Explaining Assessment Results –use layman’s terminologies and not technical language
H. Test Security – administration and handling of all test materials must be done by qualified
personnel.
I. Assessment by Unqualified Persons – except for training purposes with adequate supervision
ETHICAL DECISION MAKING
(Forester-Miller & Davis, 1996; in Erford, 2013)
3.Determine the Nature and Dimensions of the Dilemma. A psychometrician should consider the
moral principles that underlie the PAP Code of Ethics for direction, current research, and seek
consultation to determine an appropriate course of action.
4.Generate Potential Course of Action. A professional psychometrician should consult at least one
colleague to ensure that all potential courses of action are identified.
5.Consider the Potential Consequences of all Options and Determine a Course of Action. The impact
of potential consequences on the client and others should be considered in determining which option
is optimal for addressing the dilemma.
6.Evaluate the Selected Course of Action. This is to ensure that implementing that choice will not
create new or additional ethical dilemmas.
7.Implement the Course of Action. The psychometrician, together with the assistance from the
psychologist, should implement the selected course of action and follow up to ensure that the selected
action had the desired outcome.
1. Mismatched Validity
*Some tests are useful in diverse situations, but
no test works well for all tasks with all people
in all situations.
*It is important to note that as the population,
task, or circumstances change, the measures of
validity, reliability, sensitivity, etc., will also
tend to change.
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
2. Biases
*Contrast bias. Involves making an evaluation based on
the standard of the preceding client (Wexley, Sanders, &
Yuki, 1973).
*Order effects. The primacy and recency effect,
collectively known as order effects, refer to the saliency
of information based on the timing and order in which
they are presented. The primacy effect can be seen
when information first presented to the assessor
influences the final judgment more than information
presented later during the session (Peters & Terborg,
*10 Fallacies in
1975). The reverse applies for the recency effect, where
information presented later in the session has a greater
Psychological Assessment
influence on the final decision made (Morgeson &
Campion, 2010).
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
2. Biases
*Availability bias. Rresults from inaccurately basing the
frequency of events on the ease with which they can be
recalled to memory (Morgeson & Campion, 2010).
*Confirmation bias. Refers to the tendency to seek
evidence to confirm an initial preconception and ignore
any contradictory information (Dror & Fraser-McKenzie,
2008).
*Representativeness bias. Reflects the tendency of
people to judge the degree of relationship between two
things based on their similarity to each other (Morgeson
& Campion, 2010).
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
3. Confusing Retrospective & Predictive
Accuracy (Switching Conditional
Probabilities)
*“Affirming the consequent” logical fallacy:
People with condition X are overwhelmingly likely
to have these specific test results.
Person Y has these specific test results.
Therefore: Person Y is overwhelmingly likely to
have condition X.
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
4. Unstandardizing Standardized
Tests
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
5. Ignoring the effects of Low Base Rates
* Example:
Task: To create a test that will identify crooked person among
Judicial Candidates
Hypothetical Contention: 1 in only 500 Judicial Candidate is
crooked
Supposed Reliability: .9
Number of Judicial Candidates: 5000
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
6. Misinterpreting High Base Rates
* Example:
Out of 273 residents who sought mental health service after
an earthquake, 89% were diagnosed with PTSD related to the
earthquake. 92% of that figure belong to a specific faith.
Therefore, this faith makes a person vulnerable to PTSD. Or
more subtly, this faith might make it easier for people with
PTSD to seek mental heath services.
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
7. Perfect Conditions Fallacy
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
8. Financial Bias
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
*9. Ignoring Effects of Audio-recording,
Video-recording or the Presence of
Third-party Observers
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php
*10. Uncertain Gatekeeping
* Example:
* A 17-year-old boy comes to your office and asks for a
comprehensive psychological evaluation. He has been
experiencing some headaches, anxiety, and depression.
A high-school dropout, he has been married for a year
and has a one-year-old baby, but has left his wife and
child and returned to live with his parents. He works full
time as an auto mechanic and has insurance that covers
the testing procedures. You complete the testing.
*10 Fallacies in
Psychological Assessment
Pope, K.S. (2003; 2010). 10 fallacies in psychological assessment. Retrieved on June 2015
from http://kspope.com/fallacies/assessment.php