Professional Documents
Culture Documents
Competencia para Defenderse
Competencia para Defenderse
A Thesis
Presented to the
Faculty of
In Partial Fulfillment
Master of Science
in
Clinical Psychology
_______________________________
By
Jael Huizar
Spring, 2017
ABSTRACT
Among all the standardized tests available to assess competency to stand trial, the
and The Evaluation of Competence to Stand Trial-Revised (ECST-R) are unique in that
they are among the most extensively developed tools. While developed for the same
purposes, they differ in both the theoretical foundation and the approach to assess
each test could potentially vary when used with the same defendant.
The purpose of this review was to provide an in-depth understanding of each test
by exploring what is known about the psychometric properties of each test; as well as
what is known about the way each instrument works across various populations. The
research available offers some basis to make tentative recommendations for the clinical
utility of these tests that are offered herein. Below follows a discussion of the limitations
and recommendations for use of each in clinical practice. While recommendations for
clinical use of the tests are provided, any suggestions offered should be considered
tentative, as there is a lack of consensus in the field regarding how the instruments should
be used.
ii
TABLE OF CONTENTS
ABSTRACT ................................................................................................................... ii
Chapter
1. INTRODUCTION ................................................................................................ 1
iii
APPENDIX: TABLES................................................................................................... 53
REFERENCES .............................................................................................................. 56
iv
1
CHAPTER 1
INTRODUCTION
Competency to stand trial evaluations form a significant part of the legal process
(Borum & Grisso, 1995). Competency to stand trial (CST) refers to an individual’s
capacity to function as a defendant in a court of law. The legal system can be very
intimidating for defendants, in part because legal jargon is not easily understood outside
the community of attorneys, lawmakers, and judges. Therefore, it can be difficult for
legal system participants to understand and effectively take part in the legal process.
Nowhere is it more important to understand the legal process than for individuals charged
with crimes because the outcome of a defendant’s case rests heavily on the actions taken
Although a law degree is not necessary to utilize the court system, it is important
understand one’s civil rights, and possess enough basic knowledge to actively participate
in legal proceedings. In most cases the average person possesses the minimal level of
legal knowledge necessary for effectively working with an attorney and proceeding
and/or cognitive deficits may hinder a defendant’s ability to think rationally or make
In Dusky v. United States (1960), the U.S. Supreme Court concluded that “it is not
enough . . . that ‘the defendant [is] oriented to time and place and [has] some recollection
of the events,’ but defined competency to stand trial as “whether [the defendant] has
sufficient present ability to consult with his lawyer with a reasonable degree of rational
him” [362 U.S. 402, 403]. Since then, each state in the U.S. has adopted some variation
of the Dusky standard for CST evaluations. The justice system has relied on this standard
as a guide for determining whether defendants are competent and therefore, able to
In terms of clinical application, the Dusky standard has been challenging for
provided in the ruling as to how an examiner should define terms such as “rational
defendant to be ruled incompetent to stand trial. Consequently, legal scholars and clinical
researchers have put forth great efforts to conceptualize Dusky in order to produce a more
well-defined illustration of the specific abilities and levels of those abilities that are
required to consider one competent to stand trial (Bonnie, 1992; Grisso, 2003; Otto, et al.,
1998; Poythress et al., 1999; Roesch, Hart, & Zapf, 1996; Roesch, Zapf, Golding,
& Skeem, 1999; Rogers et al., 2001; Rogers, Jackson, Sewell, Tillbrook, & Martin, 2003;
Zapf, 1999; Zapf et al., 2009). The Dusky standard has generally been conceptualized as
3
having three central functions which include the ability to: consult with counsel, possess
and grasp adequate factual understanding about general courtroom proceedings (Grisso,
2003). Competent defendants are expected to understand their legal situation and how it
Assessing competence to stand trial is not a simple or even a uniform process for
clinicians who conduct these evaluations. CST assessments require a careful investigation
consideration of how the defendant’s functioning might interfere with or impede one’s
ability to adequately participate in the legal proceedings (Grisso, 2003; Melton, Petrila,
Poythress, & Slobogin, 2007; Zapf & Roesch, 2009). Clinicians must consider
the expectations of the justice system, and whether the demands of the defendant’s case
may exceed the abilities the individual possesses. Following such an evaluation,
clinicians must then interpret and report their results in a format useful to the justice
system where a decision regarding the defendant's competence will be made, most
criticism, especially given the wide variety of methods currently used by practitioners
who conduct CST evaluations (Archer, Buffington-Vollum, Stredny & Handel, 2006;
Borum & Grisso, 1995; Grisso, 1991, 2003; Holub, 1992; Neal & Grisso, 2014). In the
past, examiners tended to focus too heavily on mental illness and failed to address how
the illness directly related to the legal standard for competence in their evaluations
4
(Grisso, 2003). However, there is now a clear consensus among experts that the presence
of mental illness alone does not automatically render a defendant incompetent to stand
trial (Bonnie, 1992; Golding & Roesch, 1988; Tillbrook, 2000). Instead, a careful
examination of not just the presence of impairments, but how those impairments intersect
with the demands of the legal case, is necessary when performing a CST evaluation.
tools that can help assess a defendant’s abilities as defined in Dusky have become highly
Assessment Instruments (CAIs) have been developed to guide those who are conducting
CST evaluations to aid clinicians in their ability to evaluate both one’s mental status as
well as the effect any deficits that are present might have on the competence-related
Since their creation, evaluators have regularly used CAIs to aid in forming their
opinions when conducting CST evaluations; however, the relation of such measures to
the Dusky standard has been debated. Among all of the standardized tests available to
unique in that they are considered to be the most extensively developed tools, both of
which focus on the legally relevant concepts most related to the Dusky standard. While
both tests were created from thoughtful programs of research to address the functional
abilities drawn from Dusky, the MacCAT-CA and ECST-R are very distinct from each
other. These forensic instruments differ in the theoretical foundation on which they are
5
based. For example, the MacCAT-CA is fashioned after Bonnie’s two-part conceptual
model on competence which includes both: basic and decisional competence; whereas
ECST-R is designed to directly parallel the three Dusky prongs, which includes: Consult
takes a semi-structured approach to assess both competency and feigning while focusing
The expectation that one competency measure could be used by evaluators and
would have the flexibility and utility to fit every defendant’s unique case is unrealistic.
Indeed, all existing assessment tools do not operate uniformly and many approach the
assessment of CST differently, as is the case with the MacCAT-CA and the ECST-R.
Thus, it is important to know whether the findings from these forensic instruments are
equivalent and, if not, how might any variance across these tools affect the outcomes of
the defendants being evaluated. It is important to know: whether one test might hold
defendants to a higher standard of functioning than the others, whether one assessment
instrument is more suitable for use with a particular type of defendant, and whether
different instruments should be limited for use with certain groups of defendants. These
issues are important to understand given the differences between the ECST-R and the
MacCAT-CA. With these questions identified, a review of the available research would
help to provide insight as to the current state of knowledge regarding these two CAIs.
6
For the present study, an extensive review of all the existing literature relating to
the MacCAT-CA and the ECST-R was conducted. The purpose of this literature review
was to provide an in-depth understanding of each test by exploring what is known about
the psychometric properties of each test; as well as what is known about the way in which
each instrument works across various populations. The main goals for this review were:
(a) to summarize the literature available for each test to provide a detailed overview of
the current knowledge of each instrument, including a critique of the strengths and
weaknesses of each; (b) to conduct a comparison of the two tests against other another;
and (c) to provide recommendations for use of each test in clinical practice for those
CHAPTER 2
main projects. The first project, which was supported by the MacArthur Foundation
Research Network on Mental Health and the Law, was designed to create a more
criminal defendants (Bonnie et al., 1997; Poythress et al., 1999). As part of this project, a
measure abilities that were thought to be important when making decisions regarding
one’s legal case. The MacSAC-CD focused on four factors that included the abilities to:
reason with information related to one’s case, and make a decision. These factors were
trial. However, the MacSAC-CD was not user-friendly because it was lengthy, difficult to
score, and the items were repetitive (Poythress et al., 1999). Therefore, this instrument
was not practical for everyday use in CST evaluations and eventually it was shortened
and simplified for use in CST evaluations. The more condensed, user-friendly, instrument
Test Overview
intended for use with defendants who are 18 years of age or older, who have an IQ
greater than 60, and who speak English. The MacCAT-CA was based in part on the
of competence to stand trial. Bonnie (1992) proposed that the Dusky standard could be
accusations, the judicial process, and the defendant’s overall capacity to effectively work
with an attorney to prepare a defense; while decisional competence refers to aiding one’s
attorney by providing input on the case and possessing the ability to make rational and
reasonable decisions in conjunction with legal defense (Bonnie, 1992). The newer
Appreciation. These scales were meant to align with Bonnie’s (1992) definition with the
Understanding and Reasoning scales measuring basic competence and the Appreciation
provide ratings of patients’ abilities across the three areas assessed. Outcomes from the
but are meant to be integrated with other relevant information from the defendant’s
through questions about a fictional defendant. Theoretically, this domain aligns with the
capacity to recognize general information related to the legal system. The Understanding
section allows retesting on many of the items when a defendant answers a question
wrong. The retest format is a benefit because it considers a defendant’s ability to learn
and make decisions (Poythress et al., 1999). The second domain of CST assessed by the
MacCAT-CA is Reasoning, which explores the defendant’s ability to detect the possible
defendant faces throughout the legal process. Theoretically, this domain aligns with the
ability to classify information based on the importance and relevance of the information
to a legal case. The last domain of the MacCAT-CA assesses Appreciation by probing
defendants about their attitudes and beliefs regarding the legal process related to the
defendant’s own legal case. Theoretically, this domain is the most connected to the
“rational understanding” part of the Dusky standard because of its emphasis on the
defendant’s ability to understand and appreciate the facts of the defendant’s legal case.
The responses made by defendants are rated on a three-point scale (0, 1, or 2) using
specific scoring criteria. Higher scores on the three-point scale indicate higher levels of
abilities. Scores are totaled for each domain, but no overall score is calculated.
years of age with an IQ higher than 60. Thus, the authors caution that the MacCAT-CA
10
should only be used to assess individuals that speak English due to the absence of
available translations in other languages (Poythress et al., 1999). In fact, not only is the
MacCAT-CA only available to English speaking defendants, but there is also a scarcity
of research available regarding the impact of using the MacCAT-CA with non-English
speaking defendants and the use of a translator is contraindicated due to the potential for
variations in intended meaning of the item questions and subsequent responses (Poythress
et al., 1999).
Psychometric Properties
Norms
defendants from three groups. The first group, known as “jail unscreened” (JU), was
defendants from the jail inmate population. The second group known as “jail treated”
(JT) was also assumed to be competent and comprised of 249 jail inmates that were
receiving mental health services not related to restoration of competency. The third group
known as “hospitalized incompetent” (HI) was comprised of 283 defendants who were
hospitalized for restoration of competency and who were considered legally incompetent.
The participants in the norming group were predominantly Caucasian (52.5%), males
(90%); although the sample did include a smaller number of individuals representing
minority groups (i.e., 12% black, less than 1% Asian, 36% “Nonwhite”). The authors of
the MacCAT-CA did report that race/ethnicity was significantly related to Understanding
and Reasoning subscales (Poythress et al., 1999). However, further statistical testing
indicated that the variation in scores between the Whites and Non-Whites was due to
11
variances in intellectual abilities (Poythress et al., 1999). Only 10% of the 729
participants were female, but the manual does not provide any specific information on
Reliability
In terms of reliability, the original norms for the test showed good internal
consistency of the three abilities related to competency and found alpha coefficients of
.85 for Understanding, .81 for Reasoning, and .88 for Appreciation (Otto et al., 1998;
Poythress et al., 1999, 2002). The mean interitem correlations were .36 (Reasoning), .42
consistency and acceptable similarity of item content for these measures. In a later study,
Rogers et al. (2001) also evaluated the reliability of the MacCAT-CA scales, and reported
good internal consistency with alphas of 0.81 (Understanding), 0.85 (Reasoning) and
MacCAT-CA have also shown good levels of internal consistency for the Understanding
and Reasoning domains of the MacCAT-CA (Jacobs, Ryba, & Zapf, 2008; Marcus,
Poythress, Edens, & Lilienfeld, 2010; Rogers, Grandjean, Tillbrook, Vitacco, & Sewell,
2001). However, some research has shown the appreciation scale to be weaker
psychometrically than the other two scales and it has been hypothesized that this
difference may stem from the differing format of the items on this scale from the other
two MacCAT-CA scales (Jacobs et al., 2008; McIlnay, 2010; Zapf, Skeem & Golding,
2005).
12
Otto et al., (1998) reported that interscorer reliability for the three scales ranged
from good to excellent, with Intraclass Correlation Coefficients (ICC’s) of .75 for
Appreciation, .85 for Reasoning, and .90 for Understanding. Lexcen, Hawk, Herrick, and
Blank (2006) also reported ICCs for the subscales of the MacCAT-CA for administration
that occurred either in person or via video administration and the ICCs were excellent
across both conditions for all the scales, except for the appreciation subscale which
yielded a slightly lower ICC of .69 for both conditions. Overall, studies evaluating the
reliability of the MacCAT-CA scales have been favorable, although the Appreciation
scale does appear to be somewhat weaker than the Understanding and Reasoning scales.
Validity
In terms of validity, the original validation study conducted by the test developers
assessed the criterion-related validity by examining the variances between the average
scores for the three sample groups: HI, JT, and JU. The results indicated that the
defendants since the HI group scored significantly lower than the JT group on all
MacCAT-CA scales (Poythress et al., 1999). They also found that the MacCAT-CA
appeared to adequately differentiate between the different groups of defendants who were
hospitalized for being incompetent and defendants who were competent but receiving
mental health services in jail. Poythress et al. (1999) compared the performance of
defendants assessed with the MacCAT-CA and found that hospitalized incompetent
that are consistent with Dusky. The authors concluded that these findings are similar to
those found for other forensic instruments such as Competence Screening Test (CST;
13
Lipsitt, Lelos, & McGarry, 1971) and Georgia Court Competency Test-Mississippi State
Hospital Revision (GCCT-MSH) (Nicholson, Briggs, & Robertson, 1988; Otto et al.,
1998).
From the original norming studies, Otto et al. (1998) also provided support for the
all three domains of the MacCAT-CA were related to indicators of intellectual capacity
and psychopathology, as would be expected (Poythress et al., 1999; Redding, 1997; Zapf
et al., 2005) and an analysis of the relationships between the MacCAT-CA subscales and
the other measures suggest that the subscale scores converge with intelligence and
education, but diverge from measures of emotional distress, decisional temperance, risk
perception, and perspective which refers to a defendant’s ability to appreciate the short
and long term effects of a decision (Krause, 1998). Other studies conducted have also
provided evidence for construct validity (Boyd, 1999; Krause, 1998; Redding, 1997; Zapf
et al., 2005).
specifically, by evaluating relations with the psychiatric symptoms assessed on the Brief
Psychiatric Rating Scale (BPRS) (Jacobs et al., 2008; Overall & Gorham, 1962; Ryba &
Zapf, 2011; Warren, Aaron, Ryan, Chauhan, & DuVal, 2003). The psychiatric symptoms
measured by the BPRS included four clusters: psychoticism, hostility, depression, and
withdrawal. Findings from these studies indicated a strong negative relationship between
14
the Psychosis and Withdrawal symptoms on the BPRS with all three MacCAT-CA
scales, but no significant relationship with Depression symptoms (Jacobs et al., 2008;
Redding, 1997; Ryba & Zapf, 2011; Warren et al., 2003). There appears to be some
variability in the findings in relation to the Hostility cluster of the BPRS. Redding (1997)
reported that the hostility cluster was moderately negatively related to the Reasoning and
Appreciation scales, whereas Warren et al. (2003) found that the hostility cluster was
related to the Understanding and Reasoning. Ryba and Zapf (2011) found that the
hostility cluster was related to all three MacCAT-CA scales, but they note that variations
in cognitive abilities explained more differences in the scores of all three MacCAT-CA
scales than did the psychiatric symptoms (Ryba & Zapf, 2011).
levels of .36 (Understanding), .42 (Reasoning), and .49 (Appreciation) with all
correlations statistically significant (Poythress et al., 1999). Zapf and Roesch (2005)
conducted an analysis of the correlations between the MacCAT-CA, the Fitness Interview
underlying construct with the results indicating that they all correlate at a moderate to
high levels. The correlation between the FIT and the MacCAT-CA was .82 and the
correlation between the MacCAT-CA and the MacCAT-T was .86. These high
correlations provide good support for the construct validity of the MacCAT-CA-CA as
the FIT is another CAI and the MaCAT-T is a similarly designed instrument that assesses
comparison with another CAI, the Competence Assessment to Stand Trial for Defendants
with Mental Retardation (CAST-MR), which is designed specifically for use with
defendants diagnosed with an Intellectual Disability. Overall, these studies show a lack of
convergent validity between these two forensic instruments as there was no significant
relationship found between scores on the MacCAT-CA and those on the CAST-MR
(Bennett, 2005; Jurecska, 2010; Jurecska, Peterson, & Millkey, 2012). These findings
were somewhat surprising as both instruments are designed to measure competence, and
Four studies investigated the factor structure of the MacCAT-CA (Jacobs et al.,
2008; Krause, 1998; Rogers et al., 2001; Zapf, et al., 2005). The findings across these
studies generally revealed support for either a two- or three-factor model, with the three-
factor model generally receiving the most support. However, there was some variance
among the studies as to how to organize the items into a three-factor model. The two-
factor model tested by Rogers et al. (2001) combined the Understanding and Reasoning
sections into Factor One and maintained the Appreciation section as Factor Two and
while Rogers et al. (2001) noted that, on the surface, these results may be puzzling as two
of the Dusky criteria were merged into one factor, they hypothesized that the assessment
of the Dusky prongs is likely confounded by the nature of the questions that address the
hypothetical scenario as opposed to those addressing the defendant’s own case. The
three-factor model from Rogers’ study was broken down as: Appreciation section items
hypothetical case (Factor 2); and Reasoning items with the addition of two items from
16
Understanding section (Factor 3). Zapf et al. (2005) later tested both the two and three
factor models proposed by Rogers et al. (2001) but did not find support for either.
supporting the utility of the original design of the MacCAT-CA scales as well as a
different three factor version that was a better fit and that was also later confirmed by
Jacobs et al. (2008). The new modified three-factor model placed together all the items in
the Understanding section with Items 14-16 from the Reasoning section (Factor 1); Items
9-13 from the Reasoning section (Factor 2); and all items in the Appreciation section
(Factor 3). In all, while the MacCAT-CA scales appear to hold up reasonably well
through factor analysis, although some questions remain as to how purely they fit with
the three aspects of Dusky and whether the nature of the questions (those related to the
hypothetical scenario versus those related to the defendant’s case) holds undue influence
A number of studies have called into question the differences between the types
of questions across the three scales on the MacCAT-CA. Zapf and Roesch (2005)
investigated the factor loadings of each of the scales of the MacCAT-CA and found less
support for the Appreciation scale than the other two scales, which they hypothesize is a
result of the defendant having to use abstract thinking related to their own circumstances,
whereas the other scales rely on responses to the hypothetical scenario. Zapf and Roesch
(2005) suggested that perhaps the complexity of measuring a defendant’s awareness and
understanding of his or her own situation using a standardized format resulted in weaker
measurement for the Appreciation scale. This idea is in line with the suggestions put forth
by Marcus et al., (2010) who proposed that rational understanding is taxonic and should
17
determination. They posit that a defendant either has rational understanding or does not;
unlike the factual understanding scale, where defendants may possess varying degrees of
Overall, given the whole of the research available on the validity and usefulness
of the MacCAT-CA, it appears there is sufficient evidence to show that this instrument
related to constructs that one would expect it to relate to (i.e., intelligence, cognitive
ability, psychoticism, clinician ratings of competence, etc.) and it differs from constructs
one would expect it to (i.e. emotional distress, risk perception, depression, etc.). In
addition, the three scales hold up reasonably well under scrutiny, although questions have
been raised about the design and strengths of the Appreciation section.
MacCAT-CA, a substantial number of studies have investigated the use of this instrument
with additional populations, such as those with an intellectual disability (ID) and with
youthful defendants. Four studies have investigated the utility of the MacCAT-CA with
Jurescka, Peterson, & Millkey, 2012; Pinals, Tillbrook & Mumley, 2006). The results of
these studies have generally indicated that the MacCAT-CA was significantly more
challenging for defendants with ID. Many defendants were incapable of understanding
the MacCAT-CA’s hypothetical scenario and there were items that were answered
incorrectly by all participants with ID (Understanding items 7 and 8) or that were not
18
answered at all (Appreciation item 22) (Pinals et al., 2006). Further, it was noted that
participants struggled to elaborate on their responses and many became discouraged with
the number of questions on the test thus resorted to responses of “I don’t know.” Another
problem observed was that examiners spent much longer amounts of time repeating
questions and attempting to make sense of incoherent responses that they were often
unable to complete the test in a reasonable amount of time (Pinals et al., 2006). Due to
the various obstacles with ID populations, many defendants were counted as incompetent
to stand trial. Thus, defendants with intellectual disabilities that are assessed using the
MacCAT-CA may be more likely to be found incompetent to stand trial than those who
tested using other CAIs such as the CAST-MR, which utilizes a multiple-choice format
A total of thirteen studies have investigated the utility of the MacCAT-CA with
adolescent populations The findings of these studies indicate that juveniles between the
ages of 10-15 generally scored lower than the adults whereas the older juveniles between
the ages of 16-17 tend to perform similarly to adult samples (Boyd, 1999; Burnett, 1999;
Burnett, Noblin & Prosser, 2004; LaVelle Fick, Hart, & Deardorff, 2006; Grisso et al.,
2003; Grisso & Seagrave, 2005; Lyle, 2003; Viljoen, Odgers, Grisso, & Tillbrook, 2007;
Viljoen, Slaney, & Grisso, 2009; Warren et al., 2003). Only one study found that the
the adults in this study had significantly low IQ levels in comparison to their adolescent
whether the differences lie in lack of knowledge versus developmental factors (Viljoen et
al., 2007; Viljoen et al., 2009). For example, the MacCAT-CA’s unique retest format was
on legal concepts (Viljoen et al., 2007). However, younger adolescents did not perform
better even after receiving education suggesting that the obstacles for adolescents are
measurement bias on several items from the Understanding and Reasoning scales,
although no items from the Appreciation scale showed evidence of bias (Viljoen et al.,
2009). Overall, the findings from studies on the use of the MacCAT-CA with youthful
defendants shows notable age differences that impact performance on the MacCAT-CA
and this literature calls into question how strongly a defendant’s age or level of maturity
can influence competence opinions when the MacCAT-CA is used with this group.
In a related area of study, it was found that participants with even small
impairments in frontal lobe abilities, such as attention and abstract reasoning, were
enough to cause individuals to perform poorly on the MacCAT-CA and the authors
conclude that these difficulties may stem from the rigidity of the instrument as well as the
wordiness of the questions (Pinals et al., 2006). They also noted that the MacCAT-CA
does not adequately detect a defendant’s delusional beliefs about one’s legal situation.
Pinals et al. (2006) reported that clinical factors such as psychotic symptoms, cognitive
deficits, or efforts to feign interfered with about 30% of defendants’ ability to complete
the MacCAT-CA. These authors suggest that evaluators should not use the MacCAT-CA
to assess competence with all defendants in a public forensic setting and that instead it
that perhaps it is significant to concede that the very reasons why defendants cannot
complete the MacCAT-CA are parallel to the reasons they may be found incompetent to
limitations.
The original norms for the MacCAT-CA included only a limited number of
women and minorities and there is a notable lack of information on any gender or race
differences reported in the test manual (Grisso, 2003). Since its publication only a small
number of studies have discussed or focused on these issues. A few studies have been
conducted that included a small sample of female participants (10% to 14%) but no
specific analyses aimed at exploring gender differences were conducted (Burnett et al.,
2004; Edens et al., 1999; Jurecska, 2010; Jurecska et al., 2012; Mueller & Wylie, 2007).
Redding (1997) conducted one of the only empirical studies that directly included an
investigation of gender, but this research, which included a sample of incarcerated female
defendants who were mostly African American, had the primary focus of investigating
the relationship between depression and legal competence, which was measured using the
MacCAT-CA. Although, Redding (1997) did not address gender issues, specifically, in
his findings, he did note that female participants showed higher levels of depression, but
there were no significant correlations between gender and scores on the MacCAT-CA,
In terms of using the MacCAT-CA with various racial or ethnic groups, very few
studies have directly addressed this issue. In a study designed to examine the effect of
defendants who were: 48% African American, 34% Caucasian, 10% Hispanic, 6% Asian
21
and 2% came from other groups (Krause, 1998). Results of this study found that race was
a significant factor influencing the outcomes, but only on the Understanding scale.
Specifically, the results indicated that Caucasian defendants had the highest scores,
followed by African Americans, Hispanics, and then Asians; however, further analyses
revealed that these race/ethnic differences disappeared when the defendants level of
intelligence was controlled. The only other study conducted with defendants from
racial/ethnic minority groups, found that minority groups displayed reduced levels of
understanding before teaching (Viljoen et al., 2007). However, it was also shown that
minority groups demonstrated higher levels of improvement with teaching than the non-
Hispanic Caucasian defendants. To explain their findings, the authors note that perhaps
the minority groups in their sample were less acquainted with the process of the justice
system or even with legal jargon, but they did have the ability to learn and that the
difference in performance was not due to test bias or even to difference in intelligence,
but rather due to reduced exposure and familiarity with the court system (Viljoen et al.,
2007).
While the psychometric properties of the MacCAT-CA are generally quite strong,
the test is not without criticism (Rogers et al., 2001; Rogers & Johansson-Love, 2009;
Tillbrook, 2000). In particular, the MacCAT-CA has been criticized for its loose
adherence to the Dusky standard (Pinals et al., 2006; Rogers et al., 2001). Since the
conceptual model on competence and in part on the three Dusky prongs, it has been
argued that the merging of these two theoretical approaches resulted in an instrument that
22
does not align perfectly with either, nor does it fit well with the legal standard defined by
Dusky (Jacobs et al., 2008; Rogers et al., 2001; Zapf, 1999; Zapf, Skeem, & Golding,
2005). In order for an assessment instrument to adhere to Dusky, it must address the three
ability to consult with an attorney. The Dusky standard requires the evaluation of abilities
specifically related to a defendant’s case as well as the ability to consult with an attorney;
therefore, it is unclear how well the three domains of the MacCAT-CA directly address
the defendants’ understanding and reasoning about their specific legal case since 16 of
the 22 test items refer to a hypothetical scenario rather than the defendant’s specific case.
The MacCAT-CA items do not directly address the ability to consult with counsel prong
and thus do not fully adhere to Dusky (Pinals et al., 2006; Rogers et al., 2001).
of symptoms to use other screens for malingering. While many would argue that CAIs
are not meant to assess both competence and feigning, others have noted the importance
and benefits of doing both simultaneously (Rogers et al., 2001) While the MacCAT-CA
does not include a formal screen for feigning built into the test, there is some preliminary
evidence that defendants who attempted to feign during the MacCAT-CA administration
were unable to maintain their exaggerated or faked symptoms throughout the testing
process (Pinals et al., 2006). The authors posit this may be due to the structured format of
the test providing a distraction to how they were coming across to evaluators.
23
Concerns have also been noted that the questions addressing the hypothetical
scenario are too rigid. For example, the MacCAT-CA manual states that evaluators must
read the hypothetical scenario verbatim and cannot paraphrase any part of it even if a
defendant does not understand the vignette (Poythress, et al., 1999). An evaluator is only
permitted to reread the story at a slower pace but is not authorized to elaborate or clarify
any parts of the scenario. Even though the vignette format has been criticized, there are
still positive reviews about the MacCAT-CA’s ability to assess a defendant’s reasoning
(Acklin, 2012; Rogers et al., 2001). Zapf et al. (2005) pointed out that instead of a
one’s case may be more useful in discovering any underlying irrational thought processes
that may not be obvious using highly structured questions used in the MacCAT-CA based
on a hypothetical scenario.
defendant’s capacity due to its retest format (Rogers et al., 2001). In particular, the
Understanding section allows retesting for many of the items when a defendant answers a
question wrong. The retest format is a benefit in that it considers a defendant’s ability to
learn any specific pieces of legal information of which the defendant may not have
initially been aware (Poythress et al., 1999). This education process, allows the examiner
to differentiate between defendants who perform poorly on items because they are
incompetent, simply because they lack sufficient knowledge of the justice system. Thus,
a retest format is useful in screening out competent defendants whose scores can improve
through brief education and many participants who received brief education demonstrated
24
Another critique about the MacCAT-CA’s norming sample is that it was based on
a prison sample of defendants that may not adequately represent nonincarcerated pretrial
defendants undergoing CST evaluations (Golding, 2008; Rosenfeld, 2001). The argument
highlights the fact that the original sample relied on defendants whose competency was
not questioned and in turn compared them to a group of defendants that were found
legally incompetent to stand trial. The critique notes that this weakens the construct
validity of the instrument. Perhaps it would have been more appropriate for the MacCAT-
CA to have incorporated a more appropriate sample that represents its target population
using a sample of defendants where clear-cut competent and incompetent cases have been
removed. Since publication, no other studies have been conducted that test Golding’s
competency.
Dusky standard. Overall, the MacCAT-CA has been found to have good reliability and
validity for assessing competence (Otto et al., 1998; Zapf et al., 2005). The retest format
capacity to learn new material. Although it is highly recommended for use as a tool,
25
rather than a standalone test, critics also have commented on its loose adherence to
Dusky, the limited scope of practice due to the original norming sample focusing on
Caucasian males, with an IQ higher than 60 that speak English and its use of a
CHAPTER 3
Around the same time the MacCAT-CA was being developed, a second CAI was
also in the works. The original Evaluation of Competency to Stand Trial (ECST)
instrument was developed by Rogers in 1995 (Rogers & Tillbrook, 1998). Rogers,
Tillbrook, and Sewell (2004) introduced the revised version known as the Evaluation of
Competence to Stand Trial –Revised (ECST-R) in 2004. This revised instrument was
created to include scales that evaluated competency as well as scales that were designed
to detect feigning.
Test Overview
The ECST-R was designed as a hybrid interview with both semi-structured and
unstructured components that assess the three factors of competence as defined in Dusky
v. United States (1960). The ECST-R is designed for use with defendants who are 18
years of age or older, who have an IQ greater than 60, and who speak English. The
ECST-R has not been validated for use with juveniles, defendants with moderate mental
retardation, or with non-English speaking defendants (Rogers, Tillbrook & Sewell, 2004).
range of cognitive abilities. Thus, the test manual recommends only using this instrument
when a defendant possesses the necessary English language skills to understand the
questions (Rogers et al., 2004). The authors stress caution must be taken when using the
27
ECST-R with an interpreter as words can easily lose their intended meaning when
translated, unless using a certified court interpreter (Rogers, Tillbrook, & Sewell, 2004).
The authors also note that the task of interpreting the Atypical Presentation scales can be
The ECST-R categorizes the 18 competency questions into three domains that
Rational Understanding of the Courtroom Proceedings (RAC), and Consult with Counsel
(CWC) (Rogers et al., 2004). The six items in the Factual Understanding of the
the courtroom works as well as their knowledge of the legal process. Special attention is
placed on whether defendants can identify significant key players of the court such as the:
judge, jury, prosecutor, and defense attorney as well as the distinct roles of these
individuals in the defendant’s case (Rogers, Tillbrook & Sewell, 2004). The seven items
in the Rational Understanding of the Courtroom Proceedings (RAC) domain assess the
ability of a defendant to make decisions that are not severely compromised by mental
illness (Rogers, Tillbrook & Sewell, 2004). Special attention is paid to mental
reasoning, testifying, plea negotiating, as well as in their courtroom experiences and any
unmanageable courtroom behavior. The six items in the Consult with Counsel (CWC)
domain assess how well a defendant can maintain a working relationship with one’s
attorney (Rogers, Tillbrook & Sewell, 2004). The items in the CWC domain focus on a
and expectations of the attorney. Overall, the items explore general dynamics of the
28
attorney-client relationship. The CWC scale is assessed using open-ended type questions
unlike the items on the other scales that are more structured.
The ECST-R’s hybrid format serves an important function. The structured part of
the instrument is designed to thoroughly measure the essential functional abilities related
clinicians can supplement the 18 competency questions provided with additional queries
in that provides both the structure of a standardized measure as well as the flexibility to
The authors of the ECST-R had two goals in mind in deciding how to approach
the scoring method of this test. The first goal was to assess competence-related abilities
and the second goal was to avoid generating numerical scores that could complicate
important information about a defendant’s case. Thus, they produced a method in which
the ECST-R competency scales offer valuable information about an individual defendant
that is also combined information about his or her case. Therefore, the ECST-R uses
rating criteria (0 to 4) that measure the severity of psychotic symptoms (i.e. persecutory
delusions) and self-defeating motivation of impairment (Rogers et al., 2004). Each of the
18 items has several ratings that clinicians can make for each item. The clinician then
decides on the final score based on the ratings. In some instances, more than one rating is
applied for an item, thus the score would be the equivalent to the highest. In terms of
scoring, the items in each of the three ECST-R domains (CWC, FAC, and RAC) of the
ECST-R are quantified using a rating scale for psychotic symptoms and self-defeating
29
motivation that range from: 0, not observed; 1, questionable clinical significance; 2, mild
impairment not related to competency; 3, moderate impairment that will influence but not
impair competency; and 4, severe impairment that impairs competency. The item scores
from each of the three domains are summed and then converted to t-scores that are
impairment.
Presentation scales, which are comprised of 28 items that assess feigning as it relates to
competency (Rogers, Tillbrook & Sewell, 2004). The authors of the ECST-R understand
that criminal defendants may fake incompetence and may not be truthful during the CST
one’s presentation. The Atypical Presentation scales can be broken down into five
(ATP-B). The ATP-R scale assesses the level of common concerns that a defendant may
face, thus low scores in the ATP-R scale may indicate defensiveness. Defendants faking
psychotic symptoms will have high scores in both ATP-P and ATP-N, as reflected by a
high ATP-B score, while defendants faking nonpsychotic symptoms will only have high
scores in ATP-N. The ATP-I scale is determined by the sum of the ATP-P and ATP-N
which together produce the impairment ratings. The process of scoring the ATP scales is
designed to be simple with the items scored on scale of three points (0 = no,
1 = sometimes, and 2 = yes). Each ATP scale is then summed and the raw scores are
30
converted to t-scores. The inclusion of these scales is significant not only because of their
ability to detect feigning, but also due to their focus on asking about the presence or
psychosis.
Psychometric Properties
Norms
The validation of the ECST-R (Table 1) was based on four clinical studies that
together contained eight different samples of defendants and these groups were
supplemented by five additional samples (Rogers et al., 2001; 2003; Rogers, Martin, &
Sewell, 2003; Tillbrook, 2000). The first study used 100 male jail detainees divided into
two groups: genuine (nonfeigning) and feigning (probable faking group; Rogers et al.,
2001). The genuine group referred to defendants with valid reports of mental disorders
whereas the feigning group referred to the defendants with false reports of mental
disorders. The second study used 28 male pretrial defendants that were referred for CST
evaluations and 42 male defendants who had been found incompetent to stand trial
(Tillbrook, 2000). The third study used 56 male defendants hospitalized in a competency
restoration program in various stages of treatment. These defendants were divided into
two groups based on those with genuine mental illness and those believed to be feigning
(Rogers, Sewell, Martin & Jackson, 2003). The fourth study used 154 jailed defendants
randomly assigned to two groups: genuine (legitimate mental disorders) and feigning
(probable false mental disorders; Rogers, Martin & Sewell, 2003). An additional sample
drawn from competency cases administered in CST evaluations was collected, mainly for
31
normative reasons and, to increase the overall sample for confirmatory factor analyses
The authors of the ECST-R appear to have considered cultural diversity in their
validation study, as the norm group as a whole was made up of about 53% Caucasian,
40% African American, and 10% Hispanic individuals. In terms of gender, two studies
were all male defendants (Rogers et al., 2001; Tillbrook, 2000). The third validation
sample used 82.1% male and 17.9% female (Rogers et al., 2003), the fourth validation
sample used 52.1% male and 47.9% female (Rogers et al., 2003), and the ancillary data
used 85.4% male and 14.6% female (Rogers et al., 2004). Therefore, the norms for the
Reliability
defendants that were re-administered the ECST-R one week after an initial administration
(Rogers et al., 2004). The interviewers were unaware of the results from the first
administration when they conducted the second administration of the ECST-R. The
findings indicated that 90% of the defendants’ scores on the first administration of the
ECST-R did not vary significantly on the second administration (Rogers et al., 2004).
the test manual are listed in Table 2 along with the mean interitem correlations (Rogers et
al., 2001; 2003; Tillbrook, 2000). These findings generally demonstrate good internal
consistency and acceptable similarity of item content for these measures. In one study,
Rogers et al. (2001) elected to combine data used by Tillbrook’s (2000) and Rogers and
Grandjean (2000) studies and the results of this integration of data yielded alpha
32
coefficients that were acceptable for CWC and excellent for factual and rational
understanding (Rogers et al., 2001). In later studies, Vitacco, Rogers, Gabel & Munizza
(2009) reported similar internal consistencies on two scales FAC, RAC, but a notably
higher CWC; while Norton and Ryba (2010) reported internal consistencies, which were
a bit lower than those reported in the normative studies (Table 2). Two additional studies
reported that the FAC showed good internal consistency, although in these studies the
RAC and CWC were not administered (Springman, 2007; Springman & Vandenberg,
2009). While some variance is seen across these findings, it is no more than might be
expected given the challenges in assessing strictly defined clinical constructs, and overall,
it appears the scales possess an acceptable level of consistency in terms of the constructs
measured.
Interrater reliability was initially established with either two (Rogers et al., 2001;
Rogers et al., 2003) or three (Tillbrook, 2000) independent examiners who showed a
fairly high level of agreement in CST decisions (89.3%) among the raters. The average
interrater reliabilities for the ECST-R scales are reported in Table 3, although some of the
defendants that could be applied to more than one category (Tillbrook, 2000). The overall
results of these studies indicate that the ECST-R provides consistency in measurement
Validity
Empirical support has been established regarding the ECST-R’s utility with
individuals suffering from mental illness as well as with, criminal populations and
defendants who were suspected of feigning based on four clinical studies supplemented
33
by five additional samples (Rogers, Jackson, Sewell, & Harrison, 2004; Rogers et al.,
2003; Rogers, Sewell, Grandjean, & Vitacco, 2002; Vitacco, Rogers, Gabel, & Munizza,
2007). The test authors used independent experts and applied bootstrapping procedures to
CST decisions which are confounded by clinical opinion. The ECST-R has shown
psychopathology and competency related abilities (Rogers et al., 2001; Rogers et al.,
Evidence for the construct validity of the ECST-R has been shown via
confirmatory factor analysis that found support for the three competency scales
applicability to the three prongs drawn from the Dusky standard (Rogers et. al., 2001;
Rogers et al., 2003). The first study conducted by Rogers et al. (2001) recommended the
use of a two-factor model which essentially focused on the factors that measure factual
understanding and rational understanding. In later research, Rogers et al. (2003) assessed
the factor structure of the ECST-R according to three competing competency models
referred to as: discrete abilities (three factor model), domains (two factor model) and
cognitive difficulty (two factor model). They found that the discrete abilities model was a
moderately good fit whereas the domains and cognitive difficulty models tested
previously produced less appropriate fits. The findings of the confirmatory factor analysis
provided support for the model using the three-factor discrete abilities which, indicated
the ECST-R’s competency scales are appropriately aligned with the abilities drawn from
terms of differentiating feigned from real symptoms (Norton & Ryba, 2010; Springman
& Vandenberg, 2009; Vitacco et al., 2007, 2009). Norton and Ryba (2010) found that
honest responders varied from coached feigners demonstrating large effect sizes for
differences on all of the competency scales, with the strongest effects seen for the CWC
scale. Vitacco et al. (2009) found that a group of “possible feigners” had considerably
elevated scores in comparison to the genuine group and they also report large effect sizes
for RAC (d = 1.60) and CWC (d = 1.89), although for FAC they found only a moderate
In terms of using the ECST-R minorities and women, very little information has
been reported that specifically speaks to these issues. Two studies done using only the
and Ryba (2004) found no significant differences between the control and feigned mental
illness groups in terms of ethnicity. However, the dearth of research in this area leave it
an open question regarding whether the ECST-R works equally well across a variety of
Overall, the psychometric properties of the ECST-R indicate that this instrument
related abilities (Rogers et al., 2001; Rogers et al., 2003; Tillbrook, 2000). In addition,
the competency scales of the ECST-R effectively differentiate between feigned mental
35
illness and real symptoms (Norton & Ryba, 2010; Rogers et al., 2003; Rogers, et al.,
2004; Springman, 2007; Springman & Vandenberg, 2009). The ECST-R’s competency
scales demonstrated good internal consistency and inter-item correlations and appear to
have appropriate interrater reliability for the competency scales indicating high levels of
agreement among evaluators (Rogers et al., 2001; Rogers et al., 2003; Tillbrook, 2000).
Since the ECST-R also includes the ATP scales that are designed to detect
feigning, it is important to consider the psychometric properties and the utility of these
scales in detecting those who may try to falsely present as incompetent. In terms of
reliability for the ATP scales, moderate to high levels of internal consistency were
reported in the test manual: ATP-R = .63, ATP-P = .79, ATP-N = .70, ATP-I = .87 and
ATP-B = .86 (Rogers et al., 2004). These findings have generally been supported by
studies that have followed that also report good internal consistency for the overall ATP
scales (α = .85; Springman, 2007; Springman & Vandenberg, 2009). Norton and Ryba
reported appropriate levels of internal consistency: ATP-R = .50, ATP-P = .74, ATP-
N = .73, ATP-B = .82, although the ATP-I scale was not assessed. Vitacco et al., (2007)
found that the five ECST-R ATP scales showed good internal consistency (ranges of
α = .72 to .90) and mean interitem correlations (ranges from .22 to .42). No studies have
assessed the test-retest reliability of the ATP scales and, due to the objective and
Regarding the validity of the ATP scales, there is evidence to show that these
scales can effectively differentiate between feigned mental illness and real symptoms as
well as feigned mental illness and honest/not mentally ill responders (Norton & Ryba,
36
2010; Rogers et al., 2003; Rogers, et al., 2004; Springman, 2007; Springman &
Vandenberg, 2009) and that these scales appropriately relate with measures of
psychopathology and cognitive abilities (Rogers et al., 2004). The ECST-R ATP scales
also demonstrate good convergent validity with the scales of the Structured Interview of
Reported Symptoms (SIRS; Rogers, Bagby, & Dickens, 1992) with correlations between
Two studies have investigated the effects of test coaching and whether a coached
feigner can elude detection using the ECST-R (Springman, 2007; Springman &
Vandenberg, 2009). The results of these studies indicated that, even with coaching, the
compared to the control group and it was concluded that coaching had very little
Even after an exhaustive literature search, the only empirical studies conducted
using the ECST-R are those reviewed above that relate to an evaluation of the test’s
psychometric properties. At present, no other research has been conducted with this
competence due to the semi-structured nature of the test and the very case and symptom-
specific scoring system that is used in determining the outcomes across the three
competency scales. CST measures that use single scores and structured questioning are
more suitable for research purposes, but the ECST-R’s unique scoring is designed for
making psycholegal decisions about competency and is not easily adapted for use in
research investigations (Rogers et al., 2004). As such, it is important to note, that at the
37
present time, there are only four studies that have been conducted by researchers who are
not authors of the test itself or who were not part of the test development team. Having so
Like the MacCAT-CA, the ECST-R also has identified strengths and weaknesses.
One area of concern that has been noted is that the ECST-R focuses heavily on psychosis
as the source of impairment (Norton & Ryba, 2010; Rogers, Tillbrook & Sewell, 2004).
The ECST-R is most suitable in cases where a defendant’s competence is impaired due to
psychosis since the ECST-R is not designed to assess other types of impairments.
Although it is true that a diagnosis of psychosis is still the most common reason for
Kugler 1991), is generally accepted that psychosis and legal incompetence are not one
and the same and the concern lies in the fact that the ECST-R may miss or underreport
Another critique of the ECST-R is that the semi-structured format complicates the
scoring process for evaluators (Tillbrook, 2000). For example, the FAC scale is simple to
score due to the explicit scoring criteria; however, some of the raters reported difficulty
in scoring the semi-structured responses of the CWC scale when a defendant’s answers
could be applied to more than one category or no category at all (Tillbrook, 2000). A
recommendation made to improve the clarity of scoring on this scale was to add more
rating categories for certain items and more clearly define the degree of impairment
meant for each rating (Tillbrook, 2000). Certainly, the semi-structured nature and flexible
38
format of the questioning and scoring of the ECST-R has limited the ability for
researchers to use this test in tightly controlled settings that are often used to evaluate the
quality and effectiveness of tests or to replicate the results reported by the test authors
during development of the instrument. But, even more importantly, the question remains
as to whether the flexibility inherent in this instrument offers more harm, via inconsistent
scoring, than the good that stems from the examiner’s ability to adjust the questions
In terms of strengths, the creators of the ECST-R have provided empirical support
for its use with forensic populations, suspected malingerers, mentally ill patients, and
correctional samples (Rogers et al., 2002; Rogers, Jackson, Sewell, Tillbrook, & Martin,
2003; Rogers, Jackson, et al., 2004; Vitacco et al., 2007). The ECST-R also has a strong
theoretical basis as it was created to parallel the three Dusky prongs related to the
psycholegal factors required by the justice system to determine CST (Golding, 2008;
Rogers et al. 2003; Rogers & Johansson-Love, 2009). The ECST-R’s strict alignment
with Dusky minimizes the incorporation of unnecessary information that may complicate
CST evaluations. Further, in terms of meeting the Daubert criteria, the test authors
maintain that one of the advantages of the ECST-R is its likelihood for admissibility
under the Daubert standard (Rogers, Tillbrook & Sewell, 2004; Rogers & Johansson-
Love, 2009).
maximum relation with a defendant’s case because it allows the integration of questions
that can be specifically tailored to the defendant’s case (Rogers et al., 2004). The ECST-
the defendant’s relationship with an attorney. The flexible format provides a defendant
supplementary feature to screen for feigning via the Atypical Presentation Scales which
(Norton & Ryba, 2010; Rogers et al., 2001; Rogers et al., 2002; Rogers, Jackson, Sewell,
Tillbrook, & Martin, 2003; Rogers, Jackson, et al., 2004; Springman, 2007; Springman &
Vandenberg, 2009; Vitacco et al., 2007, 2009). Thus, the ECST-R effectively screens for
feigning and does so in a manner that conceals the strategies that it uses so that
defendants are limited in their ability to detect the screening techniques (Rogers et al.,
2004). Consequently, the ECST-R’s approach to feigning may provide evaluators with
useful and unrehearsed information from defendants who may be motivated to exaggerate
with strong psychometric properties. Research on the ECST-R has confirmed that it
possesses good reliability and validity for assessing competence while also providing a
screen for feigning. Although the ECST-R is highly recommended to forensic clinicians
in the evaluations of CST, critics have commented on its emphasis on psychosis, the
semi-hybrid format that complicates scoring, and the limited scope of practice due to the
original norming sample focusing primarily on males with an IQ higher than 60 who
speak English.
40
CHAPTER 4
Few articles have been published that directly compare the MacCAT-CA and the
ECST-R. One study compared the MacCAT-CA, the ECST-R, and the CAST-MR in
regard to their relevance to the Dusky standard as well as their error and classification
rates by using the Daubert standard as a guideline (Rogers & Johansson-Love, 2009). In
terms of the Dusky standard, the authors investigated items of each of the scales of the
MacCAT-CA and the ECST-R. The findings indicated that both instruments thoroughly
assessed the factual understanding prong in terms of roles of courtroom personnel and
in the way they measure the rational understanding prong of Dusky. The MacCAT-CA
focuses on a defendant’s beliefs about the outcome of their case and beliefs about the
Whereas, the ECST-R focuses on a defendant’s decision-making ability about one’s trial,
such as the ability to consider different outcomes of one’s trial as well as any challenging
courtroom experiences. In terms of the Consult with Counsel prong of Dusky, the
differences in the way these two instruments approach competency was most prominent
in this domain. The MacCAT-CA relies on a hypothetical scenario to assess how well a
defendant can discriminate between valuable and useless information to share with their
41
attorney such as being able to understand as well as explain important mitigating issues.
The ECST-R relies on a defendant’s own criminal case to assess a defendant’s insight
and what they expect of their attorney. The CWC scale of the ECST-R uses open-ended
questions about a defendant’s case to assesses the individual’s ability to solve differences
with one’s attorney as well as any impairments that may interfere with a defendant’s
The authors also concluded that the MacCAT-CA may have problems meeting the
ability to consult with counsel; whereas ECST-R integrates a defendant’s case in its semi-
structured test format to achieve the same (Rogers & Johansson-Love, 2009). They note
that the MacCAT-CA may be useful in assessing defendants who want to represent
themselves due to the difficulty of the instrument’s material, which in turn provides a
complicated legal information. Rogers and Johansson-Love (2009) also investigated the
length and complexity of the questions in both instruments. The authors were interested
in the possibility that word length and complexity of items may impact the performance
MacCAT-CA and the ECST-R. They found that in terms of word length on two of the
competency scales of the ECST-R typically had less than 10 (Rogers & Johansson-Love,
2009). The authors concluded that perhaps the MacCAT-CA would be less appropriate
for defendants of lower intelligence such as those with intellectual disabilities, whereas
Tillbrook (2000) conducted a study that was used as part of the basis for the
psychometric strengths and weaknesses of the MacCAT-CA and the ECST-R. The results
of this study revealed that even though the ECST-R and the MacCAT-CA are very
different in their approach to CST, there were several similarities in their overall
performance. Both tests showed good interrater reliability and internal consistency and
support was found in terms of convergent validity, concurrent validity, and hit rate
analyses (Tillbrook, 2000). Specifically, in terms of predictive validity, the ECST-R and
scale produced a lower correlation with the Appreciation scale of the MacCAT-CA,
which the author attributes to the differences in the way these scales measure a
Finally, Tillbrook (2000) also sought to investigate how evaluators rate, organize,
and associate items on the ECST-R and MacCAT-CA in CST evaluations. The findings
indicated that when using the ECST-R, FAC appears to have the most influence on an
evaluator’s decisions of CST followed by RAC and CWC. Similarly, when using the
evaluator’s decision, then Appreciation and Reasoning. The domains of the ECST-R and
the MacCAT-CA were also compared to the criterion but produced slightly different
results than those reported by the evaluators as being most influential in decisions of
CST. For the ECST-R, the scales that correlated with the criterion from highest to lowest:
43
RAC, FAC, and CWC. For the MacCAT-CA, the scales that correlated with the criterion
(2000) makes an equal recommendation for using either instrument due to their strong
to CST evaluations.
available from 1990 to 2009 on the MacCAT-CA and the ECST-R. The analyses
included all available studies that included participants with an IQ greater than 60 who
were over the age of 18 years old. Studies using participants with intellectual disabilities
or who were adolescents were excluded. The purpose of this meta-analysis was to
investigate the specific characteristics that differed between competent defendants and
legally incompetent defendants and the specific variables of interest included: psychosis,
The findings indicated that psychosis was most related to incompetency and that
the Appreciation scale (MacCAT-CA) and the CWC scale (ECST-R) showed the largest
effect sizes in terms of the level of impairment due to psychosis. The Factual
relationship with the overall competency score and further, all three of the MacCAT-CA
scales were significantly related to IQ. The findings also indicated that depression had a
small but significant positive relationship with each of the competency scales of the
MacCAT-CA; however, the author also noted the large variation in the findings among
the 13 studies in the meta-analysis in regards to what they reported on the relationship
between depression and the competency scales of the instruments. In terms of hostility,
44
the author found a significant negative correlation between hostility and Reasoning
(MacCAT-CA). In examining the factors that possibly added to the considerable variance
among effect sizes, a consistent pattern arose in which the MacCAT-CA had greater
effect sizes than the ECST-R when comparing the performances of competent and
demonstrates more sensitivity on each domain of competency than the ECST-R which
contradicts the study conducted by Tillbrook (2000) also comparing these two tests.
Rogers et al. (2001) say that it is immaterial to a defendant’s case and difficult for
defendants to identify with. The results of this meta-analysis agree with Rogers et al.
Overall, the few articles that compare the MacCAT-CA and the ECST-R
demonstrate how these tests differ in their approach to CST. While research on the
psychometric properties of the ECST-R and the MacCAT-CA provide empirical support
for both tests in terms of reliability and validity, these are differences noted in terms of
the connection to the three Dusky prongs and in the likelihood of the tests meeting the
Daubert criteria. Comparative research also shows the tests have similarities in how they
assess competence abilities related to factual and rational understanding but that they
CHAPTER 5
The aim of this literature review was to summarize the literature that pertains to
the MacCAT-CA and the ECST-R, to report on the strengths and weaknesses of each test,
to compare the tests with one another, and to make recommendations for use in clinical
Above, the tests were summarized, compared, and the strengths and weaknesses were
discussed. Below follows a discussion of the limitations and recommendations for use of
each in clinical practice. While recommendations for clinical use of the tests are
First, it is clear that, based on the research available, the MacCAT-CA and the
ECST-R both have strong psychometric properties, particularly in terms of reliability and
validity. Further, both tests include substantial numbers of individuals in their norm
groups; however, it is notable that both tests are somewhat limited in terms of the
inclusion of women and minorities in their norms which limits the applicability and
usefulness of the tests with these groups. While use with these populations is not directly
contraindicated, the tests results should be applied and interpreted with caution as
appropriate comparisons are limited and there is a noticeable lack of research that would
46
be necessary to provide evidence that these tests work equally as well with women and
Second, the test manual for both the ECST-R and the MacCAT-CA state that
these tests are not optimal for defendants that do not speak English. Presently, no
literature is available that explores the use of these specific instruments with non-English
speakers and both test manuals caution against using interpreters as the nature of the
questions could easily get lost in translation. Therefore, neither test is appropriate for use
when using the test with individuals for whom English is their second language, as the
evaluator must ensure any impairments seen are a result of deficits in functioning and not
Third, neither test should be used with defendants who have substantial deficits in
cognitive functioning, and particularly not with those diagnosed with an Intellectual
Disability. The consensus among the existing literature is that the MacCAT-CA is not
suitable for use with defendants with ID because of its lengthy items, rigid
cognitively impaired individuals. Due to these test characteristics, studies have shown
there is reason to be concerned that competent defendants with ID are at an elevated risk
to be found incompetent (Bennett, 2005; Jurescka, 2010; Jurescka et al., 2012; Pinals et
al., 2006). Further, research has shown that the MacCAT-CA’s word length and the
complexity of items can negatively impact the performance of defendants with a lower
is currently available that explores the ECST-R’s utility with ID populations, the test
47
authors note in the manual that the test is not recommended for use with those of limited
In terms of whom each test should be used with in clinical practice, the research
available allows for some tentative recommendations to be offered. It is clear that, even
though both the MacCAT-CA and the ECST-R were developed specifically for CST
flexible scoring format. As such, the hybrid format of the administration may complicate
scoring responses on the ECST-R and make it difficult to categorize particular ratings in
some instances. These differences in test construction cause the MacCAT-CA and the
It appears the ECST-R is most likely to be useful in cases where defendants are
presenting with symptoms of a mental illness and particularly so in cases where there is
suspicion of feigning. The ECST-R is particularly well suited for cases where a
defendant’s competence is impaired due to psychosis because of its unique focus on the
impact of psychosis on competence abilities as well as the availability of the ATP Scales,
which can help to directly assess the veracity of psychotic symptoms. While the ECST-
R’s strength appears to be with cases in which defendant’s present with psychosis, there
is concern that it misses other sources of competence-related impairment that are not due
directly to psychosis and, as such, the test would be less useful in these types of cases.
The MacCAT-CA, on the other hand, appears to be most useful in cases where
defendants are presenting with symptoms of a mental illness and an average level of
48
defendants that present with some limitations, there is concern whether it is appropriate to
use with defendants who are severely impaired due to symptoms of psychosis or deficits
in cognitive functioning. In such cases, it is likely that these defendants are too impaired
to complete the test. Thus, the MacCAT may be best suited for any “grey area” cases
where the defendant has some limitations that are not too severe.
Research also suggests that, due to the nature of some of the questions, the
In such cases, the Rational and Factual Understanding scales are useful in providing the
courts with a benchmark for a defendant’s knowledge of the legal system, ability to plea
bargain, and reason with important legal information which are important factors in self-
representation cases. The MacCAT-CA is also likely to be a useful tool for defendants
who might improve their performance with the retest feature, such as defendants with
minimal education or limited familiarity of the legal system. C Finally, the MacCAT-CA
may be useful with defendants who are paranoid, mistrustful, guarded, or those who are
severely distressed about their criminal charges because of its reliance on a hypothetical
scenario rather than focusing on questions related to the defendant’s own case. On the
other hand, the MacCAT-CA may not be optimal for a busy inpatient public mental
health setting where evaluators have a higher need for flexibility in servicing a diverse
Overall, the MacCAT-CA and the ECST-R provide two well-designed and
validated options for clinicians to use while conducting CST evaluations. While, on the
49
surface, it would appear that two tests designed with the same purpose should be able to
be used interchangeably and in all situations in which such a test might be necessary, but
it is clear that is not the case with the ECST-R and the MacCAT-CA. The important
differences in the test structure and design, in the administration and scoring, and in the
interpretation, that can be drawn from the results all lead to many differences in the
clinical utility of each instrument. It is clear that both tests are useful, but it is also clear
that both tests have their limitations and it is very important that any forensic evaluator
about the nature and limitations of these two instruments that are discussed herein. While
both tests are useful, the limitations of each is certainly important enough to warrant
caution in the use with certain groups either permanently, or until additional research is
On the whole, there have been a large number of studies conducted on or with the
MaCAT-CA, but far fewer that have included or evaluated the ECST-R. There are even
fewer studies that have directly compared the MacCAT-CA to the ECST-R to one
two instruments. Specifically, a study of CST evaluee performance on both tests that
and the ECST-R hold people to the same standard of competence. In one unpublished
simulation study such a comparison was conducted and revealed large differences across
competency classifications across the two instruments (Panza, Celeya, Slavikova &
Preston, 2016). Further research along these lines that includes both competent and
50
incompetent forensic patients, as well as those with a wide variety of impairments, would
Surveys of test usage, and specifically usage of FAIs, have been conducted in the
past and have generally revealed a low level of usage of CAIs in CST evaluations
(Archer, Buffington-Vollum, Stredny & Handel, 2006; Borum & Grisso, 1995; Grisso,
1991, 2003; Holub, 1992; Neal & Grisso, 2014). Since these surveys were done at least a
decade, if not two decades ago, it would certainly be useful to conduct an updated survey
of clinician use of these instruments. It would be of interest to know not only how many
examiners use the MacCAT-CA and the ECST-R, but also to know what types of cases
they use each test for and whether or not they use the tests in conjunction with any other
instruments.
Finally, there is significant need for more research in particular on the ECST-R’s
clinical utility with different populations. The vast majority of the research available on
the ECST-R was conducted by the test authors themselves, or by the authors in
conjunction with their colleagues. Very few of the published works on the ECST-R are
authored by individuals not directly involved in the test’s development. While this does
not imply that the research available is problematic or biased in any way, it is certainly a
boost to the instruments validity when those outside of the development team can
replicate or expand upon the original findings noted in the test manual. One explanation
for the dearth of literature available outside of that done by the test authors lies in the fact
that the ECST-R’s hybrid design makes it less than optimal for research purposes. The
unstructured interview and the flexible scoring options necessarily lead to inconsistencies
across examiners and examinees making it difficult to generalize any findings or draw
51
any firm conclusions about the test. Further, these characteristics make the test very
additional research that explores the clinical utility of the ECST-R in a variety of contexts
and populations, it is very difficult to comment on how this test would hold up.
In sum, the MacCAT-CA was published close to 18 years ago and the ECST-R
was published close to 13 years ago. Since their publications, research a fairly large
amount of knowledge has become available to help practitioners decide how best to use
these tests. Given that research which is available there is some foundation to make the
tentative recommendations for the use of these tests in clinical forensic practice that are
offered herein. Despite the evidence in support of using the MacCAT-CA and the ECST-
R, there still appears to be a wide range of practice among evaluators in the field of
forensic psychology and these inconsistencies can have a significant impact on the
outcomes of criminal cases (Neal & Grisso, 2014). Since the process of conducting
forensic assessments is not standardized, clinicians are free to arbitrarily select any
method they prefer when conducting CST evaluations and there does not appear to be a
current consensus among practitioners to use any tool. The question remains why
forensic evaluators would pass up using these tools specifically designed to simplify the
assessment process of CST. Given the high number of evaluators resorting to such a wide
range of practices, one must consider that perhaps they are not convinced of the utility of
standardized forensic instruments such as, the MacCAT-CA and the ECST-R. This is
especially true for forensic experts that view competency as highly dependent on context
whom consider the clinical interview as remaining superior to any standardized tool
(Golding, 2008; Roesch et al., 1996). Perhaps the present study can help guide clinicians
52
who conduct CST evaluations in determining when the MaCAT-CA and the ECST-R
may be a useful forensic tool to use and in which cases the use of either test might be
indicated.
53
APPENDIX
TABLES
Rogers, Rogers,
Grandjean, Jackson, Vitacco,
Tillbrook, Sewell, Rogers, Rogers,
Vitacco & Tillbrook Tillbrook, Gabel, & Norton
Tillbrook Sewell & Martin & Sewell Munizza & Ryba
(2000) (2001) (2003) (2004) (2009) (2010)
FAC .92 .90 .87 .87 .85 .76
RAC .75 .86 .89 .89 .91 .74
CWC .33 .72 .83 .83 .92 .87
Rational NA NA .99 .93 .95 .89
ATP-R NA NA 1.00 .63 NA .50
ATP-P NA NA 1.00 .79 NA .74
ATP-N NA NA 1.00 .70 NA .73
ATP-I NA NA 1.00 .87 NA NA
ATP-B NA NA 1.00 .86 NA .82
Rogers, Rogers,
Grandjean, Jackson, Vitacco,
Tillbrook, Sewell, Rogers, Rogers,
Vitacco & Tillbrook Tillbrook, Gabel, &
Tillbrook Sewell & Martin & Sewell Munizza
(2000) (2001) (2003) (2004) (2009)
FAC .95 .98 .98 .96 .97
RAC .88 1.00 .99 .91 .98
CWC .88 .95 .99 .91 .99
Rational .94 1.00 .99 .96 .98
ATP-R NA .99 1.00 1.00 NA
ATP-P NA 1.00 1.00 1.00 NA
ATP-N NA .96 1.00 .98 NA
ATP-I NA 1.00 1.00 1.00 NA
ATP-B NA .99 1.00 1.00 NA
55
REFERENCES
Borum & Grisso (1995). Psychological test use in criminal forensic evaluations.
Professional Psychology: Research and Practice, 5, 465-473.
Burnett, D. M., Noblin, C. D., & Prosser (2004). Adjudicative competency in a juvenile
population. Criminal Justice Behavior, 31(4), 438-462.
Edens, Poythress, Nicholson, & Otto. (1999). Effects of state organizational structure and
forensic examiner training on pretrial competence assessments. The Journal of
Behavioral Health Services & Research, 26(2), 140-150.
Grisso, T., Steinberg, L., Woolard, J., Cauffman, E., Scott, E., Graham, S., Lexcen, F.,
Reppucci, N. D., & Schwartz, R. (2003). Juveniles’ competence to stand trial: A
comparison of adolescents’ and adults’ capacities as trial defendants. Law and
Human Behavior, 27(4), 333-363.
Grisso, T., Vincent, G. M., & Seagrave, D. (Eds.). (2005). Mental health screening and
assessment in juvenile justice. New York, US: The Guilford Press. Retrieved from
http://www.ebrary.com.
Hoge, S. K., Bonnie, R. J., Poythress, N., Monahan, J., Eisenberg, M., & Feucht-Havier,
T. (1997). The MacArthur adjudicative competence study: Development and
validation of a research instrument. Law and Human Behavior, 21, 141-182.
Jacobs, M. S., Ryba, N. L., & Zapf, P. A. (2008). Competence-related abilities and
psychiatric symptoms: An analysis of the underlying structure and correlates of
the MacCAT-CA and the BPRS. Law and Human Behavior, 32, 64-77.
Jurecska, D. E. (2010). Competence to stand trial: Special challenges for the populations
diagnosed with intellectual disabilities and borderline intellectual functioning.
Dissertation Abstracts International: Section B: The Sciences and Engineering,
72 (7-B), 4296.
LaVelle Fick, S., Hart, K. J., & Deardorff, P. A. (2006). The performance of incarcerated
juveniles on the MacArthur Competence Assessment Tool-Criminal Adjudication
(MacCAT-CA). The Journal of the American Academy of Psychiatry and the
Law, 34, 360-373.
Lexcen, F. J., Hawk, G. L., Herrick, S., & Blank, M. (2006). The use of video
conferencing for psychiatric and forensic evaluations. Psychiatric Services, 57(5),
713-715.
Lipsitt, P. D., Lelos, D., & McGarry, A. L. (1971). Competency for trial: A screening
instrument. American Journal of Psychiatry, 128, 105–109.
Lyle, V. K. (2003). Emotional maturity and trial competence in urban male adolescents.
Dissertation Abstracts International: Section B: The Sciences and Engineering,
64(3-B), 1498. Publisher: ProQuest Information & Learning; [Dissertation]
Marcus, D. K., Poythress, N. G., Edens, J. F., & Lilienfield, S. O. (2010). Adjudicative
competence: Evidence that impairment in “rational understanding” is taxonic.
Psychological Assessment, 22(3), 716-722.
Melton, G. B., Petrila, J., Poythress, N. G., & Slobogin, C. (1997). Psychological
evaluations for the courts: A handbook for mental health professionals and
lawyers (2nd ed.). NY: The Guilford Press.
Mumley, D. E., Tillbrook, C. E., & Grisso, T. (2003). Five-year research update (1996–
2000): Evaluations for competence to stand trial. Behavioral Sciences and the
Law, 21, 329–350.
Neal, T. M. S. & Grisso, T. (2014). Assessment practices and expert judgment methods
in forensic psychology and psychiatry: An international snapshot. Criminal
Justice and Behavior, 1, 1–16. DOI: 10.1177/0093854814548449
Nicholson, R. A., Briggs, S. R., & Robertson, H. C. (1988). Instruments for assessing
competency to stand trial: How do they work? Professional Psychology: Research
and Practice, 19, 383-394
59
Otto, R. K., Poythress, N. G., Nicholson, R. A., Edens, J. F., Monahan, J., Bonnie, R. J.,
Hoge, S. K., & Eisenberg, M. (1998). Psychometric properties of the MacArthur
Competence Assessment Tool—Criminal Adjudication. Psychological
Assessment, 10, 435-443.
Overall, J. E., & Gorham, D. R. (1962). The Brief Psychiatric Rating scale. Psychological
Reports, 10, 799–812.
Panza, N. R., Slavikova, M., Celeya, K., & Preston, K. (2016). The MacCAT-CA and
ECST-R: Instrument review, additional validation data, and recommendations for
use in competency to stand trial evaluations. Unpublished Manuscript.
Pinals, D.A., Tillbrook, C.E., & Mumley, D.L. (2006). Practical application of the
MacArthur competence assessment tool-criminal adjudication (MacCAT-CA) in a
public sector forensic setting. The Journal of the American Academy of Psychiatry
and the Law, 32(2), 179-188.
Poythress, N. G., Nicholson, R., Otto, R. K., Edens, J. F., Bonnie, R. J. Monahan, J., &
Hoge, S. K. (1999) The MacArthur Competence Assessment Tool: Criminal
Adjudication: MacCAT-CA. Odessa, FL: PAR.
Roesch, R., Hart, S. D., & Zapf, P. A. (1996). Conceptualizing and assessing competency
to stand trial: Implications and applications of the MacArthur treatment
competence model. Psychology, Public Policy, and Law, 2, 96-113.
Roesch, R., Zapf, P. A., Golding, S. L., & Skeem, J. L. (1999). Defining and assessing
competency to stand trial. In A.L. Hess & I.B. Weiner (Eds.), The handbook of
forensic psychology (pp. 327-367). New York: John Wiley & Sons, Inc.
Rogers, R., Grandjean, N., Tillbrook, C. E., Vitacco, M.J., & Sewell, K. W. (2001).
Recent interview-based measures of competency to stand trial: A critical review
augmented with research data. Behavioral Sciences and the Law 19, 503-518.
60
Rogers, R., Jackson, R. L., Sewell, K. W., & Harrison, K. S. (2004). An examination of
the ECST-R as a screen for feigned incompetency to stand trial. Psychosocial
Assessment, 16, 139-145.
Rogers, R., Jackson, R., Sewell, K., Tillbrook, C., & Martin, M. (2003). Assessing
dimensions of competency to stand trial: Construct validation of the ECST-R.
Assessment, 101, 344-351.
Rogers, R., Sewell, K. W., Grandjean, N. R., & Vitacco, M. J. (2002). The detection of
feigned mental disorders on specific competency measures. Psychological
Assessment, 14, 177-183.
Rogers, R., Tillbrook, C., & Sewell, K. (2004). Evaluation of Competency to Stand Trial-
Revised (ESCT-R): Professional manual. Lutz, FL: Psychological Assessment
Resources.
Ryba, N. L. & Zapf, P. A. (2011). The influence of psychiatric symptoms and cognitive
abilities on competence-related abilities. International Journal of Forensic Mental
Health, 10, 29-40.
Viljoen, J. L., Odgers, C., Grisso, T., & Tillbrook, C. (2007). Teaching adolescents and
adults about adjudicative proceedings: A comparison of pre- and post- teaching
scores on the MacCAT-CA. Law and Human Behavior, 31, 419-432.
Viljoen, J. L., Slaney, K. L., & Grisso, T. (2009). The use of the MacCAT-CA with
adolescents: An item response theory investigation of age-related measurement
bias. Law and Human Behavior, 33, 283-297.
61
Vitacco, M. J., Rogers, R., Gabel, J. & Munizza, J. (2009). An investigation of the ECST-
R in male pretrial patients evaluating the effects of feigning on competency
evaluations. Assessment, 16(3), 249-257.
Vitacco, M. J., Rogers, R., Gabel, J. & Munizza, J. (2007). An evaluation of malingering
screens with competency to stand trial patients: A known-groups comparison.
Law Human Behavior, 31, 249-260.
Warren, J. I., Aaron, J., Ryan, E., Chauhan, P., & DuVal, J. (2003). Correlates of
adjudicative competence among psychiatrically impaired juveniles. The Journal
of the American Academy of Psychiatry and the Law, 31, 299-309.
Zapf, P. A., & Roesch, R. (2001). A comparison of the MacCAT-CA and the FIT for
making determinations of competency to stand trial. International Journal of Law
and Psychiatry, 24, 81-92.
Zapf, P. A., & Roesch, R. (2009). Evaluation of competence to stand trial. Oxford, New
York: Oxford University Press.
Zapf, P. A., Skeem, J. L., & Golding, S. L. (2005). Factor structure and validity of the
MacArthur Competence Assessment—Criminal Adjudication. Psychological
Assessment, 17(4), 433-445.