Professional Documents
Culture Documents
Simulation Study To Demonstrate Biases Created by Diagnostic Criteria of Mental Illnesses: Major Depressive Episodes, Dysthymia, and Manic Episodes
Simulation Study To Demonstrate Biases Created by Diagnostic Criteria of Mental Illnesses: Major Depressive Episodes, Dysthymia, and Manic Episodes
fatigue or loss of interest.1 2 In other words, patients need diagnosis.7 The biases introduced by data censoring and
to meet both the major and minor criteria before being data categorisation can better explain the frailty diagnosis
diagnosed with a major depressive episode. than the input symptoms.7
Historically this symptom- based diagnostic approach Mostly designed as symptom-based composite measures,
developed by Feighner et al has been widely accepted.4 5 it is possible that the diagnostic criteria of mental illnesses
Since then, mental illnesses can be diagnosed through also create and introduce biases into diagnoses so that the
different sets of criteria. This approach is important diagnoses could not be fully explained by the input symp-
because clinicians become capable of screening toms. In concern of the biases created by the diagnostic
important symptoms before diagnosing and treating criteria alone, this study aims first to understand the rela-
patients accordingly. In fact, these criteria can also be tionships between mental symptoms and diagnoses and
seen as composite measures that use multiple measures to then to quantify the potential role of the biases regarding
capture disorders that may not be quantified with single the diagnoses by simulating populations with different
variables.6 7 Recent studies on composite measures have prevalence rates and between- variable correlations of
found composite diagnostic criteria problematic because mental symptoms.
biases can be introduced while aggregating information
from input variables.7 The biases emerge while the sums
of input variables are censored or while input variables are METHODS
transformed inadequately.7 8 In other words, biases can Assumptions and simulation parameters
be created when there is information in the composite A file containing R codes to reproduce the simulations
measures that is not explained by and unrelated to the was attached in the online supplemental file 1. Simu-
input variables.7 For example, categorising continuous lated populations with mental symptoms of different
variables considers individuals in the same group homog- prevalence rates and between-variable correlations were
enous and disregards the heterogeneity between individ- created to interpret the diagnoses and understand the
uals in the same categories.7 Such practices induce biases potential magnitudes of biases that could be introduced
and decrease measurement precision.7 8 via data processing implied by the diagnostic criteria-
Currently, there is no extensive review on the existence online supplemental file 1. Three diagnoses of mental
of these biases created by composite measures or medical illnesses were chosen for the leading associated disease
diagnoses, and only selected diagnoses have been studied burdens2: major depressive episodes for the diagnosis of
for such biases. These biases have been proven vital to major depressive disorder, dysthymic disorder, and manic
another symptom-based composite measure, the diagnosis episodes for the diagnosis of bipolar disorder.1
of frailty, a condition that often occurs in the elderly and There were assumptions made to simulate the popula-
is significantly associated with health outcomes, such as tions (table 1). First, for each simulation, the prevalence
mortality, falls, and morbidity.7 Frailty is diagnosed based rates of the input symptoms were assumed to be similar
on several symptoms and characterised by weakness and for the three diagnoses in this study. Second, the input
vulnerability to adverse health events.7 While using one symptoms for the diagnoses of major depressive episodes
of the most widely used diagnostic criteria, the Biological and dysthymic disorder correlated with the same correla-
Syndrome Model scores, to diagnose frailty,9 biases alone tion coefficients.10 The symptoms for the diagnosis of
can explain more than 71% of the variances of the frailty manic episodes correlated to one another. Third, the
input symptoms for the diagnosis of manic episodes diagnostic criteria to generate diagnosis variables and
were created independently of those for the diagnosis of domain variables in the major or minor criteria.
the other two mental illnesses. The assumptions of the For example, the diagnosis of dysthymic disorder
prevalence rates and between-variable correlations were required the confirmation of both the major criteria,
made because there was no acceptable-quality data on the ‘depressed mood most of the day for more days than
symptoms of mental illnesses published and we needed to not, for at least 2 years’ and the minor criteria, ‘the pres-
examine various combinations of these epidemiological ence of two or more of the following symptoms,’ at the
measures. There were studies on the prevalence of mental same time.1 The diagnosis based on whether subjects
illnesses,11 12 but the information on the prevalence of meeting both the major and minor criteria of dysthymic
mental symptoms was very limited. There were variables disorder is the same as identifying those with a multipli-
about depression or anxiety collected in national surveys, cative product of 1 of two binomial variables (0 and 1 for
such as the items collected through the Center for Epide- absence and presence of the major or minor criteria). In
miologic Studies Depression Scale.7 13–19 However, these the equations, two binomial variables were multiplied to
variables were not the symptoms used in the DSM-IV-TR. confirm the diagnosis of dysthymic disorder among those
Lastly, we assumed that the diagnoses were made accu- with a multiplicative product of 1. Individuals could be
rately based on the input symptoms reported precisely assigned with 0 or 1 for whether they met both criteria,
by patients and the diagnostic criteria in the DSM-IV-TR while the sum of major and minor criteria were 0, 1 or 2
were strictly followed. However, these assumptions did for the individuals. Linearly, a bias variable with values of
not hold in the real world.20 For simplicity and practicality −1 or 0 was created and those meeting both the major or
reasons, we assumed perfect diagnostic quality by physi- minor criteria were assigned with −1.7 For categorisation
cians and accurate reporting of the input symptoms by of continuous variables, bias variables were required to
patients in the simulated populations. remove the variations between the subjects in the same
categories.7 Other equations to generate the intermediate
variables and the diagnoses were listed and explained in
Diagnostic criteria as mathematical functions
tables 2–4.
The input symptoms were extracted from the major and
minor criteria of the diagnoses and listed in tables 2–4.
Generation of bias variables
The input symptoms, major and minor criteria, and the
Bias variables could be generated while binomial input
diagnoses were assigned variable names. All input symp-
symptoms were summed or multiplied to obtain bino-
toms, items or domains in the major or minor criteria,
mial intermediate or diagnosis variables (see the example
and the diagnoses were binomial variables, presenting
in the previous two paragraphs).7 A visual presentation
0 and 1 for the absence and presence of the symptoms, of how bias variables were generated was published.7
criteria or the diagnoses, respectively. For example, two Therefore, the number of bias variables depended on
symptoms, ‘insomnia’ and ‘hypersomnia’, were extracted the complexity of how the diagnoses were made. For
from one of the minor criteria for the diagnosis of major example, six of the nine items or domains in the minor
depressive episodes.1 Two other symptoms, ‘more talk- criteria for the diagnosis of major depressive episodes
ative than usual’ and ‘pressure to keep talking’, were were the censored sums of the input symptoms and six
extracted from one of the minor criteria for the diagnosis bias variables were derived along with the intermediate
of manic episodes.1 variables that represented the items in the minor criteria.
Mathematical functions were generated based on the All bias variables were described in tables 2–4.
diagnostic criteria to convert input symptoms into diag-
noses. For example, one of the minor criteria of dysthymic Simulation parameters and simulated populations
disorder was ‘poor appetite or overeating.’ This required We simulated populations of 100 000 subjects. There
two input symptoms and one bias variable to generate the were five prevalence rates to simulate the input symp-
criterion.7 In other words, ‘poor appetite or overeating’ toms for the diagnosis of major depressive episodes,
equalling the sum of two input variables, ‘poor appetite’ dysthymic disorder and manic episodes: 0.05, 0.1, 0.3, 0.5
and ‘overeating,’ and a bias variable to achieve censoring and 0.7. The correlations between the input symptoms
of the sum of both variables.7 The sum of two binomial were hypothesised to be 0, 0.1, 0.4, 0.7 and 0.9. There
variables could be 0, 1 and 2 for the subjects. However, to were 25 combinations of the assumed prevalence rates
derive a binomial variable (having at least one symptom) and between-variable correlations. The presence of the
based on a distribution of 0 to 2, the bias variable had input symptoms was randomly assigned to the subjects
values of −1 for subjects with both symptoms to obtain after specifying the prevalence rates and between-variable
values less than or equal to one in all subjects.7 There- correlations between the input symptoms.21 22 The inter-
fore, the bias variable had values of −1 for the subject with mediate and diagnosis variables were derived according
both symptoms and 0 for the other subjects. In addition to the equations in tables 2–4. For each combination of
to adding variables together to derive an intermediate prevalence rates and between-variable correlations, the
variable or a diagnosis, multiplication, categorisation, populations were simulated for 100 times to obtain the
and other more complicated methods were used in the mean values and 95% CIs of derived prevalence rates, as
Major criteria,
essential for
diagnosis
Depressed mood
or a loss of interest
or pleasure in daily
activities for more
than 2 weeks.
Depressed mood for mde_ma1
more than 2 weeks.
Loss of interest or mde_ma2
pleasure in daily
activities for more
than 2 weeks.
Minor criteria mde_mi
(at least 5 of
the symptoms
including the
two in major
criteria)
Continued
Continued
5
6
Table 2 Continued
Equations to derive
Classification of Criterion Domains in the major Domain Symptom diagnosis or domain Approximation by linear Mechanisms related to
symptoms variable or minor criteria variables Symptoms variables variables regression introducing biases
Loss of energy mde_mi6_2
Open access
Information of mde_mi6_bias
the domain not
explained by the
input variables
Feelings of mde_mi7 mde_mi7=mde_mi7_1+ Censoring of the sum of
worthlessness or mde_mi7_2+ multiple input variables
excessive guilt mde_mi7_bias
Feelings of mde_mi7_1
worthlessness
Feelings of mde_mi7_2
excessive guilt
Information of mde_mi7_bias
the domain not
explained by the
input variables
Diminished ability to mde_mi8 mde_mi8=mde_mi8_1+ Censoring of the sum of
think or concentrate, mde_mi8_2+ multiple input variables
or indecisiveness mde_mi8_bias
Diminished mde_mi8_1
ability to think or
concentrate
Indecisiveness mde_mi8_2
Information of mde_mi8_bias
the domain not
explained by the
input variables
Recurrent thoughts mde_mi9
of death
Information mde_bias1 Bias introduced to
due to categorise the sum of
categorisation the number of confirmed
(choosing three symptoms in the minor
domains in criteria
minor criteria)
Information mde_bias2 Bias introduced to
due to categorise the sum of
categorisation the number of confirmed
(choosing four symptoms in the minor
domains in criteria
minor criteria)
data categorisation
introducing biases
Due to the existence of the biases, the input symptoms
were not likely to fully explain the diagnoses.7 There-
fore, the diagnoses were approximated by the input,
bias and intermediate variables individually and collec-
tively.7 13 15 17 The approximation was conducted using
forward-stepwise linear regressions.7 13 15 17 23 The inter-
pretability of the diagnoses by the input symptoms and
Approximation by linear
RESULTS
The derived prevalence rates of the input symptoms for
the three mental illnesses matched the assumed rates in
the online supplemental file 1. The derived correlations
between the input symptoms were close to assumed levels
Symptom
variables
The items in the major and minor criteria were the inter-
mediate variables necessary to create the diagnoses. The
methods used to generate the intermediate variables
were important for the prevalence rates of the interme-
diate variables and the derived diagnoses in figure 1. For
variables
Domains in the major Domain
symptoms in figure 2.
explained by
the domains
symptoms
of symptoms variable (domains) variables Symptoms variables variables Approximation introducing biases
Dysthymia dys=dys_ma x dys_mi dys=intercept Multiplication to create
(variable=dys) + coef1 x the situations where both
dys_ma+coef2 x the major and minor
dys_mi+coef3 x criteria met (union of
dys_bias two binomial variables,
mde_ma x mde_mi)
and the bias variable
(dys_bias) equivalent
to the residual of the
diagnosis not explained
by the input symptoms
and the bias variables
due to censoring and
categorisation
Major criteria,
essential for
diagnosis
Depressed dys_ma
mood most
of the day for
more days
than not, for at
least 2 years
Minor criteria dys_mi dys_mi=dys_mi1+dys_mi2+ Categorising of the sum
(at least two dys_mi3+dys_mi4+dys_mi5+ of multiple input variables
items) dys_mi6+dys_mi_bias
Poor appetite dys_mi1 dys_mi1=dys_mi1_1+ Censoring of the sum of
or overeating dys_mi1_2+dys_mi1_bias multiple input variables
Poor appetite dys_mi1_1
Overeating dys_mi1_2
Information dys_mi1_bias
of the domain
not explained
by the input
variables
Continued
9
Continued
10
Open access
Table 3 Continued
Major or Equations to generate
Classification Criterion minor criteria Intermediate Symptom diagnosis or domain Mechanisms related to
of symptoms variable (domains) variables Symptoms variables variables Approximation introducing biases
Information mde_mi8_bias
of the domain
not explained
by the input
variables
Feelings of dys_mi6
hopelessness
Information of dys_mi_bias Bias introduced by
minor criteria categorising the number
not explained of input symptoms
by input confirmed in the minor
variables criteria
Information dys_bias Information of the
of diagnosis diagnosis not explained
not explained by the input symptoms
by major or and the bias variables
minor criteria generated due to data
categorisation (dys_mi_
bias)
*The input symptoms used for the diagnosis of both major depressive episodes and dysthymic disorder.
11
12
Table 4 Continued
Major or
Classification of Criterion minor criteria Domain Symptom Mechanisms related to
symptoms variable (domains) variables Symptoms variables Equations Approximation introducing biases
A distinct period
Open access
of abnormally
and persistently
elevated,
expansive or
irritable mood,
lasting at least
1 week (or any
duration if
hospitalisation is
necessary)
Elevated man_ma1
mood, lasting
at least 1 week
Expansive man_ma2
mood, lasting
at least 1 week
Irritable mood, man_ma3
lasting at least
1 week
Minor criteria
(three or more
of the following
symptoms have
persisted; four if
the mood is only
irritable)
Increased man_mi1 man_mi1=man_mi1_1+ Censoring of the sum of
self-esteem or man_mi1_2+ multiple input variables
grandiosity man_mi1_bias
Increased self- man_mi1_1
esteem
Grandiosity man_mi1_2
Information man_mi1_bias
of the domain
not explained
by the input
variables
13
14
Table 4 Continued
Major or
Classification of Criterion minor criteria Domain Symptom Mechanisms related to
symptoms variable (domains) variables Symptoms variables Equations Approximation introducing biases
Distractibility man_mi5
Open access
15
Open access
Figure 1 The prevalence rates of an intermediate variable Figure 3 The prevalence rates of major depressive episodes
for the diagnosis of major depressive episodes by assumed by assumed input symptom prevalence and correlations.
input symptom prevalence and correlations. The intermediate Major depressive episodes are diagnosed when both
variable is ‘significant unintentional weight loss or gain’ and major and minor criteria are confirmed. The black line
the input symptoms are ‘significant unintentional weight represents the situation where the prevalence rates of the
loss’ and ‘significant unintentional weight gain.’ The black input symptoms are the same as that of major depressive
line represents the situation where the prevalence rates of episodes. Lines below the black lines have the prevalence
the input symptoms are the same as that of the intermediate rates of major depressive disorder lower than those of the
variable. Lines above the black lines have the prevalence input symptoms. CI, confidence interval.
rates of the intermediate variable larger than those of the
input symptoms. CI, confidence interval.
prevalence rate (0.679) assuming 0.1 correlations and 0.7
correlations of the input symptoms in figures 2–4 and prevalence rates of the input symptoms. When compared
listed in table 5. None of the three diagnoses had prev-
across figures 2–4, given the same assumed prevalence
alence rates exceeding those of the input symptoms. In
rates and between-variable correlations of the input symp-
general, higher prevalence rates or between- variable
correlations of the input symptoms were associated with toms, the diagnostic criteria of dysthymic disorder consis-
higher prevalence rates in the three diagnoses, except for tently generated diagnoses of the highest prevalence rates
manic episodes that had higher prevalence rates (0.692) and the criteria of major depressive episodes created diag-
assuming 0 correlations and 0.7 prevalence rates than the noses of the least prevalence rates (see table 5 for details).
17
Open access
Figure 5 The approximation of the diagnosis of dysthymic disorder by input symptoms, bias variables and both, measured by
R-squared. The diagnosis of dysthymic disorder is approximated by all variable, including input symptoms and bias variables,
using forward-stepwise regression. The selection of the variables was determined by adjusted R-squared. Circles are the
maximal adjusted R-squared achieved by the regression with input symptoms, bias variables, or both of them. See table 4 for
the details in the input symptoms and the bias variables. The assumed correlations between the input symptoms are 0.4 and
the assumed prevalence rates of the input symptoms are 0.7 in this figure.
Associations between the diagnoses and input symptoms or the best. For the diagnosis of manic episodes, the third
bias variables item of the major criteria (man_ma3 in table 4) individ-
The diagnoses were first interpreted with the input symp- ually best explained the diagnosis in all combinations of
toms (including intermediate variables) and the bias vari- assumed prevalence rates and correlations. However, the
ables individually. The diagnosis of dysthymic disorder, proportions of diagnosis variances best explained by indi-
for example, was interpreted with the input symptoms, vidual input symptoms varied widely between 0.001 and
the bias variables, and both in figure 5. For each simu- 0.974, depending on the assumed prevalence rates and
lation, the diagnosis of dysthymic disorder was approxi- between-variable correlations. Based on a high correla-
mated with an increasing number of the input symptoms, tion with the diagnoses, certain input variables or symp-
the bias variables or both. After selecting the variables toms were more important than others, such as the major
that best approximated the diagnosis based on adjusted criteria for the diagnosis of dysthymic disorder. The
R-squared, the input symptoms could explain a propor- prevalence rates and between-variable correlations were
tion of 0.956 of the diagnosis variance and the bias vari- important to determine the relationships between input
ables could explain at most a proportion of 0.405 of the symptoms and diagnoses.
diagnosis variance in figure 5. With all variables used in Similarly, there were bias variables that consistently best
the regression, the diagnosis could be perfectly explained explained the diagnoses in table 7. For the diagnosis of
by the input symptoms and bias variables (adjusted major depressive episodes, the biases due to categori-
R-squared=1). The individual input symptoms and the sation of the numbers of confirmed input symptoms
bias variables that individually best explained the diag- (mde_bias1 and mde_bias2 in table 2) were the leading
noses are listed in tables 6 and 7, respectively. bias variable. The diagnosis of major depressive episodes
For the diagnosis of major depressive episodes, the first not explained by the input symptoms or information
or second items in the major criteria (variable names: censoring (mde_bias in table 2) was the leading bias vari-
mde_ma1 or mde_ma2 in table 2) individually best able in two combinations of the assumed prevalence rates
explained the diagnosis depending on the assumed preva- and correlations. For the diagnosis of dysthymic disorder,
lence rates and correlations in table 6. For the diagnosis of the residual of the diagnosis not explained by the major
dysthymic disorder, the major criteria (dys_ma in table 3) and minor criteria (dys_bias in table 3) and the bias due
consistently and individually explained the diagnosis to the categorisation of the confirmed input symptoms
Table 6 The individual input symptoms that best explained the diagnoses based on adjusted R-squared: major depressive
episodes, dysthymic disorder and manic episodes by assumed input symptom prevalence and correlations
Assumed Assumed
correlations between prevalence of input Major depressive
input symptoms symptoms episodes Dysthymic disorder Manic episodes
0 0.05 mde_ma1 dys_ma man_ma3
0 0.05 0.001 (95% CI 0.001 to 0.076 (95% CI 0.075 to 0.002 (95% CI 0.002 to
0.001) 0.077) 0.002)
0 0.1 mde_ma1 dys_ma man_ma3
0 0.1 0.01 (95% CI 0.01 to 0.01) 0.228 (95% CI 0.227 to 0.021 (95% CI 0.02 to
0.229) 0.021)
0 0.3 mde_ma1 dys_ma man_ma3
0 0.3 0.167 (95% CI 0.167 to 0.774 (95% CI 0.773 to 0.366 (95% CI 0.366 to
0.167) 0.774) 0.367)
0 0.5 mde_ma2 dys_ma man_ma3
0 0.5 0.324 (95% CI 0.324 to 0.971 (95% CI 0.971 to 0.773 (95% CI 0.772 to
0.325) 0.971) 0.773)
0 0.7 mde_ma2 dys_ma man_ma3
0 0.7 0.412 (95% CI 0.412 to 0.999 (95% CI 0.999 to 0.964 (95% CI 0.964 to
0.412) 0.999) 0.964)
0.1 0.05 mde_ma2 dys_ma man_ma3
0.1 0.05 0.07 (95% CI 0.07 to 0.353 (95% CI 0.352 to 0.136 (95% CI 0.135 to
0.071) 0.355) 0.137)
0.1 0.1 mde_ma1 dys_ma man_ma3
0.1 0.1 0.101 (95% CI 0.1 to 0.462 (95% CI 0.461 to 0.199 (95% CI 0.198 to
0.101) 0.463) 0.199)
0.1 0.3 mde_ma2 dys_ma man_ma3
0.1 0.3 0.242 (95% CI 0.242 to 0.777 (95% CI 0.777 to 0.483 (95% CI 0.483 to
0.243) 0.778) 0.484)
0.1 0.5 mde_ma2 dys_ma man_ma3
0.1 0.5 0.365 (95% CI 0.365 to 0.932 (95% CI 0.931 to 0.74 (95% CI 0.74 to
0.366) 0.932) 0.741)
0.1 0.7 mde_ma2 dys_ma man_ma3
0.1 0.7 0.445 (95% CI 0.445 to 0.986 (95% CI 0.986 to 0.906 (95% CI 0.906 to
0.446) 0.986) 0.907)
0.4 0.05 mde_ma1 dys_ma man_ma3
0.4 0.05 0.375 (95% CI 0.373 to 0.731 (95% CI 0.729 to 0.561 (95% CI 0.559 to
0.376) 0.732) 0.562)
0.4 0.1 mde_ma1 dys_ma man_ma3
0.4 0.1 0.395 (95% CI 0.394 to 0.763 (95% CI 0.762 to 0.595 (95% CI 0.594 to
0.396) 0.764) 0.596)
0.4 0.3 mde_ma1 dys_ma man_ma3
0.4 0.3 0.465 (95% CI 0.465 to 0.851 (95% CI 0.85 to 0.701 (95% CI 0.701 to
0.466) 0.851) 0.702)
0.4 0.5 mde_ma2 dys_ma man_ma3
0.4 0.5 0.525 (95% CI 0.524 to 0.908 (95% CI 0.908 to 0.787 (95% CI 0.786 to
0.525) 0.908) 0.787)
0.4 0.7 mde_ma2 dys_ma man_ma3
0.4 0.7 0.568 (95% CI 0.568 to 0.946 (95% CI 0.946 to 0.855 (95% CI 0.854 to
0.569) 0.947) 0.855)
0.7 0.05 mde_ma2 dys_ma man_ma3
Continued
Table 6 Continued
Assumed Assumed
correlations between prevalence of input Major depressive
input symptoms symptoms episodes Dysthymic disorder Manic episodes
0.7 0.05 0.688 (95% CI 0.687 to 0.909 (95% CI 0.908 to 0.831 (95% CI 0.83 to
0.69) 0.909) 0.832)
0.7 0.1 mde_ma1 dys_ma man_ma3
0.7 0.1 0.688 (95% CI 0.687 to 0.912 (95% CI 0.911 to 0.836 (95% CI 0.835 to
0.689) 0.913) 0.836)
0.7 0.3 mde_ma2 dys_ma man_ma3
0.7 0.3 0.71 (95% CI 0.709 to 0.93 (95% CI 0.93 to 0.93) 0.862 (95% CI 0.861 to
0.711) 0.862)
0.7 0.5 mde_ma2 dys_ma man_ma3
0.7 0.5 0.729 (95% CI 0.728 to 0.944 (95% CI 0.943 to 0.882 (95% CI 0.882 to
0.729) 0.944) 0.883)
0.7 0.7 mde_ma1 dys_ma man_ma3
0.7 0.7 0.745 (95% CI 0.744 to 0.954 (95% CI 0.954 to 0.9 (95% CI 0.9 to 0.9)
0.745) 0.955)
0.9 0.05 mde_ma1 dys_ma man_ma3
0.9 0.05 0.828 (95% CI 0.827 to 0.958 (95% CI 0.957 to 0.918 (95% CI 0.917 to
0.829) 0.958) 0.919)
0.9 0.1 mde_ma2 dys_ma man_ma3
0.9 0.1 0.838 (95% CI 0.838 to 0.961 (95% CI 0.961 to 0.925 (95% CI 0.924 to
0.839) 0.961) 0.925)
0.9 0.3 mde_ma2 dys_ma man_ma3
0.9 0.3 0.856 (95% CI 0.856 to 0.969 (95% CI 0.968 to 0.937 (95% CI 0.936 to
0.857) 0.969) 0.937)
0.9 0.5 mde_ma2 dys_ma man_ma3
0.9 0.5 0.862 (95% CI 0.862 to 0.972 (95% CI 0.972 to 0.942 (95% CI 0.942 to
0.863) 0.972) 0.943)
0.9 0.7 mde_ma2 dys_ma man_ma3
0.9 0.7 0.865 (95% CI 0.865 to 0.974 (95% CI 0.974 to 0.946 (95% CI 0.946 to
0.866) 0.974) 0.946)
See table 2 to 4 for variable definitions. Adjusted R-squared is derived from linear regressions using individual input symptoms as predictor
with 95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and
correlations.
CI, confidence interval.
in the minor criteria (dys_mi_bias) were the leading bias important factors for the relationships between the bias
variables. For the diagnosis of manic episodes, the bias variables and the diagnoses.
due to the categorisation of the number of confirmed In general, the proportions of the diagnosis variance
input symptoms in the minor criteria up to three (man_ that could be explained by either individual input symp-
bias1 in table 4) was the leading bias variables, except for toms or single bias variables were low when the preva-
two combinations of the assumed prevalence rates and lence rates and between-variable correlations of the input
correlations, in which the bias due to categorisation of symptoms were assumed to be low. With higher assumed
the confirmed input symptoms in the minor criteria up
prevalence rates or correlations, the proportions of the
to four (man_bias2 in table 4) best explained the diag-
diagnoses explained by the single input symptoms or bias
nosis. However, the proportions of diagnosis variances
explained by individual bias variables varied widely from variables were higher. Across three diagnoses, the diag-
0 to 0.87. Depending on the assumed prevalence rates nosis of dysthymic disorder could be better explained by
and between-variable correlations of the input symptoms, single input variables (higher adjusted R-squared), and
certain bias variables were more important than other the diagnosis of major depressive episodes was associated
bias variables and even some input variables. The assumed with the least adjusted R-squared. The bias variables of the
prevalence rates and between-variable correlations were diagnosis of manic episodes could explain the diagnosis
Table 7 The individual bias variables that best explained the diagnoses based on adjusted R-squared: major depressive
episodes, dysthymic disorder and manic episodes by assumed input symptom prevalence and correlations
Assumed Assumed
correlations between prevalence of input Major depressive
input symptoms symptoms episodes Dysthymic disorder Manic episodes
0 0.05 mde_bias2 dys_bias man_bias2
0 0.05 0 (95% CI 0 to 0) 0.028 (95% CI 0.028 to 0.001 (95% CI 0.001 to
0.028) 0.001)
0 0.1 mde_bias2 dys_bias man_bias2
0 0.1 0.004 (95% CI 0.004 to 0.053 (95% CI 0.053 to 0.011 (95% CI 0.011 to
0.004) 0.054) 0.011)
0 0.3 mde_bias2 dys_bias man_bias1
0 0.3 0.015 (95% CI 0.015 to 0.045 (95% CI 0.045 to 0.089 (95% CI 0.089 to
0.015) 0.045) 0.09)
0 0.5 mde_bias dys_bias man_bias1
0 0.5 0.013 (95% CI 0.013 to 0.007 (95% CI 0.007 to 0.035 (95% CI 0.034 to
0.014) 0.007) 0.035)
0 0.7 mde_bias dys_bias man_bias1
0 0.7 0.01 (95% CI 0.01 to 0.01) 0 (95% CI 0 to 0) 0.002 (95% CI 0.002 to
0.002)
0.1 0.05 mde_bias2 dys_bias man_bias1
0.1 0.05 0.037 (95% CI 0.037 to 0.113 (95% CI 0.113 to 0.083 (95% CI 0.083 to
0.037) 0.114) 0.084)
0.1 0.1 mde_bias2 dys_bias man_bias1
0.1 0.1 0.047 (95% CI 0.047 to 0.122 (95% CI 0.121 to 0.116 (95% CI 0.115 to
0.048) 0.122) 0.116)
0.1 0.3 mde_bias2 dys_mi_bias man_bias1
0.1 0.3 0.077 (95% CI 0.077 to 0.105 (95% CI 0.105 to 0.198 (95% CI 0.197 to
0.077) 0.106) 0.198)
0.1 0.5 mde_bias2 dys_mi_bias man_bias1
0.1 0.5 0.079 (95% CI 0.079 to 0.073 (95% CI 0.073 to 0.166 (95% CI 0.166 to
0.08) 0.073) 0.167)
0.1 0.7 mde_bias2 dys_mi_bias man_bias1
0.1 0.7 0.065 (95% CI 0.065 to 0.047 (95% CI 0.046 to 0.094 (95% CI 0.093 to
0.065) 0.047) 0.094)
0.4 0.05 mde_bias1 dys_mi_bias man_bias1
0.4 0.05 0.294 (95% CI 0.293 to 0.415 (95% CI 0.413 to 0.432 (95% CI 0.431 to
0.295) 0.416) 0.433)
0.4 0.1 mde_bias1 dys_mi_bias man_bias1
0.4 0.1 0.304 (95% CI 0.303 to 0.419 (95% CI 0.418 to 0.445 (95% CI 0.444 to
0.304) 0.42) 0.445)
0.4 0.3 mde_bias1 dys_mi_bias man_bias1
0.4 0.3 0.335 (95% CI 0.334 to 0.411 (95% CI 0.411 to 0.473 (95% CI 0.472 to
0.335) 0.412) 0.473)
0.4 0.5 mde_bias1 dys_mi_bias man_bias1
0.4 0.5 0.354 (95% CI 0.354 to 0.395 (95% CI 0.395 to 0.475 (95% CI 0.474 to
0.355) 0.396) 0.475)
0.4 0.7 mde_bias1 dys_mi_bias man_bias1
0.4 0.7 0.356 (95% CI 0.355 to 0.367 (95% CI 0.366 to 0.451 (95% CI 0.45 to
0.356) 0.367) 0.451)
0.7 0.05 mde_bias1 dys_mi_bias man_bias1
Continued
Table 7 Continued
Assumed Assumed
correlations between prevalence of input Major depressive
input symptoms symptoms episodes Dysthymic disorder Manic episodes
0.7 0.05 0.616 (95% CI 0.615 to 0.705 (95% CI 0.704 to 0.723 (95% CI 0.722 to
0.617) 0.706) 0.724)
0.7 0.1 mde_bias1 dys_mi_bias man_bias1
0.7 0.1 0.611 (95% CI 0.611 to 0.699 (95% CI 0.698 to 0.72 (95% CI 0.72 to
0.612) 0.699) 0.721)
0.7 0.3 mde_bias1 dys_mi_bias man_bias1
0.7 0.3 0.623 (95% CI 0.623 to 0.699 (95% CI 0.699 to 0.728 (95% CI 0.728 to
0.624) 0.7) 0.729)
0.7 0.5 mde_bias1 dys_mi_bias man_bias1
0.7 0.5 0.632 (95% CI 0.632 to 0.696 (95% CI 0.696 to 0.731 (95% CI 0.731 to
0.633) 0.697) 0.732)
0.7 0.7 mde_bias1 dys_mi_bias man_bias1
0.7 0.7 0.639 (95% CI 0.638 to 0.693 (95% CI 0.692 to 0.732 (95% CI 0.731 to
0.639) 0.693) 0.732)
0.9 0.05 mde_bias1 dys_mi_bias man_bias1
0.9 0.05 0.777 (95% CI 0.776 to 0.835 (95% CI 0.834 to 0.847 (95% CI 0.847 to
0.778) 0.835) 0.848)
0.9 0.1 mde_bias1 dys_mi_bias man_bias1
0.9 0.1 0.788 (95% CI 0.788 to 0.842 (95% CI 0.841 to 0.855 (95% CI 0.854 to
0.789) 0.843) 0.855)
0.9 0.3 mde_bias1 dys_mi_bias man_bias1
0.9 0.3 0.807 (95% CI 0.806 to 0.854 (95% CI 0.853 to 0.867 (95% CI 0.867 to
0.807) 0.854) 0.868)
0.9 0.5 mde_bias1 dys_mi_bias man_bias1
0.9 0.5 0.811 (95% CI 0.811 to 0.855 (95% CI 0.855 to 0.87 (95% CI 0.87 to
0.811) 0.856) 0.871)
0.9 0.7 mde_bias1 dys_mi_bias man_bias1
0.9 0.7 0.812 (95% CI 0.811 to 0.853 (95% CI 0.853 to 0.869 (95% CI 0.869 to
0.812) 0.853) 0.87)
See table 2 to 4 for variable definitions. Adjusted R-squared is derived from linear regressions using individual bias variables as predictor with
95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and correlations.
CI, confidence interval.
individually better than the bias variables of the other two explained the diagnosis better than those of the other two
diagnoses. diagnoses under all combinations of assumed prevalence
rates and between- variable correlations. However, the
Approximating the diagnoses with input symptoms proportion of diagnosis variance explained by own input
When the diagnoses were approximated by all of their symptoms varied widely from 0.003 to 1.0. The assumed
own input symptoms (table 8), there were always some prevalence rates and between-variable correlations of the
diagnosis variances that could not be explained by the input symptoms and the design of the diagnostic criteria
input symptoms. In other words, the input symptoms were all important for the relationships between input
together could not fully explain the diagnoses, except symptoms and diagnoses.
for the diagnosis of dysthymic disorder that could be fully
explained by the input symptoms (adjusted R-squared=1) Approximating the diagnoses with bias variables
assuming 0 between-variable correlations and 0.7 preva- The diagnoses were approximated with the bias vari-
lence rates for the input symptoms. In table 8, the propor- ables of their own. The bias variables always explained
tions of diagnosis variances explained by input symptoms some of the diagnosis variances, except for the diag-
increased with higher assumed prevalence rates or nosis of dysthymic disorder assuming 0 between-variable
between- variable correlations of the input symptoms correlations and 0.7 prevalence rates for the input symp-
in general. The input symptoms of dysthymic disorder toms (adjusted R-squared=0). With increasing assumed
23
Open access
between-variable correlations for the input symptoms, the design of the diagnostic criteria of three mental illnesses
adjusted R-squared increased. However, given the same can be compared with each other. The design of diag-
assumed between-variable correlations, the proportions nostic criteria transform input symptoms into various
of diagnosis variances explained by the bias variables diagnosis prevalence rates with implicit upper limits (ie,
might increase or decrease with the assumed prevalence no more prevalent than the input symptoms), unacknowl-
rates. Compared with the adjusted R-squared in table 8, edged differential weights on the input symptoms (ie,
the proportion of the diagnosis variances explained by certain input symptoms better explaining the diagnoses)
the bias variables was always smaller than that explained and the introduction of biases (ie, due to censoring, data
by the input symptoms in table 9. The proportions of the categorisation or multiplication).
diagnosis variance explained by bias variables also varied We are the first to notice that the prevalence rates of
widely from 0 to 0.89. The assumed prevalence rates and the three diagnoses are lower than those of the input
between- variable correlations of input symptoms and symptoms if input symptoms are randomly distributed
the design of the diagnostic criteria were important for with uniform prevalence rates and correlations. Given
the relationship between the bias variables and the diag- similar assumed input symptom prevalence and correla-
noses. Only when the input symptoms for the diagnosis tions, dysthymic disorder is the most prevalent, and
of dysthymic disorder were randomly and independently major depressive episodes are the least. The diagnosis of
prevalent to 70% of the simulated populations, the bias dysthymic disorder can be better explained by its input
variables became irrelevant to the diagnosis. symptoms individually or collectively than the other two
diagnoses. The diagnosis of major depressive episodes is
least explained by own input symptoms individually or
DISCUSSION collectively. As expected, the diagnosis of the three mental
This study is a first attempt to assess the biases created by illness is similar to composite measures or indices and is
mental illness diagnostic criteria, as well as understand subject to the biases introduced by data processing, given
the relationships between input symptoms and the diag- all combinations of the assumed prevalence rates and
noses of three mental illnesses: major depressive episodes between- variable correlations of the input symptoms.7
(at least one episode required for the diagnosis of major There is only one exception: dysthymic disorder with the
depressive disorder), dysthymic disorder and manic input symptoms that are randomly and independently
episodes. The diagnostic criteria of these three mental present in 70% of the population. This is because the
illnesses have been reviewed and rewritten as mathemat- diagnosis of dysthymic disorder is a multiplicative product
ical functions. Simulated populations of 100 000 for each of the major and minor criteria. Without correlations,
of 100 simulations, with input symptoms of the three diag- everyone in the population is certain to qualify for the
noses, were created. For simplicity and practicality, the minor criteria (probability of 100% because having at
presence of the input symptoms was randomly assigned, least two out of the six items in the minor criteria: mathe-
and the input symptoms were assumed to have uniform matically [C(2,6)+C(3,6)+C(4,6)+C(5,6)+C(6,6)] × (0.7)6
prevalence rates and between- variable correlations. = 37 × 0.117=4.35 > 100%). If 70% of the population were
There were 25 combinations of assumed prevalence rates also randomly assigned with the major criteria and 100%
and between-variable correlations simulated. were assigned with the minor criteria, 70% would be
Mathematically, the diagnostic criteria are functions diagnosed with dysthymic disorder and the diagnosis of
and composite measures to transform information from dysthymic disorder can be fully explained by the major
the input symptoms to diagnoses. There are bias variables criteria. In fact, without correlations between input symp-
created by the diagnostic criteria due to data processing.7 toms, it only requires each of the six items in the minor
There are three major mechanisms of introducing biases: criteria to be randomly assigned to 54.8% [(1/37)(1/6)]
censoring, data categorisation8 and multiplication of of the population for everyone to qualify for the minor
input symptoms.7 These mechanisms introduce infor- criteria, and the diagnosis can be fully explained by the
mation or biases that cannot be fully explained by the minor and major criteria.
input symptoms.7 The introduced biases can sometimes
explain more than half of the variance in the diagnoses Distortion of the input symptoms
depending on the prevalence rates and between-variable The importance of the input symptoms has been distorted
correlations of the input symptoms. The findings show due to the diagnostic criteria for the three mental illnesses.
that the design of the diagnostic criteria is important for The same phenomenon has been proven in the diagnosis
bias introduction and significant for the prevalence of the of frailty based on three of the most commonly used
diagnoses in populations, the relationships between the scoring methods.7 In other words, based on the functions
input symptoms and the diagnoses, and the relationships to generate the diagnoses, the input symptoms are differ-
between the bias variables and the diagnoses. entially weighted, and weights are not explicitly acknowl-
edged. The most prominent is the diagnosis of dysthymic
The role of the diagnostic criteria disorder; more than 90% of the variance can be explained
With the same assumptions in the prevalence rates and by its major criteria assuming 0.7 or 0.9 between-variable
between-variable correlations of the input symptoms, the correlations for the input symptoms in table 6. Another
25
Open access
example is that the third item of the major criteria for CONCLUSION
the diagnosis of manic episodes, ‘irritable mood,’ individ- To the best of our knowledge, there is no study on the
ually predicts the diagnosis better than any other input relationships between the input symptoms and diagnoses.
symptoms or intermediate variables. This input symptom The input symptoms were extracted from the diagnostic
has been given more weight than others and can explain criteria and the diagnostic criteria were transformed
more than 91.8% of the diagnosis variance, assuming 0.9 into mathematical functions. Without mental illness data
correlations between input symptoms. Based on the texts available to the public, 100 000 subjects were simulated
in the DSM-IV-TR, we do not think this symptom should with different assumptions on the prevalence rates (0.05,
be emphasised to this degree. However, the diagnostic 0.1, 0.3, 0.5 and 0.7) and correlations (0, 0.1, 0.4, 0.7 and
criteria impose implicit and unequal weights to the input 0.9) of the input symptoms. We found that biases were
symptoms, and introduce biases into the diagnoses. introduced into the diagnoses of three mental illnesses:
major depressive episodes, dysthymic disorder, and
Future directions manic episodes. The prevalence rates of the diagnoses
We think it important to rethink the role and impor- were proportional to the assumed prevalence rates and
tance of the diagnostic system. Current approaches are between- variable correlations of the input symptoms.
embedded with implicit assumptions of the prevalence Certain input symptoms were more important than the
rates of the diagnoses (no higher than input symptoms others in explaining the diagnoses. However, the input
if the prevalence of input symptoms are similar), unac- symptoms could not fully explain the diagnoses, except
knowledged weights to input symptoms (certain input when the input symptoms independent of each other with
symptoms explaining the diagnoses much better) and 0.7 symptom prevalence rates were used for the diagnosis
biases that are induced by data processing and could not of dysthymic disorder. In conclusion, the criteria used to
be explained by the input symptoms. It is unclear whether diagnose these three mental illnesses may fail to repre-
the diagnosis of dysthymic disorder was intentionally sent the concepts they are based on, in a similar manner
designed to be more prevalent than those of major to three of the most commonly used scoring methods to
depressive episodes or manic episodes, given their input diagnose frailty.
symptoms of the same prevalence rates.
In the real world, there are other important issues Author affiliations
related to the diagnostic criteria. For example, diagnoses 1
Independent researcher, Montreal, Quebec, Canada
are not closely linked to treatment,20 30 diagnoses are not 2
Psychiatry, National Taiwan University Hospital Yun-Lin Branch, Douliou, Taiwan,
well made particularly by non-psychiatrists,31 and there Taiwan
3
Département d'informatique, UQAM, Montreal, Quebec, Canada
are two diagnostic systems (the DSM and the Interna- 4
Internal Medicine, Taipei Hospital, Xinzhuang, Taipei, Taiwan
tional Classification of Disease) that require efforts to 5
Pathology, Changhua Christian Healthcare System, Changhua, Taiwan, Taiwan
harmonise.32 Amid these issues, we think the diagnostic 6
Surgery, Changhua Christian Healthcare System, Changhua, Taiwan, Taiwan
criteria for mental illnesses should be reviewed and 7
Chest Medicine, National Yang Ming University Hospital, Ilan, Taiwan, Taiwan
8
improved for interpreteability, clinical use without intro- Department of Chest Medicine, Taipei Veterans General Hospital, Taipei, Taiwan
9
ducing biases, and better connection to clinical decisions. Institute of Emergency and Critical Care Medicine, National Yang-Ming University,
Taipei, Taiwan, Taiwan
Certain measures and biomarkers have been proven 10
Faculty of Medicine, School of Medicine, National Yang-Ming University, Taipei,
useful to identify mental illnesses.33 34 We are developing Taiwan
methods that better detect symptom- based conditions
and applying syndrome mining techniques35 to search for Contributors Y-SC conceptualised and designed this study, managed and
neglected mental illnesses. analysed data and drafted the manuscript. K-FL assisted in the interpretation of the
diagnostic criteria. C-JW assisted in data management and computation. H-CW,
H-TH, L-CT, Y-PC, Y-CL and W-CC participated in the design of this study. All authors
reviewed and approved the manuscript.
LIMITATIONS Funding The authors have not declared a specific grant for this research from any
The strength of this study is the use of simple assumptions funding agency in the public, commercial or not-for-profit sectors.
in simulated populations that enables the comparison of Competing interests Y-SC is currently employed by the Canadian Agency for
the diagnostic criteria of three mental illnesses. However, Drugs and Technologies in Health. The other authors declare that there is no conflict
the assumptions in the prevalence rates and between- of interest.
variable correlations for the input symptoms might not be Patient consent for publication Not required.
realistic. Some of the assumptions are unlikely to hold in Provenance and peer review Not commissioned; externally peer reviewed.
the real world. However, simulations are the only option Data availability statement All data relevant to the study are included in the
for us due to the lack of real-world data on the prevalence article or uploaded as online supplemental information. No real-world data used—
of the input symptoms. In addition, the translation from all analysis are based on simulations reproducible with the files in the online
symptoms to diagnoses was assumed to be perfect based supplemental materials.
on the diagnostic criteria. The simulations in this study Supplemental material This content has been supplied by the author(s). It has
not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been
only reflect the problems in the design of the diagnostic
peer-reviewed. Any opinions or recommendations discussed are solely those
criteria and are not designed to review the impact of how of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and
they are used in the real world. responsibility arising from any reliance placed on the content. Where the content
includes any translated material, BMJ does not warrant the accuracy and reliability 15 Chao Y-S, Wu H-C, Wu C-J, et al. Principal component
of the translations (including but not limited to local regulations, clinical guidelines, approximation and interpretation in health survey and Biobank data.
terminology, drug names and drug dosages), and is not responsible for any error Frontiers in Digital Humanities 2018;5.
16 Chao Y-S, Wu H-C, Wu C-J, et al. Stages of biological development
and/or omissions arising from translation and adaptation or otherwise.
across age: an analysis of Canadian health measure survey 2007–
Open access This is an open access article distributed in accordance with the 2011. Front Public Health 2018;5:2296–565.
Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which 17 Chao Y-S, C-J W. PD25 principal component approximation: medical
permits others to distribute, remix, adapt, build upon this work non-commercially, expenditure panel survey. International Journal of Technology
and license their derivative works on different terms, provided the original work is Assessment in Health Care 2019;34:138.
18 Chao Y-S, Wu C-J, Chen T-S. Risk adjustment and observation time:
properly cited, appropriate credit is given, any changes made indicated, and the use comparison between cross-sectional and 2-year panel data from
is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/. the medical expenditure panel survey (MEPS). Health Inf Sci Syst
2014;2:5.
ORCID iD 19 Chao Y-S, Wu H-T, Scutari M, et al. A network perspective on patient
Yi-Sheng Chao http://orcid.org/0000-0002-4019-3586 experiences and health status: the medical expenditure panel survey
2004 to 2011. BMC Health Serv Res 2017;17:1472–6963.
20 Bonnin JE. Treating without diagnosis: psychoanalysis in medical
settings in Argentina, 2015.
21 Leisch F, Weingessel A, Hornik K. On the generation of correlated
artificial binary data, 1998.
REFERENCES 22 Leisch F, Weingessel A, Hornik K. bindata: generation of
1 American Psychiatric Association. Diagnostic and Statistical Manual artificial binary data, 2012. Available: http://CRANR-projectorg/
of Mental Disorders. In: Text revision (DSM-IV-TR®). Fourth ed. package=bindataRpackageversion09-19
Washington, DC: American Psychiatric Association Publishing, 2010. 23 Lumley T, Lumley MT, ‘leaps’ P. Regression subset selection Thomas
2 Center for Substance Abuse Treatment. Managing depressive Lumley based on Fortran code by alan Miller, 2013. Available: https://
symptoms in substance abuse clients during early recovery. cran.r-project.org/package=leaps [Accessed 18 Mar 2018].
Rockville, MD: Substance Abuse and Mental Health Services 24 Hastie T, Tibshirani R, Friedman J. The elements of statistical
Administration (US), 2008. learning: data mining, inference, and prediction. Second ed. Springer
3 Chao Y-S, Wu C-J, Wu H-C, HC W, et al. Composite diagnostic New York, 2009.
criteria are problematic for linking potentially distinct populations: the 25 James G, Witten D, Hastie T, et al. An introduction to statistical
case of frailty. Sci Rep 2020;10:2601. learning: with applications in R. New York: Springer, 2013.
4 Feighner JP, Robins E, Guze SB, et al. Diagnostic criteria for use in 26 Chao Y-S, Wu C-J. Principal component-based weighted
psychiatric research. Arch Gen Psychiatry 1972;26:57–63. indices and a framework to evaluate indices: results from the
5 Kendler KS, Muñoz RA, Murphy G. The development of the Feighner medical expenditure panel survey 1996 to 2011. PLoS One
criteria: a historical perspective. Am J Psychiatry 2010;167:134–42. 2017;12:e0183997.
6 Chao Y-S, Wu C-J. PP46 when composite measures or indices 27 Chao YS, Wu H-tieng, Wu C-J. Feasibility of classifying life
fail: data processing lessons. Int J Technol Assess Health Care stages and searching for the determinants: results from the
2018;34:83. medical expenditure panel survey 1996–2011. Front Public Health
7 Chao Y-S, Wu H-C, Wu C-J, et al. Index or illusion: the case 2017;5:2296–565.
of frailty indices in the health and retirement study. PLoS One 28 R Development Core Team. R: a language and environment for
2018;13:e0197859. statistical computing. Vienna, Austria: R Foundation for Statistical
8 Barnwell-Ménard J-L, Li Q, Cohen AA. Effects of categorization Computing, 2016.
method, regression type, and variable distribution on the inflation of 29 RStudio Team. RStudio: integrated development for R. Boston. MA:
type-I error rate when categorizing a confounding variable. Stat Med RStudio, Inc., 2016.
2015;34:936–49. 30 Demyttenaere K, Bonnewyn A, Bruffaerts R, et al. Clinical factors
9 Cigolle CT, Ofstedal MB, Tian Z, et al. Comparing models of frailty: influencing the prescription of antidepressants and benzodiazepines:
the health and retirement study. J Am Geriatr Soc 2009;57:830–9. results from the European study of the epidemiology of mental
10 Brown TA, Chorpita BF, Korotitsch W, et al. Psychometric properties disorders (ESEMeD). J Affect Disord 2008;110:84–93.
of the depression anxiety stress scales (DASS) in clinical samples. 31 Margolis RL. Nonpsychiatrist house staff frequently misdiagnose
Behav Res Ther 1997;35:79–89. psychiatric disorders in general Hospital inpatients. Psychosomatics
11 Lim GY, Tam WW, Lu Y, et al. Prevalence of depression in the 1994;35:485–91.
community from 30 countries between 1994 and 2014. Sci Rep 32 First MB. Harmonisation of ICD-11 and DSM-V: opportunities and
2018;8:2861. challenges. Br J Psychiatry 2009;195:382–90.
12 Smith DJ, Nicholl BI, Cullen B, et al. Prevalence and characteristics 33 Husain SF, Tang T-B, Yu R, et al. Cortical haemodynamic response
of probable major depression and bipolar disorder within UK measured by functional near infrared spectroscopy during a verbal
Biobank: cross-sectional study of 172,751 participants. PLoS One fluency task in patients with major depression and borderline
2013;8:e75362. personality disorder. EBioMedicine 2020;51:102586.
13 Chao Y-S, C-J W. PD26 principal component approximation: 34 Ho CSH, Zhang MWB, Ho RCM, CSH H, Ho R. Optical topography in
Canadian health measures survey. International Journal of psychiatry: a CHIP off the old block or a new look beyond the mind-
Technology Assessment in Health Care 2019;34:138–9. brain frontiers? Front Psychiatry 2016;7:74.
14 Chao Y-S, Wu C-J, Wu H-C, et al. Trend analysis for national surveys: 35 Chao Y-S, Wu C-J, Wu H-C, HC W, et al. Using syndrome mining
application to all variables from the Canadian health measures survey with the health and retirement study to identify the deadliest and
cycle 1 to 4. PLoS One 2018;13:e0200127. least deadly frailty syndromes. Sci Rep 2020;10:5357.