Professional Documents
Culture Documents
Int Forum Allergy Rhinol - 2017 - Chowdhury - Investigating The Minimal Clinically Important Difference For SNOT 22 Symptom
Int Forum Allergy Rhinol - 2017 - Chowdhury - Investigating The Minimal Clinically Important Difference For SNOT 22 Symptom
Background: Prior work has described 5 domains within with previously published metrics. Average MCID values
the 22-item Sino-Nasal Outcomes Test (SNOT-22) that al- for the rhinologic, extranasal rhinologic, ear/facial, psycho-
low for stratification of symptoms into similar clusters and logical, and sleep domain scores were 3.8, 2.4, 3.2, 3.9, and
that can be used to direct therapy. Although the out- 2.9, respectively. Anchor-based approaches with the SF-
comes of various interventions on these symptom domains 6D did not have strong predictive accuracy across total
have been reported, minimal clinically important difference SNOT-22 scores or domains (ROC areas under-the-curve
(MCID) values have not been investigated, which has lim- 0.71), indicating weak associations between improvement
ited clinical interpretation of these results. in SNOT-22 scores and health utility as measured by the
SF-6D.
Methods: This study was designed as a secondary analy-
sis of a prospective, multi-institutional, observational co- Conclusion: This estimation of MCID values for the SNOT-
hort. A total of 276 patients with medically refractory 22 symptom domains allows for improved clinical interpre-
CRS who underwent surgical management were enrolled. tation of results from past, present, and future rhinologic
Distribution-based methods (half-standard deviation, stan- outcomes research. C 2017 ARS-AAOA, LLC.
1149 International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017
Chowdhury et al.
significant result may also be thought to offer a clinically Surgical intervention was not randomized or assigned
meaningful result. for study purposes. Study participants elected endoscopic
The validated 22-question Sino-Nasal Outcomes Test sinus surgery (ESS) due to inadequate symptom control
(SNOT-22) is a widely adopted instrument to evalu- from prior medical management. Surgical approach was
ate chronic rhinosinusitis (CRS) treatment outcomes. The dictated by each enrolling clinician using radiographic
MCID of the total SNOT-22 score has previously been imaging and endoscopic examination findings of disease
defined as 8.9 points using 3-month postoperative scores.1 location and extent. Patients underwent either primary or
The characterization of 5 distinct symptom domains within revision ESS completed under general anesthesia. Surgical
the SNOT-222 has further refined the utility of the SNOT- procedures consisted of unilateral or bilateral maxillary
22 instrument by allowing for patient responses to be mea- sinus antrostomy, total or partial ethmoidectomy, sphe-
sured across multiple clinical domains, thus enabling both noidotomy, or frontal sinusotomy when appropriate.
clinicians and researchers to examine the effects of vari- Anatomic ventilation was further maximized by incor-
ous interventions on discrete rhinologic, extrarhinologic, porating inferior turbinate reduction and/or septoplasty
ear/facial, psychological, and sleep symptoms associated if indicated. Postoperative therapeutics included at least
with CRS. Although many studies have reported treatment daily nasal saline irrigations as needed and topical corticos-
outcomes across the different SNOT-22 domains,3–5 an teroid spray/rinses as necessary to optimize postoperative
MCID value for these domain scores has not been iden- healing.
tified, which represents a crucial gap in the current liter-
ature. In this investigation, we sought to use distribution-
based and anchor-based methods to define the MCID of Exclusion criteria
the SNOT-22 domains to qualify the analyses present in Potential disparities in global health as well as variability
rhinologic outcomes research while also providing physi- in surgical interventions and postoperative therapeutic reg-
cians with a clinical context for interpreting these scores in imen warranted the exclusion of some patient groups from
patients. this heterogeneous population. Study participants with con-
current recurrent acute rhinosinusitis (RARS), comorbid
ciliary dyskinesia, or corticosteroid-dependent conditions
(eg, sinusitis, asthma) were excluded from final analyses. In
Patients and methods
addition, enrolled study participants who did not return for
Patient enrollment and inclusion criteria postoperative follow-up appointments or respond to study-
Findings and descriptions of this cohort investigation related follow-up communication were considered lost to
have been described previously.6–8 Adult (18 years of follow-up and excluded from the final analyses.
age) study participants were prospectively sampled from
heterogeneous patient populations referred to tertiary,
academic practices in North America for recalcitrant SNOT-22 instrument
symptoms of CRS between March 2011 and June 2015. Study participants were asked to complete the SNOT-22, a
Diagnoses were confirmed by fellowship-trained rhinolo- 22-item validated survey developed to quantify the severity
gists using guidelines currently defined by the American of sinonasal symptoms ( C 2006, Washington University,
Academy of Otolaryngology.9, 10 Study participants pro- St. Louis, MO).1 The 22-items of the SNOT-22 were
vided written, informed consent in English during baseline categorized into 5 symptom domain scores: rhinologic
enrollment meetings. An institutional review board (IRB) symptoms domain (range, 0-30); extranasal rhinologic
at each enrollment center provided annual approval and symptoms domain (range, 0-15); ear/facial symptoms
regulatory oversight of all protocols, annual reviews, and domain (range, 0-25); psychological dysfunction domain
safety monitoring. Clinical enrollment centers included: (range, 0-35); and sleep dysfunction domain (range, 0-25),
Oregon Health & Science University (Portland, OR; eIRB as identified through previous factor analysis of this
#7198); Stanford University (Palo Alto, CA; IRB #4947); cohort.12 A MCID value for SNOT-22 total scores has
the Medical University of South Carolina (Charleston, SC; been defined previously as within-subject postoperative
IRB #12409); and the University of Calgary (Calgary, AB, improvement of at least 8.9 points in patients with CRS.1
Canada; IRB #E-24208). Patients were assured that study Study participants were observed through the standard of
participation involved no more than minimal risk and was care up to 12-months after ESS and asked to complete the
voluntary per good clinical practice guidelines established SNOT-22 survey both preoperatively and during postop-
by the International Conference on Harmonization.11 erative follow-up evaluations. This time period was chosen
Each participant provided extensive medical history to due to consensus by the authors that this represented an ap-
confirm completion of prior therapeutics consisting of at propriate follow-up period by which the longitudinal effect
least 1 course (14 days) of culture-directed or empiric of the intervention could be assessed. Prior work has shown
antibiotics, either corticosteroid nasal sprays (21 days) or that postoperative improvements in overall symptom sever-
systemic corticosteroid therapy (5 days), and daily saline ity are durable between 6-, 12-, and 18-month follow-up
solution irrigation (240 mL) as needed. periods.13
International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017 1150
MCID values for SNOT-22 domains
SF-6D health utility instrument distribution using Cronbach’s α.17 This reliability score (R)
Pre- and postoperative overall health states were also cap- was then used
√ to compute the SEM using the formula: SEM
tured using the Medical Outcomes Study Short Form 6D = SD × ( 1 − R).18, 19 The minimum detectable change
(SF-6D) instrument in addition to the SNOT-22. Individual (MDC) MCID score was also derived √ from the SEM using
survey items of the SF-6D are extracted from the 36-item a different formula, MDC = 1.96* 2 × SEM.20 The ef-
Medical Outcomes Study Short Form (SF-36) and trans- fect size–based MCID was calculated using Cohen’s small
formed into health utility values using a standardizing, effect size estimation (0.20)21 multiplied by the baseline SD
weighted algorithm, as described by Brazier et al, and ob- of scores.22
tained with permission from the Department of Health Eco- Anchor-based methods were then used to determine
nomics and Decision Science at the University of Sheffield.14 MCID values using the SF-6D as the determinant of a true
The normalized SF-6D value represents a quantified health change in health status.23 The threshold of improvement
state that an individual assigns to themselves on a spectrum was defined at 0.03, or 1 MCID of the SF-6D.15 Receiver-
(range, 0.3-1.0), whereas 1.0 reflects a state of “perfect operating characteristic (ROC) curves were computed for
health.” A posttreatment difference of 0.03-0.05 has been the 12-month change in the SNOT-22 total score and the
previously suggested as an MCID for the SF-6D.15 Due individual domain scores against the change in the SF-6D,
to the absence of a purely objective measure of sinonasal and area-under-the-curve (AUC) values were calculated to
health status, a postoperative improvement of 0.03 on the determine the predictive accuracy of this method.24 AUC
SF-6D was selected as the external anchor-based criterion values of 0.8 were considered to be strong candidates
for improvement following ESS. for established MCID values, as determined by the Youden
Index.25 A mean value across all methods was then com-
puted to determine a pooled measure of the MCID across
Data management and statistical analyses all methods.26
Patient confidentiality was achieved by unique study
identification number assignment and data collection
using a HIPAA-compliant, closed-environment database Results
(Microsoft Access, Microsoft Corp., Redmond, WA). Sec- Study cohort population
ondary data analysis of this cohort was completed us- Preliminary cohort data consisted of 604 study participants
ing commercial software (SPSS version 24; IBM Corp., who met inclusion criteria, provided preoperative SNOT-
Armonk, NY). Distribution of scaled data was evaluated 22 surveys, and completed ESS between March 2011 and
for assumptions of normality and/or linearity. All final de- June 2015. A total of 501 subjects were considered for
scriptive patient data are provided. analysis after exclusions for RARS (n = 39), ciliary dysk-
There are 2 schools of thought with respect to the pro- inesia (n = 23), or steroid-dependent comorbidity (n =
cess to compute the MCID. Distribution-based methods 41), and are presented in Table 1. Study participants pro-
aim to determine the spread of scores in a baseline pop- viding 12-month postoperative follow-up (n = 276; 55%)
ulation and use the standard deviation of the sample or were evaluated for postoperative changes in SNOT-22 and
the standard error of measurement to derive a threshold SF-6D scores and comprised the final cohort selection.
score that is, statistically speaking, unlikely to be due to
chance. A score above this threshold then gives the user Postoperative improvement in SNOT-22
confidence (but not certainty) that the measure obtained is
and SF-6D scores
a tangible change, as opposed to an error of measurement.
In contrast, anchor-based methods seek to link changes The distribution of all postoperative improvement scores
in patient-reported outcomes measure (PROM) scores to for the SNOT-22 and SF-6D was approximately normal
an independent clinical anchor measure. For the original by visual estimation. Highly significant 12-month postop-
SNOT-22 questionnaire, the MCID was computed to be erative improvement in mean scores was reported across
8.9 in a population of surgical patients with CRS using all patient-reported outcomes (Table 2), with an average
this approach. This value has now become widely adopted improvement of 26.7 points on total SNOT-22 scores after
in the literature and used extensively for CRS outcomes surgery (p < 0.001).
research.1
In the present investigation, we first used distribution- Distribution-based methods for MCID
based methods to compute the MCID of preoperative total determination
SNOT-22 score and the individual domain scores. The half Estimations of distribution-based methods for determining
standard deviation estimate was computed by calculating the MCID values from SNOT-22 total and domain scores
the standard deviation (SD) of each score-value distribu- (n = 276) were calculated and compared (Table 3). The
tion and multiplying this by 0.5.16 Note that this is also average MCID for SNOT-22 total scores was 9.0, whereas
equivalent to Cohen’s medium effect estimation. To de- average MCIDs for the rhinologic domain, extranasal rhi-
termine the standard error of measurement (SEM) MCID nologic domain, ear/facial domain, psychological domain,
score, we first calculated the internal reliability of each score and sleep domain score were 3.8, 2.4, 3.2, 3.9, and 2.9,
1151 International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017
Chowdhury et al.
International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017 1152
MCID values for SNOT-22 domains
TABLE 3. Results of distribution-based methods for determining MCID values for SNOT-22 scores
Cohen’s effect
Preoperative measures 0.5 SD SEM size (d)a MDC Mean MCID
1153 International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017
Chowdhury et al.
TABLE 5. Results of ROC curve methods for determining MCID values for SNOT-22 scores
in minimization of fluctuations due to measurement error. an improvement in SNOT-22 scores alone may not change
Cohen’s d estimate was the lowest threshold for the MCID health utility as captured by the SF-6D. In a clinical con-
across all scores, representing the smallest acceptable value text, this makes sense, as many of the questions on utility
of the MCID. Conversely, the MDC set the most conserva- questionnaires are designed to capture systemic quality-of-
tive MCID threshold, thus increasing the specificity of the life changes that may not be measured by a disease-specific
MCID as a measure of true change at the expense of sensi- questionnaire. A separate consideration is that even a suc-
tivity. This higher value is the result of the MDC calculation cessful anchor-based ROC analysis with greater AUC val-
imposing a 95% confidence interval around the SEM. All ues would provide a wide range of possible MCID values
approaches are statistically valid methods, highlighting the with respective sensitivities and specificities, and several
fact that, ultimately, there is no “true” MCID value for all valid definitions for optimal MCID values. Future work
measured populations, and that the validity of the MCID within our group will continue to define these relationships
is dependent on its acceptance by physicians who use the between PROMs to determine whether alternative disease-
instrument routinely. For treatment outcomes research into specific questionnaires are better equipped to reflect im-
CRS, 8.9 is generally accepted as the MCID threshold for provements in global health utility than others.
improvement after sinus surgery. In this investigation we Having estimated the MCID for the 5 SNOT-22 subdo-
found that an equally weighted average MCID value ob- mains, we next turn our attention to the practical utility
tained by the 4 methods described was 9.0, approximat- and interpretation of these values. They are of particular
ing that previously published finding. The reported average relevance to the field of rhinologic outcomes research, as
SNOT-22 domain MCID values are therefore likely to be they enable researchers to compare posttreatment changes
the best current approximations for clinically interpreting in domain scores against a minimum threshold to determine
postoperative changes in patients following ESS. whether statistically significant improvements are actually
It may seem somewhat atypical to use statistical deter- clinically impactful. For example, a recent article3 inves-
minations alone to define clinically important differences, tigating improvements in psychological dysfunction after
and one could argue that reaching a minimum statistical ESS found that the mean improvement was 7.4 points. Al-
posttreatment difference is a necessary but not sufficient though this was a statistically significant result, it is diffi-
condition of a clinically important difference. An alterna- cult to determine whether 7.4 points on a 35-point con-
tive methodology, the anchor-based MCID calculation, at- tinuum is a discernible result for an individual patient or
tempts to address these concerns by comparing a change in just statistical “noise” within the range of patient varia-
PROM to a “gold standard” measure of health status.23, 29 tion. With the findings of this investigation estimating the
In the current study we attempted to use the SF-6D as an psychological domain MCID as 3.9, we have some guid-
anchor by which to compare changes in SNOT-22 total ance that, in properly selected patients with CRS, ESS can
and domain scores. Although statistically significant rela- actually provide clinically meaningful improvements in dis-
tionships were noted between improvements in the SF-6D cernible psychological function. This in turn allows clini-
and the SNOT-22, the ROC analysis showed poor diag- cians to properly counsel patients about the potential bene-
nostic accuracy across SNOT-22 total and domain scores, fits of surgery in the preoperative setting. In addition, as the
thus limiting its use as an anchor measure for MCID cal- breadth of clinical treatment options in rhinology grows,
culation. Fundamentally, the poor diagnostic accuracy of having MCID values for the SNOT-22 domains will allow
the AUC analysis is a sign that, although the SNOT-22 for a better assessment of how a particular intervention
and its domain scores are appropriate measures of disease- impacts quality of life. To illustrate this point, consider a
specific symptom severity, the questionnaire does not nec- hypothetical scenario in which 2 competing interventions
essarily predict overall health utility well. Put another way, for CRS have similar total SNOT-22 improvements, but
International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017 1154
MCID values for SNOT-22 domains
only 1 achieves an MCID of improvement in the sleep dys- options for CRS. This transitions to a general limitation of
function domain. When selecting the appropriate clinical the MCID as a metric of minimum change—ultimately, it is
option for a CRS patient who has significant impairment a useful heuristic rather than an absolute value applicable
in sleep dysfunction, the clinician may be more inclined in all instances. In our analysis, health utility, as measured
to select the one that will achieve an MCID in the sleep by the SF-6D, did not provide sufficient diagnostic util-
domain, even if it may not do so in the other symptom ity to perform a robust anchor-based analysis, in contrast
domains. to earlier work by our group investigating the Brief Smell
There are several limitations to discuss within the context Identification Test (BSIT), in which a successful anchor-
of these findings. First, our analysis was based on SNOT-22 based approach was used to confirm the distribution-based
score distributions from a sample of patients with medically findings.30 Nevertheless, having a frame of reference to in-
refractory CRS, enrolled at academic, tertiary care centers. terpret the clinical significance of changes in PROM scores
The MCID values presented here may not be generalizable has significant utility in outcomes research, and the previ-
to SNOT-22 score improvements obtained in patients with ously discussed agreement of our mean scores with widely
other sinonasal diagnoses or in other clinical environments. accepted anchor-based measures of the MCID lends cred-
In addition, our analysis included only patients who under- ibility to the distribution approach in this particular case
went surgical management of CRS. This was a conscious and offers a useful benchmark to interpret domain score
decision made to mirror the approach of previous studies changes.
that define the MCID for SNOT-22 total scores. In reality,
due to the nature of the distribution-based methodology, Conclusion
there may be alternative MCID values for medical man- Similar to recently published thresholds, the mean MCID of
agement of CRS just as there may be different values for the SNOT-22 total score in this cohort using distribution-
other sinonasal diagnoses and even subgroups of patients based methods was approximately 9.0. The average MCIDs
within our cohort. This represents another gap within the for the rhinologic domain, extranasal rhinologic domain,
current literature that we are aiming to address separately. ear/facial domain, psychological domain, and sleep domain
Of course, it may be that there is some level of discrimi- score were 3.8, 2.4, 3.2, 3.9, and 2.9, respectively. Anchor-
native capability that is inherent within the SNOT-22 that based methods using the SF-6D did not have strong pre-
makes these values close enough to each other that a gen- dictive accuracy across SNOT-22 total scores or individual
eral value is obtained for all disease states and treatment domain scores.
References
1. Hopkins C, Gillett S, Slack R, Lund VJ, Browne JP. 11. International Conference on Harmonisation of Tech- 21. Cohen J. Statistical Power Analysis for the Behavioral
Psychometric validity of the 22-item Sinonasal Out- nical Requirements for Registration of Pharmaceu- Sciences. New York: Routledge Academic; 1988.
come Test. Clin Otolaryngol. 2009;34:447–454. ticals for Human Use. ICH Harmonised Tripar- 22. Samsa G, Edelman D, Rothman ML, Williams GR,
2. DeConde AS, Bodner TE, Mace JC, Smith TL. Re- tite Guideline. Guideline for Good Clinical Practice Lipscomb J, Matchar D. Determining clinically im-
sponse shift in quality of life after endoscopic sinus E6 (R1). http://www.ich.org/products/guidelines/. portant differences in health status measures: a general
surgery for chronic rhinosinusitis. JAMA Otolaryngol Accessed July 2nd, 2017. approach with illustration to the Health Utilities Index
Head Neck Surg. 2014;140:712–719. 12. DeConde AS, Mace JC, Bodner T, et al. SNOT-22 Mark II. Pharmacoeconomics. 1999;15:141–155.
3. Levy JM, Mace JC, DeConde AS, Steele TO, Smith quality of life domains differentially predict treatment 23. Copay AG, Subach BR, Glassman SD, Polly DW,
TL. Improvements in psychological dysfunction after modality selection in chronic rhinosinusitis. Int Forum Schuler TC. Understanding the minimum clinically im-
endoscopic sinus surgery for patients with chronic rhi- Allergy Rhinol. 2014;4:972–979. portant difference: a review of concepts and methods.
nosinusitis. Int Forum Allergy Rhinol. 2016;6:906– 13. DeConde AS, Mace JC, Alt JA, Rudmik L, Soler ZM, Spine J. 2007;7:541–546.
913. Smith TL. Longitudinal improvement and stability of 24. van der Roer N, Ostelo RWJG, Bekkering GE, van
4. Alt JA, Mace JC, Smith TL, Soler ZM. Endoscopic si- the SNOT-22 survey in the evaluation of surgical man- Tulder MW, de Vet HCW. Minimal clinically impor-
nus surgery improves cognitive dysfunction in patients agement for chronic rhinosinusitis. Int Forum Allergy tant change for pain intensity, functional status, and
with chronic rhinosinusitis. Int Forum Allergy Rhinol. Rhinol. 2015;5:233–239. general health status in patients with nonspecific low
2016;6:1264–1272. 14. Brazier J, Roberts J, Deverill M. The estimation of a back pain. Spine (Phila Pa 1976). 2006;31:578–582.
5. Chowdhury NI, Mace JC, Smith TL, Rudmik L. What preference-based measure of health from the SF-36. 25. Youden WJ. Index for rating diagnostic tests. Cancer.
drives productivity loss in chronic rhinosinusitis? A J Health Econ. 2002;21:271–292. 1950;3:32–35.
SNOT-22 subdomain analysis. Laryngoscope. 2017 15. Walters SJ, Brazier JE. What is the relationship be- 26. Schünemann HJ, Puhan M, Goldstein R, Jaeschke
Jun 10. https://doi.org/10.1002/lary.26723. [Epub tween the minimally important difference and health R, Guyatt GH. Measurement properties and inter-
ahead of print] state utility values? The case of the SF-6D. Health pretability of the Chronic Respiratory Disease Ques-
6. Smith TL, Mace JC, Rudmik L, et al. Comparing sur- Qual Life Outcomes. 2003;1:4. tionnaire (CRQ). COPD. 2005;2:81–89.
geon outcomes in endoscopic sinus surgery for chronic 16. Norman GR, Sloan JA, Wyrwich KW. Interpretation 27. Miller GA. The magical number seven plus or minus
rhinosinusitis. Laryngoscope. 2017;127:14–21. of changes in health-related quality of life: the remark- two: some limits on our capacity for processing infor-
7. Levy JM, Mace JC, Rudmik L, Soler ZM, Smith TL. able universality of half a standard deviation. Med mation. Psychol Rev. 1956;63:81–97.
Low 22-item sinonasal outcome test scores in chronic Care. 2003;41:582–592.
28. Farivar SS, Liu H, Hays RD. Half standard devia-
rhinosinusitis: why do patients seek treatment? Laryn- 17. Cronbach LJ. Coefficient alpha and the internal struc- tion estimate of the minimally important difference
goscope. 2017;127:22–28. ture of tests. Psychometrika. 1951;16:297–334. in HRQOL scores? Expert Rev Pharmacoecon Out-
8. DeConde AS, Mace JC, Levy JM, Rudmik L, Alt JA, 18. Wyrwich KW, Nienaber NA, Tierney WM, Wolinsky comes Res. 2004;4:515–523.
Smith TL. Prevalence of polyp recurrence after en- FD. Linking clinical relevance and statistical signifi- 29. Jaeschke R, Singer J, Guyatt GH. Measurement of
doscopic sinus surgery for chronic rhinosinusitis with cance in evaluating intra-individual changes in health- health status. Ascertaining the minimal clinically im-
nasal polyposis. Laryngoscope. 2017;127:550–555. related quality of life. Med Care. 1999;37:469–478. portant difference. Control Clin Trials. 1989;10:407–
9. Rosenfeld RM, Piccirillo JF, Chandrasekhar SS, et al. 19. Wyrwich KW, Tierney WM, Wolinsky FD. Further 415.
Clinical practice guideline (update): adult sinusitis. evidence supporting an SEM-based criterion for iden- 30. Levy JM, Mace JC, Bodner TE, Alt JA, Smith TL.
Otolaryngol Head Neck Surg. 2015;152(Suppl):S1– tifying meaningful intra-individual changes in health- Defining the minimal clinically important difference
S39. related quality of life. J Clin Epidemiol. 1999;52:861– for olfactory outcomes in the surgical treatment of
10. Fokkens WJ, Lund VJ, Mullol J, et al. European Posi- 873. chronic rhinosinusitis. Int Forum Allergy Rhinol.
tion Paper on Rhinosinusitis and Nasal Polyps 2012. 20. Beaton DE. Understanding the relevance of measured 2017;7:821–826.
Rhinol Suppl. 2012;(23):3 p preceding table of con- change through studies of responsiveness. Spine (Phila
tents, 1–298. Pa 1976). 2000;25:3192–3199.
1155 International Forum of Allergy & Rhinology, Vol. 7, No. 12, December 2017