Professional Documents
Culture Documents
Ferreira JCE2013
Ferreira JCE2013
Ferreira JCE2013
net/publication/256489554
CITATIONS READS
54 320
7 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Raymond W Ostelo on 16 January 2018.
Abstract
Objective: The aim of this study was to determine the smallest worthwhile effects of two treatments for nonspecific low back pain
(LBP).
Study Design and Setting: The benefiteharm trade-off method was used to estimate the smallest worthwhile effect of nonsteroidal
anti-inflammatory drugs (NSAIDs) and physiotherapy for LBP. Patients seeking care for chronic LBP were interviewed by telephone before
treatment commenced and 4 weeks later.
Results: Patients need to see a median of 30% (interquartile range [IQR]: 10e40) more improvement in pain and 20% (IQR: 10e40)
more improvement in disability than would occur without intervention to perceive the effect of NSAIDs are worthwhile. They would need
to see 20% (IQR: 0e30) more improvement on pain and disability over natural recovery to perceive that the effect of physiotherapy was
worthwhile. There was no difference in estimates of the smallest worthwhile effect elicited at baseline and 4 weeks later.
Conclusions: People with chronic back pain need to see larger effects on pain of NSAIDS than physiotherapy to consider the effects of
these interventions worthwhile. These estimates of the smallest worthwhile effect can be used to interpret the findings of clinical trials and
to design adequately powered clinical trials. Ó 2013 Elsevier Inc. All rights reserved.
Keywords: Minimum clinically important difference; Sufficiently important difference; Low back pain; Clinical trials; Research design; Drug therapy
interviewer then outlined in lay and general terms how the Table 1. Participant characteristics at baseline and after 4 weeks of
two interventions (NSAIDs and physiotherapy) were usu- treatment (N 5 102)
ally administered, as well as their costs and adverse events. Sample characteristics Mean (SD)
After the description, each participant was asked how much Age (yr) 45.2 (12.8)
additional improvement in symptom severity (in addition to Duration of symptoms (yr) 6.9 (9)
natural recovery) he or she would expect if given this treat- Number of previous episodes of low back pain 1 (2.6)
Pain severity at baseline (0e10) 3.5 (1.9)
ment. All participants indicated that they believed the ef- Pain severity at follow-up (0e10) 2.8 (2)a
fects of the magnitude they expected to experience were Disability level at baseline (0e24) 7.7 (4.9)
large enough to make the intervention worthwhile. The size Disability level at follow-up (0e24) 5.3 (5.2)a
of this hypothetical effect was then progressively reduced; Depression at baseline (0e42) 3.0 (6.3)
and with each iteration, the participants were asked again Anxiety at baseline (0e42) 2.6 (4.7)
Stress at baseline (0e42) 6.8 (7.9)
whether they would consider the effect large enough to Global perceived effect of treatment (11-point) 2.5 (1.7)
make the costs, risks, and inconveniences of intervention Gender (women), n (%) 78 (80)
worthwhile. The smallest expected benefit of intervention Educational level, n (%)
for which the participant would choose to have the inter- High school degree 14 (14)
vention was the smallest worthwhile effect for that patient. College degree 13 (13)
Bachelor/science degree 41 (42)
The procedure was repeated for each outcome (pain, dis- Postgraduate degree 32 (33)
ability, and time to recovery). The effects of each interven- Current smoker, n (%) 8 (8)
tion on pain and disability were expressed as percentages of Past experience with NSAIDs, n (%) 82 (83)
symptom reduction. The effects of each intervention on Past experience with physiotherapy, n (%) 86 (89)
time to recovery were measured in days. Presence of leg pain, n (%) 64 (65)
Work status, n (%)
Once the interview was finished, participants continued Not working 17 (17)
with the treatment initially planned for them. Treatment Working part-time 30 (31)
was chosen at the clinician’s discretion and did not neces- Working full-time 52 (53)
sarily include the interventions involved in the benefite Abbreviations: SD, standard deviation; NSAID, nonsteroidal anti-
harm trade-off interviews. Participants were reinterviewed inflammatory drug.
using the same procedures 4 weeks later. a
Indicates a statistically significant decrease from baseline; P !
0.001.
2.2. Analysis
participating in ongoing treatment (n 5 1). Participants’
Regression methods were used to quantify associations characteristics are presented in Table 1.
between estimates of smallest worthwhile effect and base-
line pain intensity, disability, mood (depression, stress,
3.1. Smallest worthwhile effects of interventions before
and anxiety), and duration of pain. These potential predic-
treatment commencement
tors were chosen a priori. First, univariate analyses were
performed. The strongest predictors (P ! 0.2) were then Figures 1, 2 and 3 present the distribution of estimates of
entered in a multivariate model, and a backward stepwise the smallest worthwhile effects for pain, disability and days
approach was used to remove variables that did not contrib- to recover, respectively.
ute significantly to the model. The confidence intervals Approximately 25% of the participants stated they would
(CIs) for regression coefficients were obtained using non- not take NSAIDs for back pain, irrespective of the additional
parametric bootstrap methods (BCa CIs) [21]. Data were benefits provided, after considering the costs and risks in-
analyzed using STATA 12 (StataCorp. 2011; Stata Statisti- volved. The remaining participants (n 5 77) stated that they
cal Software: Release 12. StataCorp LP, College Station, would need to see a median of 30% (interquartile range
TX). This study was funded by the Clive and Vera Rama- [IQR]: 10e40) more improvement in pain and a median
ciotti Foundation, Australia. 20% (IQR: 10e40) more improvement in disability than they
would with no treatment to make the costs, risks, and incon-
veniences of NSAIDs worthwhile. Participants also stated
3. Results they would need to recover a median of 7 (IQR: 4e9;
n 5 77) days sooner than they would with no treatment to
A total of 151 consecutive patients seeking physiother- make the costs, risks, and inconveniences of NSAIDs worth-
apy in private practices for back pain were invited to partic- while (Table 2).
ipate in the study. Of these, 102 were eligible to participate, Approximately 7% of the participants said that they
consented, and were included in the study. Reasons for non- would not choose to have physiotherapy, irrespective of
inclusion were: (1) unwilling to participate in the inter- its benefits. The remaining 95 participants nominated that
views (n 5 30), (2) not presenting with nonspecific LBP they would need to see a median of 20% or 20 points on
(n 5 11), (3) disconnected telephone (n 5 3), and (4) a 100-point scale (IQR: 0e30) improvement on pain and
1400 M.L. Ferreira et al. / Journal of Clinical Epidemiology 66 (2013) 1397e1404
A A
50th pc 90th pc
50th pc 90th pc
30
30
Frequency
Frequency
20
20
10
10
0
0 20 40 60 80
0
B B
50th pc 90th pc
50th pc 90th pc
30
30
Frequency
Frequency
20
20
10
10
0
0
0 20 40 60 80
0 20 40 60 80
Smallest worthwhile effect (/100)
Smallest worthwhile effect (/100)
Fig. 2. Distribution of estimates of the smallest worthwhile effects of
Fig. 1. Distribution of estimates of the smallest worthwhile effects of
disability at baseline. The corresponding 50th and 90th percentiles
pain at baseline. The corresponding 50th and 90th percentiles are in-
are indicated by arrows.
dicated by arrows.
disability compared with no treatment to perceive that the (Table 4). The associations were negative, indicating that
effect of physiotherapy was worthwhile. Participants per- the longer the duration of symptoms, the smaller the esti-
ceived that they needed to recover a median of 5 (IQR; mates of smallest worthwhile effect. No other factors were
0e6; n 5 96) days sooner than they would with natural re- retained in the multivariate models (P O 0.20).
covery to make the effects of physiotherapy worthwhile.
Additionally, 90% of participants would consider physio-
3.2. Smallest worthwhile effect of interventions
therapy worthwhile if they experienced an additional 40%
following a course of treatment
improvement on pain and disability (Table 2).
Physiotherapy had significantly smaller smallest worth- A total of 95 (93%) participants were followed-up 4
while effects than NSAIDs for pain (paired t-test, weeks after the first interview to assess whether estimates
P 5 0.009) and recovery (P ! 0.001), but not disability of the smallest worthwhile effect change after a course of
(P 5 0.065, Table 2). No univariate associations were ob- treatment. Reasons for declining participation in follow-
served among the smallest worthwhile effect for pain, dis- ups were unwilling to participate in the interviews (n 5 5)
ability, or recovery and the predictors of baseline pain and disconnected telephone (n 5 2).
severity, disability severity, and mood status (Table 3). Du- On average, at follow-up, participants had significantly
ration of symptoms was significantly associated with esti- reduced levels of pain (mean change from baseline: 0.8;
mates of smallest worthwhile effect of physiotherapy for 95% CI: 0.3, 1.2 on an 11-point scale; P 5 0.001) and dis-
pain and disability, but not with days to recovery. A multi- ability (mean change from baseline: 2.6; 95% CI: 1.9, 3.3;
variate analysis showed that duration of symptoms was sig- P ! 0.0001). There were small, statistically nonsignificant
nificantly associated with smallest worthwhile effect on increases in estimates of smallest worthwhile effect of
pain and disability after adjusting for pain severity NSAIDs on pain (mean change from baseline: 2.0; 95%
M.L. Ferreira et al. / Journal of Clinical Epidemiology 66 (2013) 1397e1404 1401
4. Discussion
These data demonstrate that typically, a patient with
Frequency
20
0 5 10 15
by at least 15 points on a 100-point scale and hasten time to
Smallest worthwhile effect (/100) recovery by at least 2 days to be worth the costs, risks, and
inconveniences of physiotherapy. These perceptions do not
B seem to be associated with severity of symptoms, mood, or
50
symptom intensity.
This is the first time the benefiteharm trade-off method
40
30
0 5 10
smallest worthwhile effect are needed if the estimates are to
Smallest worthwhile effect (/100) be used in the design and interpretation of randomized clin-
ical trials and meta-analyses [2]. We have recently demon-
Fig. 3. Distribution of estimates of the smallest worthwhile effects of
strated how estimates of the smallest worthwhile effect of
days to recover at baseline. The corresponding 50th and 90th percen-
tiles are indicated by arrows. interventions can be used to calculate sample sizes of future
trials using extended contour funnel plots [23]. We suggest
that a new trial should be designed to provide enough
CI: 3.4, 7.5), disability (mean change from baseline: 3.5; power to convert uncertainty regarding the clinical signifi-
95% CI: 1.3, 8.3), and days to recovery (mean change from cance of current evidence into certainty that treatment ef-
baseline: 0.1; 95% CI: 1, 1). Estimates of smallest worth- fects are worthwhile.
while effect of physiotherapy on pain (mean change from Typically, patients seeking care for chronic LBP per-
baseline: 3.5; 95% CI: 1, 8.1), disability (mean change ceive that the average effects of NSAIDs and exercise re-
from baseline: 3.9; 95% CI: 0.4, 8.1), and days to recovery ported in randomized trials and systematic reviews are
(mean change from baseline: 0.6; 95% CI: 0.2, 1.5) were too small to make the interventions worth their costs, risks,
Table 2. Estimates of the smallest worthwhile effect of interventions for back pain
NSAIDs Physiotherapy
10th 50th 90th 10th 50th 90th Mean effect
Parameters N Mean (SD) Percentile Percentile Percentile N Mean (SD) Percentile Percentile Percentile (95% CI); n
Baseline
Paina 77 29.0 (19.6) 10 30 60 95 19.0 (17.8) 0 20 40 7.4 (1.9, 13); 74
Disabilitya 74 26.3 (17.9) 10 20 50 95 18.2 (18.0) 0 15 40 4.8 (0.3, 10); 72
Recoveryb 77 6 (3.8) 0 7 11 96 3 (3.5) 0 2 8 2.6 (1.6, 3.6); 75
4 wk
Paina 55 31.6 (21.1) 3 30 65 82 21.2 (18.2) 0 30 50 5.7 (0.4, 11); 53
Disabilitya 55 31.5 (20.2) 3 30 65 82 20.9 (19.4) 0 30 50 4.7 (1.1, 10.5); 53
Recoveryb 53 6 (3.4) 0 6 11 82 4 (3.6) 0 4 9 1 (0.2, 1.9); 52
Abbreviations: NSAID, nonsteroidal anti-inflammatory drug; SD, standard deviation; CI, confidence interval.
a
Percent improvement.
b
Days.
1402 M.L. Ferreira et al. / Journal of Clinical Epidemiology 66 (2013) 1397e1404
Depressiona (0e42)
by, on average, 12.4 points (95% CI: 15.5, 9.7) com-
pared with placebo in people with chronic LBP [24]. Our
findings suggest that a typical patient seeking care for
chronic LBP would not consider these effects large enough
to be worthwhile.
The benefiteharm trade-off method has been success-
fully used since 1978 to ascertain the smallest worthwhile
0.32 (0.95, 1.26; 0.51)
0.16 (0.73, 1.34; 0.76)
0.05 (0.10, 0.18; 0.49)
Predictors that reached a level of significance of 0.2 or less were entered in multivariate analyses.
Duration of symptoms (yr)
Disability
Parameters
Pain
detect effect sizes that are large enough to meet the needs
b
a
Table 4. Multivariate regression coefficients (95% CI; P-value) for predictors of the smallest worthwhile effect for low back pain
Physiotherapy
Parameters Pain severity (0e10) Duration of symptoms (wk)
Pain 1.23 (3.17, 0.71; 0.21) 0.45 (0.82, 0.12; 0.01)
Disability 1.32 (3.10, 0.46; 0.15) 0.45 (0.81, 0.09; 0.02)
Abbreviation: CI, confidence interval.
The same applies when interpreting the clinical significance patients seeking other types of care for their LBP. Another
of treatment effects provided in clinical trials: readers of re- limitation is that, in the benefiteharm trade-off method,
ports of clinical trials might consider that the effects of the subjects are presented with a series of scenarios that differ
intervention examined in the randomized trial were big in just one attributedthe hypothetical size of the effect of
enough to be worthwhile if they exceeded the smallest intervention, whereas all other attributes are either explic-
worthwhile effect of the average patient, or alternatively, itly held constant or left undefined. This might explain
of 10% or 90% of the patients. why the smallest worthwhile effect of intervention can vary
In this study, we focus on the clinical importance of group greatly among participants [8,27]: it could be that the par-
differencesdnot on individual patient improvementsd ticipants assign different values to undefined attributes.
and argue that the findings can be used by clinicians and In the present study, we use a stated preference method
policy makers to select treatment programs with worthwhile to estimate the smallest worthwhile effect. Stated pre-
effects and researchers to design clinical trials powered to ferences methods (for treatments) have been previously
detect worthwhile effects. It is usually not possible, after a compared [28] and criterion validated against revealed pref-
period of intervention, to determine if an individual patient erences methods [29]. In the specific context of the present
actually experienced an effect of intervention that was at least study, cross-validation with revealed preferences is not pos-
as large as his or her smallest worthwhile effect because the sible because it is not possible to observe a treatment effect
effects of interventions on an individual usually cannot be as- on an individual [4]. A possible alternative is to validate
certained. We cannot know what the effect of intervention is findings of the benefiteharm trade-off study by identifying
on any individual patient because the effect of intervention is which attributes of intervention are most highly valued by
the difference in outcomes with and without intervention, but patients when nominating the smallest worthwhile effect.
we can only observe outcomes with or without intervention at This can be done using other stated preference methods
any one time [4]. such as the discrete choice experiments. We are currently
We acknowledge that the smallest worthwhile effect of conducting a discrete choice study to cross-validate esti-
an intervention may depend on perspective. Thus, patients mates of the smallest worthwhile effect in older people
may have different judgments to clinicians, health service who are offered to participate in physical activity programs
providers, or policy makers about how big the effect of an to prevent falls.
intervention needs to be to justify costs, risks, and incon-
veniences. However, a minimum requirement for provi-
sion of an intervention is that the potential consumers of Acknowledgments
health care services can expect that the intervention will
have a worthwhile effect. For that reason, we have focused R.D.H. is supported by the National Health and Medical
on the perspective of patients. We acknowledge that al- Research Council of Australia and J.L. is supported by the
though this perspective is necessary, it is not sufficient, Australian Research Council. None of the authors have any
and that other criteria such as cost-effectiveness are neces- financial or other relationships that might lead to a conflict
sary for comprehensive justification of provision of a of interest. All authors have contributed significantly to the
health intervention. study.
A limitation of this study is that these estimates might
have been influenced by the characteristics of the study
sample, which may not be broadly representative of people Appendix
seeking physiotherapy for persistent LBP. It seems, for ex-
ample, that the sample consisted of a large proportion of Supplementary data
well-educated people, and the sample was restricted to peo- Supplementary data related to this article can be found
ple who were seeking physiotherapy care. This might ex- online at http://dx.doi.org/10.1016/j.jclinepi.2013.02.018.
plain some or all of the observed difference in estimates
of the smallest worthwhile effect for NSAIDs and physio-
therapy. The estimates should be interpreted in the context References
of the included population, as it is possible that different es- [1] Jaeschke R, Singer J, Guyatt GH. Ascertaining the minimal clinically
timates would be elicited from different populations or important difference. Control Clin Trials 1989;10:407e15.
1404 M.L. Ferreira et al. / Journal of Clinical Epidemiology 66 (2013) 1397e1404
[2] Ferreira ML, Herbert RD, Ferreira PH, Latimer J, Ostelo RW, [17] Jansen S, Lievit J, Nooij M, Haes J, Overpelt I, Slooten H, et al. Pa-
Nascimento DP, et al. A critical review of methods used to determine tients’ preferences for adjuvant chemotherapy in early-stage breast
the smallest worthwhile effect of interventions for low back pain. cancer: is treatment worthwhile? Br J Cancer 2001;84:1577e85.
J Clin Epidemiol 2012;65:253e61. [18] Petherick E, O’Meara S, Spilsbury K, Iglesias C, Nelson E,
[3] Herbert RD, Jamtvedt G, Mead J, Hagen KB. Outcome measures Torgerson D. Patient acceptability of larval therapy for leg ulcer treat-
measure outcomes, not effects of intervention. Aust J Physiother ment: a randomised survey to inform the sample size calculation of
2005;51:3e4. a randomised trial. BMC Med Res Methodol 2006;6:43e6.
[4] Rubin DB. Formal modes of statistical inference for causal effects. [19] Maxwell S. Sample size and multiple regression analysis. Psychol
J Stat Plan Inference 1990;25:279e92. Methods 2000;5:434e58.
[5] Heckman JJ. The scientific model of causality. Sociol Methodol [20] Antony MM, Bieling PJ, Cox BJ, Enns MW, Swinson RP. Psycho-
2005;35:1e97. metric properties of the 42-item and 21-item versions of the De-
[6] Barrett B, Brown D, Mundt M, Brown R. Sufficiently important dif- pression Anxiety Stress Scales (DASS) in clinical groups and
ference: expanding the framework of clinical significance. Med Decis a community sample. Psychol Assess 1998;10:176e81.
Making 2005;25:250e61. [21] Efron B, Tibshirani RJ. An introduction to the bootstrap. Monographs
[7] Barrett B, Brown R, Mundt M, Dye L, Alt J, Safdar N, et al. Us- on statistics and applied probability. New York, NY; London, UK:
ing benefit harm tradeoffs to estimate sufficiently important differ- Chapman & Hall; 1993:xvi, 436.
ence: the case of the common cold. Med Decis Making 2005;25: [22] Ferreira ML, Herbert RD. What does ’clinically important’ really
47e55. mean? Aust J Physiother 2008;54:229e30.
[8] Barrett B, Haraban B, Brown D, Zhang Z, Brown R. Sufficiently im- [23] Ferreira ML, Herbert RD, Crowther MJ, Verhagen A, Sutton AJ.
portant difference for common cold: severity reduction. Ann Fam When is a further clinical trial justified? BMJ 2012;345:e5913.
Med 2007;5:216e23. [24] Kuijpers T, van Middelkoop M, Rubinstein S, Ostelo R, Verhagen A,
[9] Duric VM, Butow PN, Sharpe L, Heritier S, Boyle F, Beith J, et al. Com- Koes B, et al. A systematic review on the effectiveness of pharmaco-
paring patients’ and their partners’ preferences for adjuvant chemother- logical interventions for chronic non-specific low back pain. Eur
apy in early breast cancer. Patient Educ Couns 2008;72(2):239e45. Spine J 2011;20:40e50.
[10] McNeil BJ, Weichselbaum R, Pauker SG. Fallacy of the five-year sur- [25] Blinman P, King M, Norman R, Viney R, Stockler M. Preferences for
vival in lung cancer. N Engl J Med 1978;299:1397e401. cancer treatments: an overview of methods and applications in oncol-
[11] Bremnes RM, Andersen K, Wist EA. Cancer patients, doctors and ogy. Ann Oncol 2012;23:1104.
nurses vary in their willingness to undertake cancer chemotherapy. [26] Ostelo RW, Deyo RA, Stratford P, Waddell G, Croft P, Von Korff M,
Eur J Cancer 1995;31A:1955e9. et al. Interpreting change scores for pain and functional status in low
[12] Duric V, Stockler M, Heritier S, Boyle F, Beith J, Sullivan A, et al. back pain: towards international consensus regarding minimal impor-
Patients’ preferences for adjuvant chemotherapy in early breast can- tant change. Spine 2008;33(1):90e4.
cer: what makes AC and CMF worthwhile now? Ann Oncol 2005;16: [27] Ferreira ML, Ferreira PH, Herbert RD, Latimer J. People with low
1786e94. back pain typically need to feel ‘much better’ to consider intervention
[13] Hirose T, Horichi N, Ohmori T, Kusumoto S, Sugiyama T, Shirai T, worthwhile: an observational study. Aust J Physiother 2009;55:
et al. Patients preferences in chemotherapy for advanced non-small- 123e7.
cell lung cancer. Intern Med 2005;44:107e13. [28] Onwujekwe O, Dike N, Ojukwu J, Uzochukwu B, Ezumah N, Shu E,
[14] McNeil BJ, Weichselbaum R, Pauker SG. Speech and survival: trade- et al. Consumers stated and revealed preferences for community
offs between quality and quantity of life in laryngeal cancer. N Engl J health workers and other strategies for the provision of timely and
Med 1981;305:982e7. appropriate treatment of malaria in southeast Nigeria. Malar J
[15] Silvestri G, Pritchard R, Welch HG. Preferences for chemotherapy in 2006;5:117.
patients with advanced non-small cell lung cancer: descriptive study [29] Onwujekwe O. Criterion and content validity of a novel structured
based on scripted interviews. BMJ 1998;317:771e5. haggling contingent valuation question format versus the bidding
[16] Yellen SB, Cella DF, Leslie WT. Age and clinical decision making in game and binary with follow-up format. Soc Sci Med 2004;58(3):
oncology patients. J Natl Cancer Inst 1994;86:1766e70. 525e37.