Professional Documents
Culture Documents
Evidence For Physiotherapy Practice: A Survey of The Physiotherapy Evidence Database (Pedro)
Evidence For Physiotherapy Practice: A Survey of The Physiotherapy Evidence Database (Pedro)
Evidence For Physiotherapy Practice: A Survey of The Physiotherapy Evidence Database (Pedro)
Evidence-based practice involves the use of evidence from systematic reviews and randomised controlled trials, but the
extent of this evidence in physiotherapy has not previously been surveyed. The aim of this survey is to describe the quantity
and quality of randomised controlled trials and the quantity of systematic reviews relevant to physiotherapy. The
Physiotherapy Evidence Database (PEDro) was searched. The quality of trials was assessed with the PEDro scale. The
search identified a total of 2,376 randomised controlled trials and 332 systematic reviews. The first trial was published in 1955
and the first review was published in 1982. Since that time, the number of trials and reviews has grown exponentially. The
mean PEDro quality score has increased from 2.8 in trials published between 1955 and 1959 to 5.0 for trials published
between 1995 and 1999. There is a substantial body of evidence about the effects of physiotherapy. However, there remains
scope for improvements in the quality of the conduct and reporting of clinical trials. [Moseley AM, Herbert RD, Sherrington
C and Maher CG (2002): Evidence for physiotherapy practice: A survey of the Physiotherapy Evidence Database
(PEDro). Australian Journal of Physiotherapy 48: 43-49]
Key words: Evidence-Based Medicine; Meta-Analysis; Physical Therapy; Randomized Controlled Trials
Systematic reviews
2000 500
Number of trials
400
1500
300
1000
200
500
100
0 0
1955
1960
1965
1970
1975
1980
1985
1990
1995
2000
0 1 2 3 4 5 6 7 8 9 10
Figure 1. Cumulative number of randomised controlled Figure 2. Frequency distribution of the total PEDro score
trials and systematic reviews in physiotherapy by year. for the 2,297 randomised controlled trials with quality
ratings.
established by expert consensus (items: eligibility criteria the physiotherapy profession for a number of reasons. This
specified, subjects randomly allocated to groups, concealed information can assist the profession in identifying areas of
allocation, groups similar at baseline, blinding of subjects, practice that have not been well evaluated. Most
therapists and assessors, intention-to-treat analysis, point importantly, this information could guide the profession in
measures and measures of variability reported). Two deciding what services should be part of contemporary
additional items not on the Delphi list have been included physiotherapy practice. An evidence-based practice
in the PEDro scale (ie outcome measures obtained from perspective would suggest that the profession should
more than 85% of subjects and reporting of results of discontinue using treatments that are clearly ineffective and
between-group statistical comparisons). Four PEDro scale should consider take-up of new treatments when there is
items have been validated empirically in the medical evidence that they are effective.
literature: randomisation, concealed allocation, blinding,
and adequacy of follow-up (Moher et al 1999). The other This survey aims to describe the quality and quantity of
items have face validity but are yet to be empirically randomised controlled trials and the quantity of systematic
validated. The PEDro scale has been shown to have reviews indexed on PEDro. As PEDro probably is the most
acceptable inter-rater reliability (Moseley et al 1999). complete database of trials and reviews in physiotherapy,
Observed agreement for individual PEDro items ranged this survey will provide some insights into the scope of
from 70% (groups similar at baseline) to 96% (blinding of evidence that can be used to guide decisions about the
therapists), and the intraclass correlation coefficient for the effects of therapy. The PEDro database is unique in that it
total PEDro score (ie the score derived from adding all provides a measure of the quality of the randomised
PEDro scale items except the specification of eligibility controlled trials it indexes. This permits statements to be
criteria) was 0.54. To increase the accuracy of quality made about the quality of randomised controlled trials in
ratings on the PEDro database, each trial is independently physiotherapy.
rated by two reviewers, with a third rater arbitrating on
items where consensus cannot be reached.
Method
The extent of the external evidence relevant to neurological
physiotherapy archived on PEDro has recently been Data were extracted from PEDro on April 17, 2001. The
surveyed (Moseley et al 2000). There were 238 randomised authors, title, source, year of publication, publication type
controlled trials (54% were categorised as being of (randomised controlled trial or systematic review) and
moderate to high quality, rating five or more out of 10 on quality scores for the records were analysed. Three
the PEDro scale) and 27 systematic reviews relevant to analyses were performed. The first focused on the amount
neurological physiotherapy. However, the quantity and of external evidence available each year. Cumulative totals
quality of the external evidence on therapy relevant to other of randomised controlled trials and systematic reviews
subdisciplines of physiotherapy or physiotherapy in general were calculated. The relationship between the quantity of
has not been surveyed. Information on the quantity and reviews and trials and the year of publication was described
quality of the external evidence on therapy is important to using least-squares non-linear curve fitting.
0 1 2 3 4 5 6 7 8
0 20 40 60 80 100
Total PEDro score % of randomised controlled trials
Figure 3. Mean and standard deviation of the total PEDro Figure 4. Percentage of rated randomised controlled trials
score of rated randomised controlled trials in physiotherapy that satisfied each item of the PEDro scale.
for each 5-year period since 1955 (plus the trials published
in 2000 or 2001). The number of trials for each epoch is
indicated at the end of each bar.
The second analysis investigated the methodological number of randomised controlled trials has increased
quality of the randomised controlled trials. The analysis of rapidly and approximately exponentially since then (Figure
methodological quality was performed on the sub-set of 1). An exponential growth curve fits the data well (number
trials that had been quality rated using the PEDro scale (the of trials = e 0.18 x years since 1955; r2 = 0.94). An average of 155
remaining trials had bibliographic information entered on trials were published each year from 1995 to 1999.
the PEDro database but had not yet been rated). The mean
and standard deviation of the total PEDro score (ie the The first archived systematic review relevant to
score derived by adding all scale items except for Item 1, physiotherapy was published in 1982 and investigated the
the specification of eligibility criteria) for five-year epochs effects of weight loss treatments (including exercise) on
and the frequency distribution of the total PEDro score for hypertension (Hovell 1982). Since then, there has been an
all rated trials were calculated. In addition, the percentage exponential increase in the number of systematic reviews,
of trials satisfying each of the items on the PEDro scale with 59 published in 1999. Again, an exponential growth
was determined. curve fits the data very well (number of reviews =
e 0.33 x years since 1982; r2 = 0.99).
The last analysis examined the evidence available for each
subdiscipline of physiotherapy. All PEDro records were Of the 2,376 randomised controlled trials archived on
coded for up to three areas of physiotherapy PEDro at April 17, 2001, 2,297 (or 97%) had been rated for
(cardiothoracics, continence and women’s health, methodological quality at least once and 1,235 (52%) had
ergonomics and occupational health, gerontology, ratings confirmed by a second or third rater (the remaining
musculoskeletal, neurology, orthopaedics, paediatrics, and 79 trials had bibliographic information entered on the
sports – see Appendix 2). The number of randomised PEDro database but had not yet been rated). Trial quality
controlled trials and systematic reviews were calculated for ranged from 0/10 to 10/10, with most rating 4 or 5 (Figure
each area of physiotherapy. 2). Fifty-two per cent of trials were nominally of moderate
to high quality, rating 5 or more on the PEDro scale.
Results The quality of randomised controlled trials in
physiotherapy has increased gradually since 1955 (Figure
There were 2,708 records indexed on PEDro. Of these, 3). The mean total PEDro score has increased from 2.8 for
2,376 were randomised controlled trials and 332 were trials published between 1955 and 1959, to 5.0 for trials
systematic reviews (that is, one systematic review for every published between 1995 and 1999 (the mean for trials
7.2 trials)(a). Most records (93%) were in English. The first published in 2000 or 2001 is 5.7).
archived trials in physiotherapy were published in 1955(a),
about seven years after the first modern randomised The percentage of rated randomised controlled trials that
controlled trial in medicine (Medical Research Council satisfy each item on the PEDro scale is illustrated in Figure
1948). These trials investigated manipulation for low back 4. Most trials (94%) fulfilled the randomisation item of the
pain (Coyer and Curwen 1955) and wax baths for PEDro scale (ie subjects were randomly allocated to
rheumatoid arthritis (Harris and Millard 1955). The experimental conditions). This is not surprising given that
MacIntyre DL, McAuley CA and Parker-Taillon D (1999): Appendix 1: Criteria for the PEDro scale
Canadian physiotherapy research and evidence-based
practice initiative in the 1990s. Physical Therapy Reviews 1 eligibility criteria were specified
4: 127-137.
Medical Research Council (1948): Streptomycin treatment
2 subjects were randomly allocated to groups (in a
of pulmonary tuberculosis. British Medical Journal crossover study, subjects were randomly allocated an
ii: 769-782. order in which treatments were received)
Moher D, Cook DJ, Jadad AR, Tugwell P, Moher M, Jones 3 allocation was concealed
A, Pham B and Klassen TP (1999): Assessing the quality 4 the groups were similar at baseline regarding the
of reports of randomised trials: implications for the
most important prognostic indicators
conduct of meta-analyses. Health Technology
Assessment 3: 1-98. 5 there was blinding of all subjects
Moseley AM, Maher C, Herbert RD and Sherrington C 6 there was blinding of all therapists who administered
(1999): Reliability of a scale for measuring the the therapy
methodological quality of clinical trials. Proceedings of the
VIIth Cochrane Colloquium. Rome, p. 39. 7 there was blinding of all assessors who measured at
least one key outcome
Moseley A, Sherrington C, Herbert R and Maher C (2000):
The extent and quality of evidence in neurological 8 measures of at least one key outcome were obtained
physiotherapy: an analysis of the Physiotherapy Evidence from more than 85% of the subjects initially allocated
Database (PEDro). Brain Impairment 1: 130-140. to groups
National Health and Medical Research Council (2000): How 9 all subjects for whom outcome measures were
to Use the Evidence: Assessment and Application of available received the treatment or control condition
Scientific Evidence. Canberra: Biotext. as allocated or, where this was not the case, data for
Palmer KNV and Sellick BA (1952): Effect of procaine at least one key outcome was analysed by “intention
penicillin and breathing exercises in postoperative to treat”
pulmonary complications. Lancet 1: 345-346.
10 the results of between-group statistical comparisons
Palmer KNV and Sellick BA (1953): The prevention of are reported for at least one key outcome
postoperative pulmonary atelectasis. Lancet 1: 164-168.
11 the study provides both point measures and measures
Research Committee (Victorian Branch) of the APA and
contributors (1999): Evidence-based practice. Australian of variability for at least one key outcome
Journal of Physiotherapy 45: 167-171. Notes on administration of the PEDro scale:
Sackett DL, Straus SE, Richardson WS, Rosenberg W and
Haynes RB (2000): Evidence-Based Medicine: How to All criteria Points are only awarded when a criterion is
Practice and Teach EBM. (2nd ed.) Edinburgh: Churchill clearly satisfied. If on a literal reading of the trial report it
Livingstone. is possible that a criterion was not satisfied, a point should
Sherrington C, Herbert RD, Maher CG and Moseley AM
not be awarded for that criterion.
(2000): PEDro: A database of randomized trials and Criterion 1 This criterion is satisfied if the report describes
systematic reviews in physiotherapy. Manual Therapy 5: the source of subjects and a list of criteria used to
223-226.
determine who was eligible to participate in the study.
Streitberger K and Kleinhenz J (1998): Introducing a
placebo needle into acupuncture research. Lancet Criterion 2 A study is considered to have used random
352: 364-365. allocation if the report states that allocation was random.
University of York, NHS Centre for Reviews and The precise method of randomisation need not be
Dissemination (2000): Database of Abstracts of Reviews specified. Procedures such as coin-tossing and dice-rolling
of Effectiveness. http://agatha.york.ac.uk/darehp.htm. should be considered random. Quasi-randomisation
(Accessed November 9, 2000.). allocation procedures such as allocation by hospital record
van Tulder M, Ostelo R, Vlaeyen J, Linton S, Morley S and number or birth date, or alternation, do not satisfy this
Assendelft W (2000): Behavioural treatment for chronic criterion.
low back pain. A systematic review within the framework
of the Cochrane Back Review Group. Spine Criterion 3 Concealed allocation means that the person
25: 2688-2699. who determined if a subject was eligible for inclusion in
Verhagen AP, de Vet HC, de Bie RA, Kessels AG, Boers M, the trial was unaware, when this decision was made, of
Bouter LM and Knipschild PG (1998): The Delphi list: a which group the subject would be allocated to. A point is
criteria list for quality assessment of randomised clinical awarded for this criteria, even if it is not stated that
trials for conducting systematic reviews developed by allocation was concealed, when the report states that
Delphi consensus. Journal of Clinical Epidemiology allocation was by sealed opaque envelopes or that
51: 1235-1241. allocation involved contacting the holder of the allocation
Vickers AJ and de Craen AJM (2000): Why use placebos in schedule who was “off-site”.
clinical trials? A narrative review of the methodological
literature. Journal of Clinical Epidemiology 53: 157-161. Criterion 4 At a minimum, in studies of therapeutic
interventions, the report must describe at least one measure (each of) all groups. Measures of variability include
of the severity of the condition being treated and at least standard deviations, standard errors, confidence intervals,
one (different) key outcome measure at baseline. The rater interquartile ranges (or other quantile ranges) and ranges.
must be satisfied that the groups’ outcomes would not be Point measures and/or measures of variability may be
expected to differ, on the basis of baseline differences in provided graphically (for example, SDs may be given as
prognostic variables alone, by a clinically significant error bars in a Figure) as long as it is clear what is being
amount. This criterion is satisfied even if only baseline data graphed (for example, as long as it is clear whether error
of study completers are presented. bars represent SDs or SEs). Where outcomes are
categorical, this criterion is considered to have been met if
Criteria 4, 7-11 Key outcomes are those outcomes which the number of subjects in each category is given for each
provide the primary measure of the effectiveness (or lack of group.
effectiveness) of the therapy. In most studies, more than
one variable is used as an outcome measure.
Criterion 5-7 Blinding means the person in question Appendix 2: PEDro definitions for each
(subject, therapist or assessor) did not know which group sub-discipline of physiotherapy
the subject had been allocated to. In addition, subjects and
therapists are only considered to be “blind” if it could be Cardiothoracics includes, but is not restricted to, papers
expected that they would have been unable to distinguish evaluating acute and rehabilitation cardiothoracic
between the treatments applied to different groups. In trials interventions or fitness training on those with conditions
in which key outcomes are self-reported (eg visual affecting the cardiothoracic system. This subdiscipline
analogue scale, pain diary), the assessor is considered to be does not include studies of general fitness training among
blind if the subject was blind. patient populations. Studies of general fitness training for
healthy populations are not indexed on PEDro
Criterion 8 This criterion is only satisfied if the report
explicitly states both the number of subjects initially Continence and women’s health includes, but is not
allocated to groups and the number of subjects from whom restricted to, male and female incontinence and pre- and
key outcome measures were obtained. In trials in which post-natal interventions for the mother
outcomes are measured at several points in time, a key
outcome must have been measured in more than 85% of Ergonomics and occupational health includes, but is not
subjects at one of those points in time. restricted to, interventions based at workplaces or on
workers for work-related conditions
Criterion 9 An intention to treat analysis means that,
where subjects did not receive treatment (or the control Gerontology includes papers where the average age of the
condition) as allocated, and where measures of outcomes study sample is over 60, and papers on conditions which
were available, the analysis was performed as if subjects commonly affect older people (eg arthritis)
received the treatment (or control condition) they were Musculoskeletal includes, but is not restricted to, low back
allocated to. This criterion is satisfied, even if there is no pain, rheumatoid disease, entrapment syndromes and
mention of analysis by intention to treat, if the report neuralgia
explicitly states that all subjects received treatment or
control conditions as allocated. Neurology includes, but is not restricted to, lesions of the
central and peripheral nervous systems excluding those
Criterion 10 A between-group statistical comparison whose primary presentation is pain or paraesthesia such as
involves statistical comparison of one group with another. carpal tunnel syndrome, neuralgia or sciatica
Depending on the design of the study, this may involve
comparison of two or more treatments, or comparison of Orthopaedics includes only fractures and intervention
treatment with a control condition. The analysis may be a before or after orthopaedic surgery (eg knee replacements,
simple comparison of outcomes measured after the ligament repairs)
treatment was administered, or a comparison of the change
in one group with the change in another (when a factorial Paediatrics includes papers where the average age of the
analysis of variance has been used to analyse the data, the study sample is under 16, and papers on conditions which
latter is often reported as a group × time interaction). The commonly affect children (eg cystic fibrosis)
comparison may be in the form hypothesis testing (which Sports includes papers which specifically mention sports
provides a p value, describing the probability that the injuries as well as conditions which commonly affect sports
groups differed only by chance) or in the form of an people (eg ligament repairs)
estimate (for example, the mean or median difference, or a
difference in proportions, or number needed to treat, or a If the area for a particular paper does not fit under any of
relative risk or hazard ratio) and its confidence interval. the above categories, the code “no appropriate value in this
field” is used.
Criterion 11 A point measure is a measure of the size of
the treatment effect. The treatment effect may be described
as a difference in group outcomes, or as the outcome in