Professional Documents
Culture Documents
AASLD Clinical Practice Guidelines: A Critical Review of Scientific Evidence and Evolving Recommendations
AASLD Clinical Practice Guidelines: A Critical Review of Scientific Evidence and Evolving Recommendations
Abbreviations: AASLD, American Association for the Study of Liver Diseases; ACC, American College of Cardiology; AHA, American Heart Association;
AHRQ, Agency for Healthcare Research and Quality; AIH, autoimmune hepatitis; ALF, acute liver failure; GRADE, Grading of Recommendation Assessment,
Development, and Evaluation; HBV, hepatitis B virus; HCV, hepatitis C virus; HCC, hepatocellular carcinoma; IOM, Institute of Medicine; NAFLD, nonalcoholic fatty liver disease; PBC, primary biliary cirrhosis; PSC, primary sclerosing cholangitis; TIPS, transjugular intrahepatic portosystemic shunt.
From the 1Liver Diseases Branch, National Institute of Diabetes & Digestive & Kidney Diseases, National Institutes of Health, Bethesda, MD; 2Office of the
Director, National Institute of Diabetes & Digestive & Kidney Diseases, National Institutes of Health, Bethesda, MD; 3Division of Gastroenterology &
Hepatology, Mayo Clinic, Rochester, MN.
Received January 18, 2013; accepted June 5, 2013.
Supported by the Intramural Research Programs of the NIDDK, NIH.
2142
KOH ET AL.
2143
versions for each topic was also evaluated.18,24-34 Current AASLD guidelines are defined as the most
recently published document on a specific topic which
is posted on the AASLD website as of August 1, 2012
(http://www.aasld.org). For this investigation, only
complete clinical practice guidelines and position
papers were evaluated, thus focused updates were not
included.
Evaluation of Methodological Rigor and Transparency. To evaluate the evolutionary process of guideline
development and quality of reporting, the Appraisal of
Guidelines for Research and Evaluation II (AGREE II)
instrument was used on all comparable guidelines and
position papers.35 The AGREE II has been widely
used in the assessment of methodological rigor and
transparency of guideline development and has been
cited for its validity and reliability. Briefly, this tool
that evaluates 23 items organized into six domains
(scope and purpose, stakeholder involvement, rigor of
development, clarity of presentation, applicability, and
editorial independence) followed by two global rating
items (overall assessment) and includes a user manual
that provides guidance on rating of each item. The
scope and purpose domain evaluates the specific health
questions covered by the guideline, target population,
and the overall objective of the guideline. The stakeholder involvement domain evaluates the appropriateness of the guideline development group and its
representation of the views of its intended users. The
rigor of development domain evaluates the systemic
methodology used to gather and synthesize evidence,
methods of recommendation formulation, and the
mechanisms to update them. The clarity of presentation domain evaluates the overall structure, format,
and language of the guideline. The applicability
domain evaluates barriers, facilitators, and ease of
implementation and resource implications of guideline
application. Finally, the editorial independence domain
evaluates the extent to which external influences or
competing interests may have affected the specific
guideline.
For this study, three appraisers conducted the assessment (C.K., S.S., N.S.) after using the online training
tools recommended by the AGREE collaboration.
Address reprint requests to: Christopher Koh, M.D. M.H.Sc., Liver Diseases Branch, National Institute of Diabetes & Digestive & Kidney Diseases, National
Institutes of Health, Bldg. 10, Rm. 9B-16, 10 Center Dr., MSC 1800, Bethesda, MD 20892-1800. E-mail: Christopher.koh@nih.gov; Fax: 301-402-0491.
C 2013 by the American Association for the Study of Liver Diseases.
Copyright V
View this article online at wileyonlinelibrary.com.
DOI 10.1002/hep.26578
Potential conflict of interest: Nothing to report.
Additional Supporting Information may be found in the online version of this article.
2144
KOH ET AL.
After guideline evaluation, domain scores were calculated (as per the AGREE II manual) by summing all
individual scores in each domain and then scaling the
total as a percentage of the maximum possible score
for a given domain according to the formula:
Obtained score-Minimum Possible Score
3100
Maximum Possible Score-Minimum Possible Score
1998-2003
I 5 Evidence from multiple well-designed randomized controlled trials, each
involving a number of participants to be of sufficient statistical power
II 5 Evidence from at least one large well-designed clinical trial with or
without randomization, from cohort or case-control analytic studies,
or well-designed meta-analysis
III 5 Evidence based on clinical experience, descriptive studies or reports of
expert committees
IV 5 Not rated
2004-2010
I 5 Randomized controlled trials
II-1 5 Controlled trials without randomization
II-2 5 Cohort or case-control analytic studies
II-3 5 Multiple time series, dramatic uncontrolled experiments
III 5 Opinion of respected authorities, descriptive epidemiology
2007-2010
A 5 Data derived from multiple randomized clinical trials or meta-analysis
B 5 Data derived from a single randomized trial, or nonrandomized studies
C 5 Only consensus opinion of experts, case studies or standard-of-care
2010-Present
High (A) 5 Further research is unlikely to change confidence in the estimate of
the clinical effect
Moderate (B) 5 Further research may change confidence in the estimate of the
clinical effect.
Low (C) 5 Further research is very likely to impact confidence on the estimate
of clinical effect.
Class of Recommendations
1998-2000
A 5 Survival benefit
B 5 Improved diagnosis
C 5 Improvement in quality of life
D 5 Relevant pathophysiologic parameters improved
E 5 Impacts cost of health care
2007-2010
I 5 Conditions for where there is evidence and/or general agreement that a
given diagnostic evaluation, procedure or treatment is beneficial, useful and
effective
II 5 Conditions for which there is conflicting evidence and/or divergence of
opinion about the usefulness/efficacy of a diagnostic evaluation, procedure,
or treatment
Ila 5 Weight of evidence/opinion is in favor of usefulness/efficacy
Ilb 5 Usefulness/efficacy is less well established by evidence/opinion
III 5 Conditions for which there is evidence and/or general agreement that a
diagnostic evaluation/procedure/treatment is not useful/effective and in
some cases may be harmful
2011-Present
Strong (1) 5 Factors influencing the strength of the recommendation included
the quality of evidence, presumed patient-important outcomes, and cost
Weak (2) 5 Variability in preferences and values, or more uncertainty.
Recommendation is made with less certainty, or higher cost or resource
consumption
Results
Historical Guideline Summary. From January
1998 to August 1, 2012, the AASLD issued 28 clinical
practice guidelines on 17 topics, yielding a total of
991 recommendations. When examining the initial
publication for each AASLD guideline topic, a total of
512 recommendations were issued. The three guidelines with the greatest number of recommendations
include Vascular Disorders of the Liver (64), Hepatitis
C (HCV) (49), and the Diagnosis and Management
of Nonalcoholic Fatty Liver Disease (NAFLD) (45).
Of these 512 recommendations, 14% were grade I recommendations, 40% were grade II, and 46% were
grade III (Table 2). Regarding the types of recommendations, 14% were Feature of Disease recommendations, 28% were Diagnostic Recommendations, and
58% were Treatment Recommendations (Supporting
Table 1).
Current Guideline Summary. As of August 1,
2012, 17 AASLD guidelines were published or
updated between 2005-2012, with a total of 699 recommendations identified. The greatest number of rec-
KOH ET AL.
2145
2146
KOH ET AL.
Table 2. Data Summary of the First Issued Guidelines Evaluated by Grade and the Current Guidelines Evaluated by Grade
First Issued Guideline Evaluation By Grade
Guideline
AIH
ALF
Ascites
HBV
HCC
HCV
Hemochromatosis
PBC
TIPS
Transplantation
Wilson Disease
Vascular
Varices
PSC
Biopsy
Alcoholic Liver
NAFLD
Summary
% median
% quartile 1
% quartile 3
Year
Number of Recommendations
2002
2005
1998
2001
2005
2004
2001
2000
2005
2000
2003
2009
2007
2010
2009
2010
2012
23
44
22
27
21
49
12
22
26
25
21
64
25
36
34
16
45
512
Grade I
Number
Grade II
Number
Grade III
Number
4 (17%)
7(16%)
8 (36%)
6 (22%)
4 (19%)
9 (18%)
0 (0%)
2 (9%)
2 (7%)
0 (0%)
0 (0%)
2 (3%)
10 (40%)
6 (17%)
0 (0%)
6 (37%)
6 (13%)
72 (14%)
15.9%
3.13%
19.05%
6 (26%)
7(16%)
6 (27%)
13 (48%)
12 (57%)
15(31%)
12 (100%)
1 (4%)
18 (69%)
1 (4%)
8 (38%)
30 (47%)
6 (24%)
22 (61%)
7 (20%)
6 (37%)
34 (76%)
204 (40%)
37.5%
24.00%
57.14%
13 (56%)
30 (68%)
8 (36%)
8 (30%)
5 (24%)
25(51%)
0 (0%)
19 (86%)
6 (23%)
24 (96%)
13 (62%)
32 (50%)
9 (36%)
8 (22%)
27 (80%)
4 (25%)
5 (11%)
236 (46%)
36.4%
23.81%
61.90%
Grade I
Number
Grade II
Number
Grade III
Number
Number
5 (10%)
6 (37%)
7 (15%)
6 (20%)
0 (0%)
29 (30%)
5 (23%)
15 (21%)
4 (23%)
5 (33%)
6 (17%)
5 (18%)
0 (0%)
10 (40%)
2 (3%)
1 (4%)
6(13%)
112 (16%)
17.9%
8.59%
23.53%
Number
6 (12%)
6 (37%)
7 (15%)
13 (43%)
7 (21%)
38(39%)
11 (50%)
31 (44%)
10 (59%)
5 (33%)
22 (61%)
17 (61%)
46 (59%)
6 (24%)
30 (47%)
16 (70%)
34(76%)
305 (44%)
44.3%
33.33%
58.97%
Number
39 (78%)
4 (25%)
34 (71%)
11 (37%)
27 (79%)
31(32%)
6 (27%)
24 (34%)
3 (18%)
5 (33%)
8 (22%)
6 (21%)
32 (41%)
9 (36%)
32 (50%)
6 (26%)
6(13%)
283 (40%)
33.3%
25.00%
41.03%
Guideline
Year
AI H
Alcoholic Liver
ALF
Ascites
Biopsy
HBV
HCC
HCV
Hemochromatosis
PBC
PSC
TIPS
Transplantation
Varices
Vascular
Wilson Disease
NAFLD
Summary
% median
% quartile 1
% quartile 3
2010
2010
2011
2009
2009
2009
2010
2009
2011
2009
2010
2009
2005
2007
2009
2008
2012
Number of Recommedations
50
16
48
30
34
98
22
70
17
15
36
28
78
25
64
23
45
699
KOH ET AL.
2147
AIH (2002)
AIH (2010)
Percentage Change
ALF (2005)
ALF (2011)
Percentage Change
Ascites (1998)
Ascites (2008)
Percentage Change
HBV (2001)
HBV (2009)
Percentage Change
HCC (2005)
HCC (2010)
Percentage Change
HCV (2004)
HCV (2009)
Percentage Change
Hemochromatosis (2001)
Hemochromatosis (2011)
Percentage Change
PBC (2000)
PBC (2009)
Percentage Change
TIPS (2005)
TIPS (2009)
Percentage Change
Transplant (2000)
Transplant (2005)
Percentage Change
Wilson (2003)
Wilson (2008)
Percentage Change
Domain 1.
Scope
and Purpose
%
Domain 2.
Stakeholder
Involvement
%
Domain 3.
Rigour of
Development
Domain 4.
Clarity of
Presentation
%
Domain 5.
Applicability
%
Domain 6.
Editorial
Independence
%
Overall
Guideline
Assessment
%
74
76
2
83
83
0
76
80
4
87
100
13
83
85
2
85
85
0
70
80
9
70
81
11
81
83
2
83
89
6
81
85
4
54
59
6
56
56
0
56
57
2
59
59
0
56
59
4
57
57
0
56
57
2
50
56
6
63
63
0
44
59
15
46
48
2
58
60
3
56
56
1
63
65
2
59
72
13
63
63
0
64
69
5
59
67
8
56
63
7
59
69
10
54
60
6
60
63
2
81
85
4
81
85
4
80
85
6
80
100
20
85
85
0
85
85
0
74
85
11
67
85
19
81
85
4
70
81
11
74
81
7
58
65
7
50
58
8
67
69
3
67
79
13
75
78
3
71
76
6
72
72
0
61
67
6
76
76
0
56
67
11
61
64
3
6
39
33
25
50
25
11
42
31
11
64
53
50
50
0
42
53
11
22
47
25
11
64
53
44
56
11
17
56
39
22
61
39
56
67
11
72
72
0
72
83
11
72
100
28
72
89
17
89
89
0
72
83
11
56
89
33
72
83
11
50
83
33
72
83
11
were eligible for comparison. The average time elapsing from initial publication to the current version of
the guidelines was 7.2 years (range, 5-11 years). In
these 11 topics, the overall number of recommendations increased by 124% (from 292 to 654 recommendations). All of the guideline topics had an increase in
the number of recommendations over time except for
PBC, which had a 47% decrease (Table 4). The three
guidelines with the greatest increase in the number of
recommendations were HBV (171, 263%), Liver
Transplantation (153, 212%), and AIH (127,
117%).
In evaluating individual guideline topics for the
greatest change in number of grade I recommendations, the HBV guideline had the greatest increase
(123, 383%) followed by HCV (16, 67%) increase
(Table 4). In contrast, the Management of Adult
Patients with Ascites due to Cirrhosis guideline
had a 25% decrease in the number of grade I
recommendations.
2148
KOH ET AL.
Guideline
AIH
ALF
Ascites
HBV
HCC
HCV
Hemochromatosis
PBC
TIPS
Transplantation
Wilson Disease
change in #
change in %
median
change in %
quartile 1
change in %
Quatile 3
Year
Number of
Recommendations
Grade I #
(Change)
Grade
I%
2002
2010
2005
2011
1998
2009
2001
2009
2005
2010
2004
2009
2001
2011
2000
2009
2005
2009
2000
2005
2003
2008
23
50
44
48
22
30
27
98
21
22
49
70
12
17
22
15
26
28
25
78
21
23
4
5 (125%)
7
7 (0%)
8
6 (225%)
6
29 (1383%)
4
5 (25%)
9
15 (67%)
0
4
2
5 (1150%)
2
5 (1150%)
0
0 (0%)
0
1 (1100%)
40
46%
17%
10%
16%
15%
36%
20%
22%
30%
19%
23%
18%
21%
0%
24%
9%
33%
8%
18%
0%
0%
0%
4%
Grade I %
change %
Grade II #
(Change)
Grade
II %
26%
12%
16%
15%
27%
43%
48%
39%
57%
50%
31%
44%
100%
59%
5%
33%
69%
61%
4%
59%
38%
70%
18%
6
6 (0%)
7
7 (0%)
6
13 (1117%)
13
38 (1192%)
12
11 (28%)
15
31 (1107%)
12
10 (217%)
1
5 (1400%)
18
17 (26%)
1
46 (14500%)
8
16 (1100%)
101
53%
19%
217%
150%
58%
243%
28%
245%
33%
19%
17%
267%
132%
Grade II %
change%
Grade III #
(Change)
Grade
III %
57%
78%
68%
71%
36%
37%
30%
32%
24%
27%
51%
34%
0%
18%
86%
33%
23%
21%
96%
41%
62%
26%
28%
13
39 (1200%)
30
34 (113%)
8
11 (137%)
8
31 (1287%)
5
6 (120%)
25
24 (24%)
0
3
19
5 (274%)
6
6 (0%)
24
32 (133%)
13
6 (254%)
46
17%
21%
216%
23%
251%
136%
71%
36%
6%
254%
28%
59%
219%
213%
45%
241%
633%
212%
1374%
83%
Grade III %
change %
38%
4%
1%
7%
15%
233%
261%
27%
257%
258%
23%
KOH ET AL.
2149
Guideline
AIH
Alcoholic Liver
Ascites
Biopsy
HCV
PBC
Varices
Vascular
Wilson Disease
summary
% median
% quartile 1
% quartile 3
Year
2010
2010
2009
2009
2009
2009
2007
2009
2008
Number of
Recommendations
50
16
30
34
70
15
25
64
23
327
Class I Number
Class II Number
14 (28%)
13 (81%)
15 (50%)
30 (88%)
34 (49%)
14 (93%)
19 (76%)
53 (83%)
22 (96%)
214 (65%)
81%
50%
88%
0 (0%)
0 (0%)
0 (0%)
0 (0%)
1 (1%)
0 (0%)
0 (0%)
0 (0%)
0 (0%)
1 (0.3%)
0%
0%
0%
31 (62%)
0 (0%)
10 (33%)
2 (6%)
25 (36%)
1 (7%)
3 (12%)
3 (5%)
0 (0%)
75 (23%)
7%
5%
33%
1 (2%)
0 (0%)
2 (7%)
2 (6%)
5 (7%)
0 (0%)
0 (0%)
2 (3%)
1 (4%)
13 (4%)
3%
0%
6%
4 (8%)
3 (20%)
3 (10%)
0 (0%)
5 (7%)
0 (0%)
3 (12%)
6 (10%)
0 (0%)
24 (7%)
8%
0%
10%
Year
Hemochromatosis
PSC
NAFLD
2011
2010
2012
Number of Recommendations
Class 1 Number
Class 2 Number
17
36
45
15 (88%)
31 (86%)
39 (87%)
2 (12%)
5 (14%)
6 (13%)
(Table 5). In the ACC/AHA system, 327 recommendations were issued, with 214 (65.4%) designated as class
I recommendations suggesting evidence and/or general
agreement that a given diagnostic evaluation, procedure,
or treatment is beneficial, useful, and effective (Table 5).
In evaluating the classes of evidence system based on
types of recommendations, 64% were treatment recommendations, 23% were diagnostic recommendations,
and 13% were features of disease recommendations
(Supporting Table 4). In the GRADE classes of evidence
system, a total of 98 recommendations were provided
and 89% of the recommendations were designated class
I recommendations (Supporting Table 4).
Discussion
The AASLD clinical practice guidelines provide a
set of recommendations for guidance in managing
patients with acute and chronic liver disease. Since
1998, these guidelines have provided an additional
36% increase in the overall number of recommendations from the initial development of specific guidelines. However, despite this substantial increase, less
than 15% of all recommendations are categorized as
grade I, suggesting that the evidence used to develop
most recommendations does not come from randomized controlled trials, for a variety of reasons. Therefore, areas with insufficient evidence where
randomized trials can be conducted to improve the
evidence base should be identified for development.
2150
KOH ET AL.
recommendations and perhaps stronger recommendations. However, regardless of the type of evidence, the
quality of future clinical practice guidelines can be further improved, as identified by domains evaluated in
the AGREE II instrument.
Study Limitations. The current analysis does not
account for changes over time regarding the aims and
practices of AASLD practice guideline development
program, whereby the numbers of recommendations
and distribution across classes may have been influenced. Given the lengthy time span, turnover of writing groups, and the use of several grading systems in
these guidelines, there may have been unanticipated
changes in definitions, standards, and thresholds in the
determination of grades of recommendations that were
not easily measurable. Additionally, the sporadic use of
class systems and significant changes between systems
prohibited a comprehensive class comparison. With
the adoption of the current GRADE system for recent
and future guideline updates by the AASLD, the deficiencies in assessing quality of evidence and strength of
recommendations will hopefully be alleviated.
In conclusion, the evolution of the AASLD practice
guidelines is featured by a substantial increase in the
overall number of recommendations to assist healthcare providers in the management of patients with
liver disease. With the exception of practice guidelines
focused on chronic viral hepatitis (HBV and HCV),
the bulk of evidence for these recommendations still
derive from observational studies or expert consensus
opinions. Ideally, the basis of medical practice should
be as evidence-based as possible and we should aim to
perform the highest quality research to answer clinical
dilemmas whenever feasible. Nonetheless, guideline
development should continually strive to generate recommendations with the highest quality of evidence
possible while minimizing the effect of bias from
extrinsic sources. Additionally, guideline developers
should continue to strive to produce highest-quality
documents with compelling methodological rigor and
transparency. Whenever possible, clinical practice
guidelines should highlight the need for additional
research agenda to fill gaps within clinical care that
have the greatest impact on patient outcomes.
References
1. Field M, Lohr K. Guidelines for clinical practice: from development to
use. Washington, DC: National Academy Press; 1992.
2. Guidelines International Network, International Guidelines Library.
Accessed January 31, 2012, from http://www.g-i-n.net/library/international-guidelines-library.
KOH ET AL.
2151
2152
KOH ET AL.
37. Gross PA, Barrett TL, Dellinger EP, Krause PJ, Martone WJ, McGowan
JE Jr, et al. Purpose of quality standards for infectious diseases. Infectious
Diseases Society of America. Clin Infect Dis 1994;18:421.
38. Shiffman RN, Shekelle P, Overhage JM, Slutsky J, Grimshaw J,
Deshpande AM. Standardized reporting of clinical practice guidelines:
a proposal from the Conference on Guideline Standardization. Ann
Intern Med 2003;139:493-498.
39. ACC/AHA Task Force on Practice Guidelines. Manual for ACC/AHA
Guideline Writing Committees: Methodologies and Policies from the
ACC/AHA Task Force on Practice Guidelines. Available at http://
www.acc.org/qualityandscience/clinical/manual/pdfs/methodology.pdf and
http://circahajournals.org/manual/. Accessed January 2012.
40. Holmer HK, Ogden LA, Burda BU, Norris SL. Quality of clinical
practice guidelines for glycemic control in type 2 diabetes mellitus.
PloS One 2013;8:e58625.
41. Acuna-Izcaray A, Sanchez-Angarita E, Plaza V, Rodrigo G, Montes de
Oca M, Gich I, et al. Quality assessment of asthma clinical practice
guidelines: a systematic appraisal. Chest 2013 [Epub ahead of print].
42. Al-Ansary LA, Tricco AC, Adi Y, Bawazeer G, Perrier L, Al_Ghonaim
M, et al. A systematic review of recent clinical practice guidelines on
the diagnosis, assessment and management of hypertension. PloS One
2013;8:e53744.