Professional Documents
Culture Documents
Ground Truth Absent (Arxiv)
Ground Truth Absent (Arxiv)
Ground Truth Absent (Arxiv)
ABSTRACT
Machine
Interpretable Machine Learning (IML) has become increas- Learning
This is a husky
(p =0 .93)
ingly important in many real-world applications, such as Process
weight
Does human satisfy
Does explanation
or comprehend
Interpretation Manner
explanation well?
General reflect the target
Properties system well?
leaf node output sequence
(decision)
Decision Tree Attention Mechanism
light
heated
deep model shallow model Figure 4: Three general properties for explanations
Mimic Learning Instance Heatmap in IML, including generalizability, fidelity and per-
suasibility. Each property essentially corresponds
Figure 3: A two-dimensional categorization for ex- to one specific aspect in evaluation. Generalizabil-
planations in IML, covering interpretation scope ity focuses on the generalization power of explana-
and interpretation manner. According to the two- tion. Fidelity focuses on the faithfulness degree of
dimensional standard, we can divide explanations explanation. Persuasibility focuses on the usefulness
into four different groups: (a) intrinsic-global; (b) degree of explanation.
intrinsic-local; (c) posthoc-global; (d) posthoc-local.
For each category, we attach a representative exam-
ple for illustration. In particular, we employ deci-
to reflect the generalization power of explanation. In real-
sion tree as the example for intrinsic-global explana-
world applications, human users employ explanation from
tions, attention mechanism for intrinsic-local ones,
IML techniques mainly to obtain insights from the target
mimic learning for posthoc-global ones, and instance
system, which naturally brings forward the demand on ex-
heatmap for posthoc-local ones.
planation generalization performance. If a set of explana-
tions is poorly generalized, it can hardly be regarded with
good quality, since the knowledge and guidance it provides
where generated attention weights can help interpret par- would be rather limited in practice. One thing to clarify is
ticular predictions by indicating the important components. that the explanation generalization mentioned here is not
Attention model is widely used in both image captioning necessarily equal to the model predictive power, unless the
and machine translation tasks. Posthoc-global explanation model itself is interpretable with self-explanations (e.g., de-
serves as the third category, and the representative exam- cision tree). By measuring the generalizability of explana-
ple can be shown with mimic learning techniques for deep tion, users can have a sense of how accurate the generated
models. As for mimic learning, the teacher usually is a deep explanations are for specific tasks.
model, while the student is typically deployed as a shallow
model that is easier to be interpreted. The overall process of Definition 1: We define the generalizability of expla-
mimic learning can be regarded as a distillation process from nation in IML as an indicator for generalization perfor-
the teacher to the student, where the interpretable student mance, regarding to the knowledge and guidance deliv-
model provides a global view in a posthoc manner for the ered by the corresponding explanation.
deep teacher model. The posthoc-local explanation fills up
the last part of the taxonomy. We introduce this category The second general property is fidelity, which is used to
with an example of instance heatmap, which is used to visu- indicate how faithful explanations are to the target system.
alize the input regions with attribution score (i.e., a quan- Faithful explanation is always preferred by human, because
tified importance indicator). Instance heatmap works well it can precisely capture the decision making process of the
for both image and text, and is capable of showing the local target system and show the correct evidences for particu-
behaviour of the target model. Since heatmap depends on lar predictions. Explanations with high quality need to be
the particular input and does not involve the specific model faithful, since they are essentially served as important tools
design, it is a typical local explanation within a posthoc way. for users to understand the target system. Without suffi-
cient fidelity, explanations can only provide limited insights
2.2 General Properties of Explanation to the system, which degrades the functionalities of IML to
human users. To guarantee the relevance of explanations,
To formally define the problem of evaluating explanations we need fidelity to conduct explanation evaluation in IML.
in IML, it is important to make clear the general properties
of explanation for evaluation. In this article, we summarize
Definition 2: We define the fidelity of explanation in
three significant properties from different perspectives, i.e.,
IML as the faithfulness degree with regard to the target
generalizability, fidelity and persuasibility, where each of the
system, aiming to measure the relevance of explanations
property corresponds to one specific aspect in evaluation.
in practical settings.
The intuitions of the properties are illustrated in Figure 4.
The first general property is generalizability, which is used The third general property is persuasibility, which reflects
IML on the second part of IML evaluation, i.e., the explanation
Evaluation evaluation, and rigorously define the problem as follows.