Professional Documents
Culture Documents
Tutorial On Methods For Interpreting and Understanding Deep Neural Networks
Tutorial On Methods For Interpreting and Understanding Deep Neural Networks
Wojciech Samek
Grgoire Montavon
Klaus-Robert Mller
Alexander Binder
Sebastian Lapuschkin
Part 1: Introduction
Autonomous search-and-rescue
drones outperform humans
Interpretable
Information
ex
tra
Computing power
ct
Deep Nets / Kernel Machines / Information (implicit)
AlexNet
Clarifai
VGG
GoogleNet
ResNet
Performance
Interpretable
Data Information
for human
vs.
vs.
60 million parameters
train best
train interpretable
interpret it vs.
model model
suboptimal or biased due to
assumptions (linearity, sparsity )
Different dimensions
prediction
of interpretability Explain why a certain pattern x has
been classified in a certain way f(x).
model
What would a pattern belonging
to a certain category typically look
like according to the model.
data
Which dimensions of the data
are most relevant for the task.
right to explanation
Data Protection Regulation
Right to explanation
In medical context
Interpreting models
cheeseburger
goose
car
Interpreting models
cheeseburger
goose
car
simple regularizer
(Simonyan et al. 2013)
Interpreting models
cheeseburger
goose
car
complex regularizer
(Nguyen et al. 2016)
Explaining decisions
- why does the model arrive at a certain prediction
- verify that model behaves as expected
Explaining decisions
- why does the model arrive at a certain prediction
- verify that model behaves as expected
- Sensitivity Analysis
Theoretical interpretation
Deep Taylor Decomposition
(Montavon et al., 2017)
Part 2
Part 2