Professional Documents
Culture Documents
Slides Swabha
Slides Swabha
Slides Swabha
frame-semantic
role labeling
Swabha Swayamdipta
2. Primary Subtasks
a. Target Identification
b. Frame Identification
c. Frame-Element Identification
3. Advanced Modeling
After encouraging them , he told them goodbye and left for Macedonia .
Frame-Semantic Role Labeling
(frame-SRL)
Sentence → Graph
Speaker Addressee
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
• Nodes: tokens / spans in the sentence. Could represent both targets and arguments.
Frame-Semantic Role Labeling
(frame-SRL)
Sentence → Graph
Speaker Addressee
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
• Nodes: tokens / spans in the sentence. Could represent both targets and arguments.
• Node Labels: lexical units (LUs) and frames.
Frame-Semantic Role Labeling
(frame-SRL)
Sentence → Graph
Speaker Addressee
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
• Nodes: tokens / spans in the sentence. Could represent both targets and arguments.
• Node Labels: lexical units (LUs) and frames.
• Edges: Between target nodes and argument nodes
Frame-Semantic Role Labeling
(frame-SRL)
Sentence → Graph
Speaker Addressee
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
• Nodes: tokens / spans in the sentence. Could represent both targets and arguments.
• Node Labels: lexical units (LUs) and frames.
• Edges: Between target nodes and argument nodes
• Edge Labels: roles of arguments / frame-elements
Frame-Semantic Graphs:
Overlapping Nodes
After encouraging them , he told them goodbye and left for Macedonia .
tell.V
Time Speaker
TELLING Addressee Message
Frame-Semantic Graphs:
Overlapping Nodes
After encouraging them , he told them goodbye and left for Macedonia .
after.PREP encourage.V tell.V leave.V
Time Speaker
TELLING Addressee Message
STIMULATE
_EMOTION Experiencer
DEPARTING
Time Theme Goal
TIME_VECTOR
Landmark Event Event
Frame-SRL: Subtasks
Frame-SRL: Subtasks
1. Target
Identification
After encouraging them , he told them goodbye and left for Macedonia .
after.PREP encourage.V tell.V leave.V
Frame-SRL: Subtasks
1. Target 2. Frame
Identification Identification
After encouraging them , he told them goodbye and left for Macedonia .
after.PREP encourage.V tell.V leave.V
TELLING
STIMULATE
_EMOTION
DEPARTING
TIME_VECTOR
Frame-SRL: Subtasks
1. Target 2. Frame 3. Frame-Elements
Identification Identification Identification
After encouraging them , he told them goodbye and left for Macedonia .
after.PREP encourage.V tell.V leave.V
STIMULATE
Experiencer
_EMOTION
After encouraging them , he told them goodbye and left for Macedonia .
after.PREP encourage.V tell.V leave.V
STIMULATE
Experiencer
_EMOTION
1. Target
Predicate
Identification
TOP|S
VP
PP VP
S|VP VP PP
NP NP NP NP NP
After encouraging them , he told them goodbye and left for Macedonia .
A close relative:
PropBank SRL
1. Target 2. Frame
Predicate Sense
Identification Identification
TOP|S
VP
PP VP
S|VP VP PP
NP NP NP NP NP
encourage.02 tell.01 leave.04
After encouraging them , he told them goodbye and left for Macedonia .
A close relative:
PropBank SRL
TOP|S
ARGM-TMP
ARGM-TMP
VP
PP VP
ARG2
S|VP VP PP
ARG0
ARG1 ARG0 ARG1
NP NP NP NP NP
encourage.02 tell.01 leave.04
After encouraging them , he told them goodbye and left for Macedonia .
Dependency Graphs
Semantic proto-roles
After encouraging them he told them goodbye and left for Macedonia .
tell.V
• Full-text annotations
• 22 K targets
• Train (70%) / Dev (10%) / Test (20%)
Feed-forward Nets
Model architectures
• Most common approach: Supervised learning
• Linear models - most models prior to 2015
• May use distributional representations
• Non-linear / neural models
Feed-forward Nets
Model architectures
• Most common approach: Supervised learning
FrameNet / PropBank
• Linear models - most models prior to 2015 / Span-based graphs /
Dependency Graphs
• May use distributional representations
• Non-linear / neural models
Feed-forward Nets
Why automatic frame-SRL?
Why automatic frame-SRL?
• Information extraction (Surdeanu, et al., 2003)
• Textual entailment (Tatu & Moldovan, 2005; Burchardt & Frank, 2006)
• Text categorization (Moschitti, 2008)
• Question answering (Narayanan & Harabagiu, 2004; Frank, et. al., 2007;
Moschitti, et. al., 2007; Shen & Lapata, 2007)
• Machine Translation (Wu & Fung, 2009, Marchegianni et. al., 2017) and
its evaluation (Giménez & Màrquez, 2007)
2. Primary Subtasks
a. Target Identification
b. Frame Identification
c. Frame-Element Identification
3. Advanced Modeling
• after and before were removed unless their head was marked as temporal,
• into, to, and through were removed unless their head was marked as direction,
• as, for, so, and with were always removed,
• because the only sense of the word of was the frame PARTITIVE, it was removed unless it was preceded by
only, member, one, most, many, some, few, part, majority, minority, proportion, half, third, quarter, all,
or none, or it was followed by all, group, them, or us,
• all targets marked as support verbs for some other target were removed.
Johansson & Nugues (SemEval, 2007)
Das et. al. (CL 2014)
Target ID: Neural Model
76.1 79.2
67.5 70.8 73.4
67.1
SemEval 07 Test
45
22.5
0
Johansson & Nugues 07 Das et. al. 14 Swayamdipta et. al. 17
*Not directly comparable
FN 1.5 Test
Outline
1. Task of frame-SRL
2. Primary Subtasks
a. Target Identification
b. Frame Identification
c. Frame-Element Identification
3. Advanced Modeling
• Simple Classification
frame = arg max p(frame | target, LU, sentence)
frame∈LU
• When LU is ambiguous:
• Treat it as another unknown
• Learn a distribution for it (Das et. al., 2014)
Linear Frame ID Models
90
90
87.6 87.5
86.5
84.75
83.6
83
al ., 14 . a l . 17 a l . , 17 al., 15 . a l , 18 . al., 17 a l . , 18
t . e t et. et. e t et et .
Das e m a n n i
md Hermp t a a n n
ots c h e n Y a n g P en g
Hart Sw a y a B
Outline
1. Task of frame-SRL
2. Primary Subtasks
a. Target Identification
b. Frame Identification
c. Frame-Element Identification
3. Advanced Modeling
Time Speaker
TELLING Addressee Message
STIMULATE
_EMOTION Experiencer
DEPARTING
Time Theme Goal
TIME_VECTOR
Landmark Event Event
PP VP
ARG2
S|VP VP PP
ARG0
ARG1 ARG0 ARG1
NP NP NP NP NP
encourage.02 tell.01 leave.04
After encouraging them , he told them goodbye and left for Macedonia .
• Primary difference between PropBank SRL and Frame SRL arg ID:
‣ PropBank role labels (ARG0 - ARGn) are uniform across
predicates.
• Arguments as spans
• Arguments as sequences
Role of syntax
• Traditional feature
design (Das et. al. 2014)
TOP|S
Semantics
PropBank
ARGM-TMP
VP
Syntax
ARGM-TMP
argument identification PP VP
ARG2
(Das et. al., 2014) S|VP ARG0
VP PP
ARG1 ARG0 ARG2 ARG1
NP NP NP NP NP
• Continuous valued
encourage.02 tell.01 leave.04
After encouraging them , he told them goodbye and left for Macedonia
after.PREP encourage.V tell.V leave.V
DEPARTING
Time Theme Goal
• Constraints during
TIME_VECTOR
Landmark Event Event
• Span classification
task
• Candidate spans
pruned by
syntactic rules
• Features rely
heavily on syntax
• BIO specification
• Deep bidirectional,
highway
(Zhou & Xu,2015;
He et. al. ACL 2017)
• Transformers
(Tan et. al., AAAI
2018; Strubell et. al.,
2018)
He et. al. (2017)
Frame + Arg ID: Evaluation
Given GOLD targets!
90
Precision Recall F1
82.5
80.4
78.8
77.4
76.6
75
75 74.5 74.7
60
a l. , 14 l . , 1 5 a l . , 17 a l . , 17 a l ., 18
t. t. a et. et. et.
Das e er a ld e
md i p t a Y a n g Pe n g
Fitzg Sw ay a
End-to-end
Frame SRL evaluation
Summary of Part 2
1. Task of frame-SRL
2. Primary Subtasks
a. Linear
b. Neural models
3. Advanced Modeling
• Availability of data
Biggest Challenge
• Availability of data
• Part 3: Make the best of available supervision through
advanced modeling
‣ Joint prediction
‣ Multitask learning
Biggest Challenge
• Availability of data
• Part 3: Make the best of available supervision through
advanced modeling
‣ Joint prediction
‣ Multitask learning
• Part 4: Enhance the amount of supervision
Getting the most out of
data
• Frame-semantic parsing with heterogenous annotations
(Kshirsagar et. al., 2015)
• Frame Hierarchy
• Exemplar annotations
• PropBank
• Relatively unused: Grammatical Functions and Phrase
Types of frame-elements
• Frames + frame-elements:
• Approximate prediction with AD3 (Yang et. al., 2017; Peng et. al.,
2018)
Joint prediction
• Pipelining the subtasks is lossy
‣ Targets, frames, frame-elements are mutually informative!
• All frame-elements together:
• Reranking with global features (Toutanova et. al. 2008)
• Approximate prediction with global constraints (AD3 : Das et. al.
2012)
• Frames + frame-elements:
• Approximate prediction with AD3 (Yang et. al., 2017; Peng et. al.,
2018)
VP
ARGM-TMP
PP VP
S|VP VP PP
ARG0 ARG2 ARG1
NP NP NP NP NP
tell.01
After encouraging them , he told them goodbye and left for Macedonia .
Syntactic Scaffolding
TOP|S
VP
ARGM-TMP
PP VP
S|VP VP PP
ARG0 ARG2 ARG1
NP NP NP NP NP
tell.01
After encouraging them , he told them goodbye and left for Macedonia .
VP
ARGM-TMP
PP VP
S|VP VP PP
ARG0 ARG2 ARG1
NP NP NP NP NP
tell.01
After encouraging them , he told them goodbye and left for Macedonia .
VP
ARGM-TMP
PP VP
S|VP VP PP
ARG0 ARG2 ARG1
NP NP NP NP NP
tell.01
After encouraging them , he told them goodbye and left for Macedonia .
75
• Instead of learning entire Precision
F1
Recall
after training. 55
Swayamdipta et. al. (EMNLP, 2018) t . a l . 15 t. al. 17 t . a l . 18
s a g a r e Yang e di pt ae
Kshir S w a y am
PropBank + Frame-SRL
• Significant TOP|S
Semantics
PropBank
ARGM-TMP
overlaps between VP
Syntax
ARGM-TMP
PP VP
formalisms S|VP ARG0
VP
ARG2
PP
ARG1 ARG0 ARG2 ARG1
NP NP NP NP NP
• PropBank much
encourage.02 tell.01 leave.04
After encouraging them , he told them goodbye and left for Macedonia
after.PREP encourage.V tell.V leave.V
_EMOTION Experiencer
performance TIME_VECTOR
Landmark Event Event
• Disjoint formalisms :
span-based vs.
dependency-based.
• Treat semantic
dependencies as
“latent” when Peng et. al. (2018)
predicting frame-
elements.
Alternatives to supervised learning
Unsupervised approaches
1. Task of frame-SRL
2. Primary Subtasks
a. Linear
b. Neural models
3. Advanced Modeling
• Domain Adaptation
• Distributional methods (Croce et. al., 2010)
• FrameID (Hartmann et. al. EACL 2017)
One glaring gap
One glaring gap
Multilingual SRL:
Generating Annotations via Projection
Multilingual SRL:
Generating Annotations via Projection
• Challenge: Differences in
annotation schemes across
languages.