Professional Documents
Culture Documents
1 3DataFunSummit 实体对齐算法在电商领域当中的实践和应用 Final
1 3DataFunSummit 实体对齐算法在电商领域当中的实践和应用 Final
APIs
pipeline
P
/
Schema
/ (
L
Taxo Taxo
APPLE
APPLE
( )
iPhone 12 OLED OLED
c
a
•
•
•
•
•
•
!"# = %# , '# , (# !") = %) , ') , ()
*= +# , +) |+# ∈ %# , +) ∈ %)
candidate
!"# !")
Beijing
Panda
Capital
Live in Sichuan
China Locate in
seed
Dataset Subset Graph Triples Entities Relations Alignments
[1] Sun et al., Cross-lingual entity alignment via joint attribute-preserving embedding. ISWC 2017
[2] Sun et al., Bootstrapping entity alignment with knowledge graph embedding. IJCAI 2017
9:;
• G
•
1 1
344 = 6
ℒ rank ", )
7,8 ∈ℒ
• E@
9:< • H
DBP15k ZH-EN
Hits@1 Hits@10
A Schema
Schema
KB
B
Standard Product Unit S = Stock Keeping Unit H @ Ht
K SPU P i SKU U H
•
•
•
•
•
•
• s
• SPU SKU
•
•
•
• ×100%
• = ×100%
<100
•
• Bert
•
•
–
G
– N
seed
• CNN N
• candidate
• Bert ?
G
N
– N
–
•
1 2
u 1 2 u 1 2
• u C
• BiLSTM / BERT
• l M
• n 0+ - - E
Ar
• n L k
• P
• n 0+ e
concat
Masked
Multi-Head
• iat B H S oC Attention
MLP
• u sd
softmax
• GNN
• GNN
• •
• batch size
•
GNN
•
•
Sku1
GNN
P30
pro
•
•
Sku2
•
•
•
•
• SKU SKU GNN
•
•
• SKU
g cE
• mc c
• u A
T
• u Ii e
• r pa i c
• e u i
•
• nN N u itC
• G E b vTSN u
i E
• d iu h u
i tCs
2
. 2 1 Message Passing
#$%
(") = / ."# , .)# , 1") , + ∈ -" Message
Aggregate
#$%
!" = ' #$%
(") , + ∈ -"
#$%
." = 00 #
." , #$%
!" Update
.# /
/
.#
• M
• a M G
• > ?
<
3: LeakyReLU ;"( : Relation Embedding between node i and j
G
G
seed N
candidate
N ! "# , "% = "# − "%
()
• N
• 0 5 1 •
5 - •
(Marginal Ranking Loss)
( ( max 0, score 7"8 , 7'8 − score 7": , 7': + <
)* ,,* )- ,,-
G
seed N Neighbor similarity matrix Relation score matrix
N !"# !"$ ⋯ !"&
candidate
Row Max-pooling
!'#
Kernel pooling
!'$
G
⋯
N
N !'&
GNN embedding similarity
Column Max-pooling
• % 0 -
[1] Tang et al., BERT-INT: A BERT-based Interaction Model For Knowledge Graph Alignment. IJCAI 2020
•
• .
• U
• 1
• 5 - 3
• 3 S K
SK
Dense Attribute Classification
• B E d
-
• E m E
• d d m B I E O B I O O B I O O O O
Cascade
• da m
S da Oe
• das k B BiLSTM / BERT
Character Embedding
+ + + + + + + + + + + + + +
• o n
Word Embedding
• Is t i
• n i g Tiangang Zhu, Yue Wang, Haoran Li, Youzheng Wu, Xiaodong He, Bowen Zhou. Multimodal Joint Attribute Prediction
and Value Extraction for E-commerce Product. EMNLP2020.
b m
•
•
•
•
•
•
•
H
A K
N ST