Download as pdf or txt
Download as pdf or txt
You are on page 1of 30

N

APIs

pipeline
P
/
Schema

/ (

L
Taxo Taxo

APPLE
APPLE

( )
iPhone 12 OLED OLED

Hive JoyGraph MySQL


e

c
a





!"# = %# , '# , (# !") = %) , ') , ()
*= +# , +) |+# ∈ %# , +) ∈ %)

candidate
!"# !")
Beijing
Panda
Capital

Live in Sichuan

China Locate in
seed
Dataset Subset Graph Triples Entities Relations Alignments

fr 192,191 66,858 1,379


fr-en 15,000
en 278,590 105,889 2,209

ja 164,373 65,744 2,043



DBP15k[1] ja-en 15,000
en 233,319 95,680 2,096

zh 53,929 66,469 2,830


zh-en 15,000
en 237,674 98,125 2,317

dbp 463,294 100,000 330


dbp-wd 10,000

wd 448,774 100,000 220
DWY100k[2]
dbp 428,952 100,000 302
dbp-yg 10,000
yg 502,563 100,000 31

[1] Sun et al., Cross-lingual entity alignment via joint attribute-preserving embedding. ISWC 2017
[2] Sun et al., Bootstrapping entity alignment with knowledge graph embedding. IJCAI 2017
9:;
• G

", ) ∈ ℒ| rank ", ) ≤ 2


!"#$@& =

seed candidate

1 1
344 = 6
ℒ rank ", )
7,8 ∈ℒ

• E@

9:< • H
DBP15k ZH-EN
Hits@1 Hits@10

MTransE [Chen et al., 2017] 0.308 0.614 TransE


E
IPTransE [Zhu et al., 2017] 0.406 0.735 TransE

JAPE [Sun et al., 2017] 0.412 0.745 TransE +

BootEA [Sun et al., 2018] 0.629 0.848 TransE

GCN-Align [Wang et al., 2018] 0.413 0.744 GNN

MuGNN [Cao et al., 2019] 0.494 0.844 TransE+GNN

TransEdge [Sun et al., 2019] 0.735 0.919 TransE

RDGCN [Wu et al., 2019] 0.708 0.846 GNN

HGCN [Wu et al., 2019] 0.720 0.857 GNN +

• HMAN [Yang et al., 2019] 0.871 0.987 GNN +

AliNet [Sun et al., 2020] 0.539 0.826 TransE+GNN +


• DGMC [Fey et al., 2020] 0.772 0.897 GNN

RREA [Mao et al., 2020] 0.822 0.964 GNN

EPEA [Wang et al., 2020] 0.885 0.953 GNN



RNM [Zhu et al., 2021] 0.840 0.919 GNN +
DB

A Schema

Schema
KB

B
Standard Product Unit S = Stock Keeping Unit H @ Ht
K SPU P i SKU U H






• s
• SPU SKU



• ×100%

• = ×100%
<100


• Bert



G
– N
seed
• CNN N
• candidate
• Bert ?
G
N
– N


1 2

u 1 2 u 1 2
• u C
• BiLSTM / BERT
• l M
• n 0+ - - E
Ar
• n L k
• P
• n 0+ e

concat
Masked
Multi-Head
• iat B H S oC Attention
MLP
• u sd

softmax
• GNN
• GNN
• •
• batch size

GNN


Sku1
GNN
P30
pro

Sku2




• SKU SKU GNN


• SKU
g cE

• mc c
• u A
T
• u Ii e
• r pa i c
• e u i


• nN N u itC
• G E b vTSN u
i E
• d iu h u
i tCs
2
. 2 1 Message Passing

#$%
(") = / ."# , .)# , 1") , + ∈ -" Message

Aggregate
#$%
!" = ' #$%
(") , + ∈ -"

#$%
." = 00 #
." , #$%
!" Update

.# /

.# ' ! #$% 0 . #$%


/

/
.#
• M
• a M G

• > ?

• @ Relation Aware Neighborhood Aggregation

!"#$% = ∑( )"( *"( +(#$% , - ∈ /"


)"( N A e G
i
exp 3 456 +"#$% ||+(#$%
)"( =
∑8∈/9 exp 3 456 +"#$% ||+8#$%
6 <
exp 3 4: ;"( *"( C G R
*"( = g
∑8∈/9 exp 3 6 <
4: ;"8

<
3: LeakyReLU ;"( : Relation Embedding between node i and j
G

G
seed N
candidate
N ! "# , "% = "# − "%
()

7 % (Marginal Hinge Embedding Loss)


G 4 4 6 6
N * * max 0, 1 23 , 25 −1 23 , 25 +8
N #+ ,%+ #, ,%,

• N
• 0 5 1 •
5 - •
(Marginal Ranking Loss)
( ( max 0, score 7"8 , 7'8 − score 7": , 7': + <
)* ,,* )- ,,-

G
seed N Neighbor similarity matrix Relation score matrix
N !"# !"$ ⋯ !"&
candidate

Row Max-pooling
!'#

Kernel pooling
!'$
G


N
N !'&
GNN embedding similarity
Column Max-pooling

Kernel pooling [1] average concat MLP

• % 0 -

Text embedding similarity

[1] Tang et al., BERT-INT: A BERT-based Interaction Model For Knowledge Graph Alignment. IJCAI 2020

• .

• U
• 1
• 5 - 3

• 3 S K
SK
Dense Attribute Classification
• B E d
-
• E m E
• d d m B I E O B I O O B I O O O O
Cascade

Knowledge Fusion Value Extraction

• da m
S da Oe
• das k B BiLSTM / BERT

Character Embedding
+ + + + + + + + + + + + + +

• o n
Word Embedding
• Is t i
• n i g Tiangang Zhu, Yue Wang, Haoran Li, Youzheng Wu, Xiaodong He, Bowen Zhou. Multimodal Joint Attribute Prediction
and Value Extraction for E-commerce Product. EMNLP2020.
b m






H

A K
N ST

You might also like