+ All Categories
Home > Documents > Multilingual Knowledge Graph Embeddings for Cross-lingual...

Multilingual Knowledge Graph Embeddings for Cross-lingual...

Date post: 12-Jul-2020
Category:
Upload: others
View: 14 times
Download: 0 times
Share this document with a friend
20
Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment Muhao Chen 1 , Yingtao Tian 2 , Mohan Yang 1 , and Carlo Zaniolo 1 University of California, Los Angeles 1 Stony Brook University 2
Transcript
Page 1: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Multilingual Knowledge Graph Embeddingsfor Cross-lingual Knowledge Alignment

Muhao Chen1, Yingtao Tian2, Mohan Yang1, and Carlo Zaniolo1

University of California, Los Angeles1

Stony Brook University2

Page 2: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Outline

• Background

• MTransE—A multilingual knowledge graph embedding model

• Evaluation

• Open Challenges and Future Work

Page 3: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Knowledge Graphs

• Symbolic representation of entities and relationsMonolingual knowledge: triples (relation facts of entities)

Cross-lingual knowledge: alignment of monolingual knowledge across languages

(California, capital city, Sacramento)

(カリフォルニア, 首都,サクラメント)

Page 4: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Knowledge Graph Embeddings

• Encode entities as vectors

Bach

Male

Germany

Eisenach

Knowledge Graph

Encode

Embeddings

Enable

Relational inferences as vector algebra– France – Paris ≈ capital

– US – USD ≈ currency

– Bach – German ≈ nationality– …

Applications • KG Completion• Relation extraction from text • Question answering

Capture

Semantic similarity of entities

Paris (0.036, -0.12, ..., 0.323)capital (0.102, 0.671, …, -0.101)France (0.138, 0.551, …, 0.222)…

Page 5: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Current KG Embedding Approaches

TransE: h+r≈t

• Focused on embedding monolingual triples (h, r, t)

Later approaches– TransH [Wang et al. 2014]

– TransR [Lin et al. 2015]

– TransD [Ji et al. 2015]

– HolE [Nickle et al. 2016]

– ComplEx [Trouillon et al. 2016]

– …

Embedding of monolingualknowledge seems to be

well-addressed.

What about cross-lingual knowledge?

Page 6: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Emerging challenge

• Existing works do not characterize cross-lingual knowledge– Entity inter-lingual links (ILLs): (ambulance --- krankenwagen)

– Triple-wise alignment (TWA): ((State of California, capital city, Sacramento) --- (カリフォルニア, 首都,サクラメント))

– Many KGs store such knowledge

Why important?• Enables multilingual

semantic representations• Benefits cross-lingual NLP

– Knowledge alignment– Machine translation– Cross-lingual Q&A– …

Difficult to characterize:• Fewer samples: Cross-lingual knowledge currently

accounts for a small portion of each KB• Larger domains: Cross-lingual knowledge applies on the

entire spaces of involved languages• Incoherence: Language-specific versions of KG are

usually incoherent• Heterogeneity: Applies to both entities and

monolingual relations with inconsistent vocabularies

Page 7: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

What does MTransE use and enable?

• Corpora: (partially-aligned) multilingual KGs• Enabling: inferable embeddings of

multilingual semantics• Can be applied to:

– Knowledge alignment– Cross-lingual Q&A– Multilingual chat-bots– …

France Capital Paris +

フラ

ンス首都 パリ+

Page 8: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

MTransE Model Components

• Knowledge model

• Alignment model

• Objective of learning– Minimizing 𝐽(𝜃) = 𝑆𝐾 + 𝛼𝑆𝐴

𝑆𝐾 =

𝐿∈{𝐿𝑖,𝐿𝑗}

𝑇∈𝐺𝐿

||𝐡 + 𝐫 − 𝐭||

(h, r, t)(h , r , t )

Space L1

Space L2

Alignment model

Knowledge model

𝑆𝐴 =

𝑇,𝑇′ ∈𝛿(𝐿𝑖,𝐿𝑗)

𝑆𝑎(𝑇, 𝑇′)

All aligned triples

Page 9: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Different alignment techniques

Space Li

Space Lj

Space Li Space Lj

Translate

Translate

Translate

Space Li Space Lj

Transformations

Mij

Axis calibration• Cross-lingual counterparts

have close embeddings

Translation vectors• Encoding cross-lingual

transitions just like monolingual relations

Linear Transformations• Transformations across

embedding spaces of different languages

Page 10: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Alignment Scores and Five Model Variants

• Vari combines the ith alignment model with the knowledge model

Variant Alignment Score Remark

Var1 𝑆𝑎1 = 𝒉 − 𝒉′ + 𝒕 − 𝒕′

Var2 𝑆𝑎2 = 𝒉 − 𝒉′ + 𝒓 − 𝒓′ + 𝒕 − 𝒕′

Var3 𝑆𝑎3 = 𝒉 + 𝒗𝒊𝒋𝒆 − 𝒉′ + 𝒓 + 𝒗𝒊𝒋

𝒓 − 𝒓′

+ 𝒕 + 𝒗𝒊𝒋𝒆 − 𝒕′

𝒗𝒊𝒋𝒆 =−𝒗𝒋𝒊

𝒆 , 𝒗𝒊𝒋𝒓 =−𝒗𝒋𝒊

𝒓

Var4 𝑆𝑎4 = 𝑴𝑖𝑗𝑒 𝒉 − 𝒉′ + 𝑴𝑖𝑗

𝑒 𝒕 − 𝒕′ 𝑴𝑖𝑗𝑒 ∈ ℝ𝒌×𝒌, 𝑴𝑖𝑗

𝑟 ∈ ℝ𝒌×𝒌

Var5 𝑆𝑎5 = 𝑴𝑖𝑗𝑒 𝒉 − 𝒉′ + 𝑴𝑖𝑗

𝑟 𝒓 − 𝒓′

+ 𝑴𝑖𝑗𝑒 𝒕 − 𝒕′

Axis Calibration

Linear Transforms

Translation Vector

Page 11: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Experimental Evaluation

• Cross-lingual knowledge alignment tasks– Entity Matching– Triple-wise Alignment (TWA) Verification

• Monolingual relation extraction task

• Trilingual data sets– Wiki-based (WK3l-15k, WK3l-120k)– ConceptNet-based (CN3l)

• Baselines– LM [Mikolov et al. 2013] + Knowledge models – CCA [Faruqui et al. 2014] + Knowledge models– OT [Xing et al. 2015] + Knowledge models

These three data sets are available at https://github.com/muhaochen/MTransE

Page 12: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Entity Matching

• Evaluation protocol– For each (e, e’), rank e’ in the neighborhood of 𝜏 𝒆

• Training sets– Pairs of language-specific graphs and corresponding alignment sets

• Test data– Entity Inter-lingual links {(e, e’)} (Unidirectional)

What is the German entity for the English entity “Regulation of Property”?

Page 13: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Entity Matching

0

20

40

60

80

100

Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on WK3l-15k

LM CCA OT Var1 Var2 Var3 Var4 Var5

0

20

40

60

80

100

Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on WK3l-120k

LM CCA OT Var1 Var2 Var3 Var4 Var5

1

10

100

1000

10000

Mean/En-Fr Mean/Fr-En Mean/En-De Mean/De-En

Mean on WK3l-15k

LM CCA OT Var1 Var2 Var3 Var4 Var5

1

10

100

1000

10000

Mean/En-Fr Mean/Fr-En Mean/En-De Mean/De-En

Mean on CN3l

LM CCA OT Var1 Var2 Var3 Var4 Var5

Var4≈Var5>Var1≈Var3≈OT>Var2≫CCA>LM

Axis Calibration Var1, Var2

Trans. Vectors Var3

Linear Transforms Var4, Var5

0

20

40

60

80

100

Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on CN3l

LM CCA OT Var1 Var2 Var3 Var4 Var5

Page 14: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Triple-wise Alignment Verification

Var4≈Var5>Var1>Var2>Var3≈OT≫CCA>LM

We receive similar evaluation conclusions in all settings.

Axis Calibration Var1, Var2

Trans. Vectors Var3

Linear Transforms Var4, Var5

0

10

20

30

40

50

60

70

80

90

100

Accuracy of TWA Verification

LM CCA OT Var1 Var2 Var3 Var4 Var5

Page 15: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Monolingual Relation Extraction (English, French)

• Train/Test– Train Sets: 90% triples and

intersecting alignment sets– Test Sets: 10% triples

• MTransE preserves well the monolingual relations

0

5

10

15

20

25

30

35

40

45

WK3l-15k/EN WK3l-15k/FR WK3l-120k/EN WK3l-120k/FR

Predicting Missing Tails (Hits@10)

TransE Var1 Var2 Var3 Var4 Var5

0

10

20

30

40

50

60

70

80

WK3l-15k/EN WK3l-15k/FR WK3l-120k/EN WK3l-120k/FR

Predicting Missing Relations (Hits@10)

TransE Var1 Var2 Var3 Var4 Var5

Axis Calibration Var1, Var2

Trans. Vectors Var3

Linear Transforms Var4, Var5

Page 16: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Applications based on MTransE

• Multilingual Q&A

• Cross-lingual relation prediction

• Improving monolingual KG completion using multilingual correlation

• Knowledge alignment across knowledge bases

Page 17: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Examples of Cross-lingual Question Answering

Bold-faced ones are correct answers, italic ones are close answers.

Page 18: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Improve the embedding model

• Other forms of knowledge models and alignment models– Neural knowledge models such as HolE and ComplEx– Other alignment models such as affine transformations– Alignment models which consider disambiguation

• Encoding more information from multilingual KGs– Entity domains, class templates, entity descriptions, etc– Cross-lingual disambiguation

• Jointly embedding with other forms of corpora such as multilingual documents

Page 19: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

References

• [Bordes et al., 2013] Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, andOksana Yakhnenko. Translating embeddings for modeling multi-relational data. In NIPS, pages2787–2795, 2013.

• [Nickel et al., 2016] Maximilian Nickel, Lorenzo Rosasco, Tomaso Poggio, et al. Holographicembeddings of knowledge graphs. In AAAI, 2016.

• [Saxe et al., 2014] Andrew M Saxe, James L McClelland, and Surya Ganguli. Exact solutions to thenonlinear dynamics of learning in deep linear neural networks. ICLR, 2014.

• [Wang et al., 2014] Zhen Wang, Jianwen Zhang, Jianlin Feng, and Zheng Chen. Knowledge graphembedding by translating on hyperplanes. In AAAI, 2014.

• [Lin et al., 2015] Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, and Xuan Zhu. Learning entityand relation embeddings for knowledge graph completion. In AAAI, 2015.

• [Ji et al., 2015] Guoliang Ji, Shizhu He, Liheng Xu, Kang Liu, and Jun Zhao. Knowledge graphembedding via dynamic mapping matrix. In ACL, pages 687–696, 2015.

• [Mikolov et al., 2013] Tomas Mikolov, Quoc V Le, and Ilya Sutskever. Exploiting similarities amonglanguages for machine translation. arXiv, 2013.

• [Faruqui and Dyer, 2014] Manaal Faruqui and Chris Dyer. Improving vector space wordrepresentations using multilingual correlation. EACL, 2014.

• [Xing et al., 2015] Chao Xing, Dong Wang, Chao Liu, and Yiye Lin. Normalized word embeddingand orthogonal transform for bilingual word translation. In NAACL HLT, pages 1006–1011, 2015.

Page 20: Multilingual Knowledge Graph Embeddings for Cross-lingual ...yellowstone.cs.ucla.edu/~muhao/slides/mtranse_slides_short.pdf · Var 4≈Var 5 >Var 1 >Var 2 >Var 3≈OT ≫CCA>LM We

Thank You

20


Recommended