Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

邮发代号 80-970

2019 Impact Factor: 1.275

Frontiers of Computer Science  2024, Vol. 18 Issue (4): 184319   https://doi.org/10.1007/s11704-023-2542-x
  本期目录
Semantic-aware entity alignment for low resource language knowledge graph
Junfei TANG1,2, Ran SONG1,2, Yuxin HUANG1,2, Shengxiang GAO1,2, Zhengtao YU1,2()
1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
2. Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China
 全文: PDF(9116 KB)   HTML
Abstract

Entity alignment (EA) is an important technique aiming to find the same real entity between two different source knowledge graphs (KGs). Current methods typically learn the embedding of entities for EA from the structure of KGs for EA. Most EA models are designed for rich-resource languages, requiring sufficient resources such as a parallel corpus and pre-trained language models. However, low-resource language KGs have received less attention, and current models demonstrate poor performance on those low-resource KGs. Recently, researchers have fused relation information and attributes for entity representations to enhance the entity alignment performance, but the relation semantics are often ignored. To address these issues, we propose a novel Semantic-aware Graph Neural Network (SGNN) for entity alignment. First, we generate pseudo sentences according to the relation triples and produce representations using pre-trained models. Second, our approach explores semantic information from the connected relations by a graph neural network. Our model captures expanded feature information from KGs. Experimental results using three low-resource languages demonstrate that our proposed SGNN approach out performs better than state-of-the-art alignment methods on three proposed datasets and three public datasets.

Key wordsgraph neural network    knowledge graph    entity alignment    low-resource language
收稿日期: 2022-08-19      出版日期: 2023-06-16
Corresponding Author(s): Zhengtao YU   
 引用本文:   
. [J]. Frontiers of Computer Science, 2024, 18(4): 184319.
Junfei TANG, Ran SONG, Yuxin HUANG, Shengxiang GAO, Zhengtao YU. Semantic-aware entity alignment for low resource language knowledge graph. Front. Comput. Sci., 2024, 18(4): 184319.
 链接本文:  
https://academic.hep.com.cn/fcs/CN/10.1007/s11704-023-2542-x
https://academic.hep.com.cn/fcs/CN/Y2024/V18/I4/184319
Fig.1  
Fig.2  
Fig.3  
Fig.4  
Fig.5  
Datasets Language Entity Relation Relation triples Attribute triples Rel/Rel triples/%
ZH-VI Chinese 13,053 335 38,610 46,651 0.868
Vietnamese 12,411 342 39,993 45,661 0.855
TH-VI Thai 10,442 298 35,309 41,478 0.844
Vietnamese 10,043 310 38,571 40,257 0.804
ZH-TH Chinese 8,275 315 29,132 31,958 1.081
Thai 8,186 299 27,145 30,203 1.101
Tab.1  
Datasets Language Entity Relation Relation triples Attribute triples Rel/Rel triples/%
ZH-EN Chinese 66,469 2,830 70,414 468,439 4.019
English 98,125 2,317 95,142 496,039 2.435
JA-EN Japanese 65,774 2,043 77,214 379,795 2.646
English 95,680 2,096 93,484 440,216 2.242
FR-EN French 66,858 1,379 105,998 476,372 1.301
English 105,889 2,209 115,722 551,607 1.909
Tab.2  
Model ZH-VI ZH-TH TH-VI
Hits@1 Hits@10 Hits@1 Hits@10 Hits@1 Hits@10
GCN-Align(2018) [30] 17.12 45.21 20.00 46.51 20.06 51.20
HGCN(2019) [36] 13.67 22.33 13.89 20.69 11.77 18.74
RDGCN(2019) [35] 7.12 12.69 4.97 13.60 8.80 12.29
MuGNN(2019) [31] 20.83 67.52 28.74 73.14 23.38 69.79
AttrGNN(2020) [34] 41.11 71.75 51.62 76.38 47.49 71.71
RNM(2021) [37] 18.83 24.14 15.94 20.91 10.91 15.09
KE-GCN(2021) [38] 43.11 67.02 50.94 74.54 46.80 72.657
SGNN 45.17 71.28 53.49 77.46 51.20 73.14
Tab.3  
Model DBP15KZH?EN DBP15KJA?EN DBP15KFR?EN
Hits@1 Hits@10 Hits@1 Hits@10 Hits@1 Hits@10
GCN-Align(2018) [30] 50.82 79.15 53.09 82.96 54.49 84.73
RDGCN(2019) [35] 70.75 84.55 76.74 89.54 88.64 95.72
AttrGNN(2020) [34] 79.60 92.93 78.33 92.08 91.85 97.77
KE-GCN(2021) [38] 56.20 84.20 57.00 85.20 57.20 85.40
SGNN 82.83 94.17 83.73 93.76 91.54 97.88
Tab.4  
Model ZH?VI
Hits@1 Hits@10
SGNN 45.17 71.28
SGNN-w/o structure 23.03 38.02
SGNN-w/o attribute 29.10 55.63
SGNN-w/o ps 43.01 69.30
SGNN-mBERT 43.97 71.02
SGNN-random 40.01 65.69
SGNN-w/o sub-graph 38.97 63.71
Tab.5  
Fig.6  
Language Head Relation Tail Score
Chinese the United Kingdom official language English ?
head of government Boris Johnson
participant in World War II
Vietnamese the United Kingdom official language Welsh 0.8813
head of government Boris Johnson
participant in World War II
the United States of America speaking language English 0.7531
country of citizenship Boris Johnson
participant in World War II
France writing language French 0.5810
head of government élisabeth Borne
participant in World War II
Tab.6  
  
  
  
  
  
1 B, Yang T Mitchell . Leveraging knowledge bases in LSTMs for improving machine reading. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2019, 1436−1446
2 Y, Cao L, Hou J, Li Z Liu . Neural collective entity linking. In: Proceedings of the 27th International Conference on Computational Linguistics. 2018, 675−686
3 Yang Z, Qi P, Zhang S, Bengio Y, Cohen W, Salakhutdinov R, Manning C D. HotpotQA: a dataset for diverse, explainable multi-hop question answering. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 2018, 2369−2380
4 M, Franco-Salvador P, Rosso M Montes-y-Gómez . A systematic study of knowledge graph analysis for cross-language plagiarism detection. Information Processing & Management, 2016, 52( 4): 550–570
5 J, Lehmann R, Isele M, Jakob A, Jentzsch D, Kontokostas P N, Mendes S, Hellmann M, Morsey Kleef P, van S, Auer C Bizer . DBpedia–A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web, 2015, 6( 2): 167–195
6 F M, Suchanek G, Kasneci G Weikum . Yago: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web. 2007, 697−706
7 R, Navigli S P Ponzetto . BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 2012, 193: 217–250
8 M, Chen Y, Tian M, Yang C Zaniolo . Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017, 1511−1517
9 Z, Sun W, Hu C Li . Cross-lingual entity alignment via joint attribute-preserving embedding. In: Proceedings of the 16th International Semantic Web Conference. 2017, 628−644
10 P, Veličković G, Cucurull A, Casanova A, Romero P, Liò Y Bengio . Graph attention networks. In: Proceedings of the ICLR 2018. 2018
11 A, Conneau K, Khandelwal N, Goyal V, Chaudhary G, Wenzek F, Guzmán E, Grave M, Ott L, Zettlemoyer V Stoyanov . Unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 8440−8451
12 Petroni F, Rocktäschel T, Riedel S, Lewis P, Bakhtin A, Wu Y, Miller A H, Riedel S. Language models as knowledge bases? In: Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019, 2463−2473
13 Jiang Z, Anastasopoulos A, Araki J, Ding H, Neubig G. X-FACTR: multilingual factual knowledge retrieval from pretrained language models. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 5943−5959
14 A, Sanchez-Gonzalez N, Heess J T, Springenberg J, Merel M A, Riedmiller R, Hadsell P W Battaglia . Graph networks as learnable physics engines for inference and control. In: Proceedings of the 35th International Conference on Machine Learning. 2018, 4467−4476
15 Y, Wu D, Lian Y, Xu L, Wu E Chen . Graph convolutional networks with Markov random field reasoning for social spammer detection. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2020, 1054−1061
16 A, Fout J, Byrd B, Shariat A Ben-Hur . Protein interface prediction using graph convolutional networks. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 6533−6542
17 H, Dai E B, Khalil Y, Zhang B, Dilkina L Song . Learning combinatorial optimization algorithms over graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 6351−6361
18 T N, Kipf M Welling . Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th International Conference on Learning Representations. 2017
19 Y, Li D, Tarlow M, Brockschmidt R S Zemel . Gated graph sequence neural networks. In: Proceedings of the 4th International Conference on Learning Representations. 2016
20 K, Xu W, Hu J, Leskovec S Jegelka . How powerful are graph neural networks? In: Proceedings of the 7th International Conference on Learning Representations. 2019
21 M, Defferrard X, Bresson P Vandergheynst . Convolutional neural networks on graphs with fast localized spectral filtering. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. 2016, 3844−3852
22 W L, Hamilton R, Ying J Leskovec . Inductive representation learning on large graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 1025−1035
23 A, Bordes N, Usunier A, Garcia-Durán J, Weston O Yakhnenko . Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013, 2787−2795
24 S, Ji S, Pan E, Cambria P, Marttinen P S Yu . A survey on knowledge graphs: representation, acquisition, and applications. IEEE Transactions on Neural Networks and Learning Systems, 2022, 33( 2): 494–514
25 Wang Z, Zhang J, Feng J, Chen Z. Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the 28th AAAI Conference on Artificial Intelligence. 2014, 1112−1119
26 Lin Y, Liu Z, Sun M, Liu Y, Zhu X. Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the 29th AAAI Conference on Artificial Intelligence. 2015, 2181−2187
27 H, Zhu R, Xie Z, Liu M Sun . Iterative entity alignment via joint knowledge embeddings. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017, 4258−4264
28 Z, Sun W, Hu Q, Zhang Y Qu . Bootstrapping entity alignment with knowledge graph embedding. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence. 2018, 4396−4402
29 Q, Zhu X, Zhou J, Wu J, Tan L Guo . Neighborhood-aware attentional representation for multilingual knowledge graphs. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence. 2019, 1943−1949
30 Wang Z, Lv Q, Lan X, Zhang Y. Cross-lingual knowledge graph alignment via graph convolutional networks. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 2018, 349−357
31 Y, Cao Z, Liu C, Li Z, Liu J, Li T S Chua . Multi-channel graph neural network for entity alignment. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019, 1452−1461
32 K, Xu L, Wang M, Yu Y, Feng Y, Song Z, Wang D Yu . Cross-lingual knowledge graph alignment via graph matching neural network. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019, 3156−3161
33 Y, Wu X, Liu Y, Feng Z, Wang D Zhao . Neighborhood matching network for entity alignment. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 6477−6487
34 Liu Z, Cao Y, Pan L, Li J, Chua T S. Exploring and evaluating attributes, values, and structures for entity alignment. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 6355−6364
35 Wu Y, Liu X, Feng Y, Wang Z, Yan R, Zhao D. Relation-aware entity alignment for heterogeneous knowledge graphs. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence. 2019, 5278−5284
36 Wu Y, Liu X, Feng Y, Wang Z, Zhao D. Jointly learning entity and relation representations for entity alignment. In: Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019, 240−249
37 Zhu Y, Liu H, Wu Z, Du Y. Relation-aware neighborhood matching model for entity alignment. In: Proceedings of the 35th AAAI Conference on Artificial Intelligence. 2021, 4749−4756
38 D, Yu Y, Yang R, Zhang Y Wu . Knowledge embedding based graph convolutional network. In: Proceedings of the Web Conference 2021. 2021, 1619−1628
39 Duchi J, Hazan E, Singer Y. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 2011, 12(61): 2121–2159
40 M D Zeiler . ADADELTA: an adaptive learning rate method. 2012, arXiv preprint arXiv: 1212.5701
[1] FCS-22542-OF-JT_suppl_1 Download
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed