Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2020, Vol. 14 Issue (2) : 291-303    https://doi.org/10.1007/s11704-018-7175-0
RESEARCH ARTICLE
Graph-ranking collective Chinese entity linking algorithm
Tao XIE(), Bin WU, Bingjing JIA, Bai WANG
Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
 Download: PDF(406 KB)  
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Entity linking (EL) systems aim to link entity mentions in the document to their corresponding entity records in a reference knowledge base. Existing EL approaches usually ignore the semantic correlation between the mentions in the text, and are limited to the scale of the local knowledge base. In this paper, we propose a novel graphranking collective Chinese entity linking (GRCCEL) algorithm, which can take advantage of both the structured relationship between entities in the local knowledge base and the additional background information offered by external knowledge sources. By improved weighted word2vec textual similarity and improved PageRank algorithm, more semantic information and structural information can be captured in the document. With an incremental evidence mining process, more powerful discrimination capability for similar entities can be obtained.We evaluate the performance of our algorithm on some open domain corpus. Experimental results show the effectiveness of our method in Chinese entity linking task and demonstrate the superiority of our method over state-of-the-art methods.

Keywords collective entity linking      knowledge mapping      word embedding      entity correlation graph      PageRank     
Corresponding Author(s): Tao XIE   
Just Accepted Date: 24 November 2017   Online First Date: 17 September 2019    Issue Date: 16 October 2019
 Cite this article:   
Tao XIE,Bin WU,Bingjing JIA, et al. Graph-ranking collective Chinese entity linking algorithm[J]. Front. Comput. Sci., 2020, 14(2): 291-303.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-018-7175-0
https://academic.hep.com.cn/fcs/EN/Y2020/V14/I2/291
1 B X Huai, T F Bao, H S Zhu, Q Liu. Topic modeling approach to named entity linking. Journal of Software, 2014, 21(8): 1235–1248
2 L Xiao, D S Weld. Fine-grained entity recognition. In: Proceedings of AAAI Conference on Artificial Intelligence. 2012, 1189–1192
3 F Wu, D S Weld. Open information extraction using Wikipedia. In: Proceedings of the 48th AnnualMeeting of the Association for Computational Linguistics. Association for Computational Linguistics. 2013, 118–127
4 F Wu, D S Weld. Autonomously semantifying Wikipedia. In: Proceedings of the 16th ACM Conference on Conference on Information and Knowledge Management. 2007, 41–50
https://doi.org/10.1145/1321440.1321449
5 H Ji, R Grishman. Knowledge base population: successful approaches and challenges. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Lanuoge Technology- Volume 1. Association for Computational Linguistics. 2011, 1148–1158
6 M Dredze, P Mcnamee, D Rao, A Gerber, T Finin. Entity disambiguation for knowledge base population. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 2010, 277–285
7 W Shen, J Wang, J Han. Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Transactions on Knowledge & Data Engineering, 2015, 27(2): 443–460
https://doi.org/10.1109/TKDE.2014.2327028
8 X Li, S M Strassel, H Ji, K Griffitt, J Ellis. Linguistic resources for entity linking evaluation: from monolingual to cross-lingual. In: Proceedings of International Conference on Language Resources and Evaluation. 2013, 3098–3105
9 R C Bunescu, M Pasca. Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics. 2006, 9–16
10 S Cucerzan. Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2007, 708–716
11 Z Yang, H Huang. WSD method based on heterogeneous relation graph. Journal of Computer Research & Development, 2013, 50(2): 437–444
12 S Cucerzan. Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2007, 708–716
13 H T Nguyen, T H Cao. Exploring Wikipedia and text features for named entity disambiguation. In: Proceedings of Asian Conference on Intelligent Information and Database Systems. 2010, 11–20
https://doi.org/10.1007/978-3-642-12101-2_2
14 Y Zeng, D Wang, T Zhang, H Wang, H Hao. Linking entities in short texts based on a Chinese semantic knowledge base. In: Proceedings of Natural Language Processing and Chinese Computing: Second CCF Conference. 2013, 266–276
https://doi.org/10.1007/978-3-642-41644-6_25
15 T Zhang, K Liu, J Zhao. A graph-based similarity measure between Wikipedia concepts and its applications in entity linking system. Journal of Chinese Information Processings, 2015, 22(1): 58–67
16 Z Zuo, G Kasneci, T Gruetze, F Naumann. BEL: bagging for entity linking. In: Proceedings of International Conference on Computational Linguistics. 2014, 266–276
17 J Xu, L Gan, B Zhou, Q Wu. An unsupervised method for linking entity mentions in Chinese text. In: Proceedings of Asia-Pacific Services Computing Conference. 2016, 183–195
https://doi.org/10.1007/978-3-319-49178-3_14
18 X Han, L Sun, J Zhao. Collective entity linking in web text: a graphbased method. In: Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval. 2011, 765–774
https://doi.org/10.1145/2009916.2010019
19 W Shen, J Wang, P Luo, M Wang. LINDEN: linking named entities with knowledge base via semantic knowledge. In: Proceedings of the 21st International Conference on World Wide Web. 2012, 449–458
https://doi.org/10.1145/2187836.2187898
20 J Hoffart, M A Yosef, I Bordino, H Fürstenau. Robust disambiguation of named entities in text. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011, 782–792
21 Z Guo, D Barbosa. Robust entity linking via random walks. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management. 2014, 499–508
https://doi.org/10.1145/2661829.2661887
22 A Moro, A Raganato, R Navigli. Entity linking meets word sense disambiguation: a unified approach. Journal of Transactions of the Association for Computational Linguistics, 2014, 22(5): 231–244
https://doi.org/10.1162/tacl_a_00179
23 Y Li, C Wang, F Han, J Han, R Dan. Mining evidences for named entity disambiguation. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2013, 1070–1078
https://doi.org/10.1145/2487575.2487681
24 T Mikolov, I Sutskever, K Chen, G Corrado, J Dean. Distributed representations of words and phrases and their compositionality. Journal of Advances in Neural Information Processing Systems, 2013, 26: 3111–3119
25 A Alhelbawy, R Gaizauskas. Graph ranking for collective named entity disambiguation. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. 2014, 75–80
https://doi.org/10.3115/v1/P14-2013
26 B Hachey, W Radford, J R Curran. Graph-based named entity linking with Wikipedia. In: Proceedings of International Conference on Web Information Systems Engineering. 2011, 213–226
https://doi.org/10.1007/978-3-642-24434-6_16
[1] Article highlights Download
[1] Fei WANG, Tieyun QIAN, Bin LIU, Zhiyong PENG. Patent expanded retrieval via word embedding under composite-domain perspectives[J]. Front. Comput. Sci., 2019, 13(5): 1048-1061.
[2] Yinglong ZHANG,Cuiping LI,Chengwang XIE,Hong CHEN. Accuracy estimation of link-based similarity measures and its application[J]. Front. Comput. Sci., 2016, 10(1): 113-123.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed