|
|
Extracting terms from clinical records of traditional Chinese medicine |
Cungen Cao1,*(),Meng Sun2,Shi Wang1 |
1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China 2. Software College of Beihang University, Beijing 100191, China |
|
|
Abstract Health records of traditional Chinese medicine contain valuable clinical information which can be used for improvement of disease treatment and for medical research. In this paper, we present a practical iterative extraction method for extracting terms from the records. The method is based on a set of extraction rules, the Mesh, and the likelihood ratio technique, and achieved a precision rate of 88.18% and a recall rate of 94.21%.
|
Keywords
term extraction
rule-based
likelihood ratio
|
Corresponding Author(s):
Cungen Cao
|
Online First Date: 27 August 2014
Issue Date: 09 October 2014
|
|
1 |
Ji PP, Yan XY, Cen YH. A survey of term recognition and extraction for domain specific Chinese text information processing. J Libr Inf Serv2010; 16: 124–129
|
2 |
Daille B. Study and implementation of combined techniques for automatic extraction of terminology. In: Klavans JL, Resnik P. The Balancing Act: Combining Symbolic and Statistical Approaches to Language. Cambridge, MA: MIT Press, 49–66
|
3 |
Wang WM, He DC, Fu JH. Research of professional term identification method based on seed expansion. J Comput Appl (Ji Suan Ji Ying Yong)2012; 29(11): 4105–4107 (in Chinese)
|
4 |
Zhang F, Xu Y, Hou Y, Fan X Z. Chinese term extraction system based on mutual information. J Comput Appl (Ji Suan Ji Ying Yong)2005; 22(5): 72–73 (in Chinese)
|
5 |
Hu WM, HE TT, Zhang Y. Extraction of Chinese term based on Chi-square test. J Comput Appl (Ji Suan Ji Ying Yong)2007; 27(12): 3019–3020 (in Chinese)
|
6 |
Zhang WB, Bai Y, Wang PY, Zhang GP. An automatic domain terms extraction method on traditional Chinese medicine books. J Shenyang Aerosp Univ (Shenyang Hang Kong Hang Tian Da Xue Xue Bao)2011; 28(1): 72–75 (in Chinese)
|
7 |
Cen YH, Han Z, Ji PP. Chinese term recognition based on hidden Markov model. J New Technol Libr Inf Serv2008; 12: 54–58
|
8 |
Zhang HP, Liu Q. HHMM-based Chinese lexical analyzer ICTCLAS. In: Proceedings of the second SIGHAN workshop on Chinese language processing. 2003; 17: 184–187
|
9 |
Dunning T. Accurate methods for the statistics of surprise and coincidence. Int J Comput Linguist 1993; 19(1): 61–74
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|