Library and Information Service >
An Approach of Ranking Synonym Extracting Results Based on Directed Graph
Received date: 2015-05-14
Revised date: 2015-05-25
Online published: 2015-06-20
[Purpose/significance] There is lots of noise in the results extracted by current synonym extraction methods. It needs high artificial cost to remove noise. An approach is proposed to rank synonym extracting results which can make synonyms ahead of noises, and enhance the extraction accuracy and reduce the manual cost. [Method/process] It transforms the extracting results into a directed graph of extracting relation. The semantic similarity between each unit in the result and the word (or phrase) is calculated based on the directed graph, and the units in the result are ranked by semantic similarity. The approach just uses the existing synonym extracting method with no any human involvement and other semantic knowledge. [Result/conclusion] The experiments conducted on the real dataset show that the ranking effectiveness is getting better as the size of the extracting result increases.
Key words: synonym; information extraction; noise cleaning; result ranking
Liu Wei . An Approach of Ranking Synonym Extracting Results Based on Directed Graph[J]. Library and Information Service, 2015 , 59(12) : 128 -134 . DOI: 10.13266/j.issn.0252-3116.2015.12.019
[1] 同义词抽取结果测评.[EB/OL].[2015-04-22].http://tcci.ccf.org.cn/conference/2012/dldoc/2012语义关系评测结果.pdf.
[2] Pantel P, Lin Dekang. Discovering word senses from text[C]//Proceedings of SIGKDD Conference on Knowledge Discovery and Data Mining. Edmonton: ACM Press, 2002: 613-619.
[3] Cheng Tao, Lauw W, Paparizos S. Entity synonyms for structured Web search[J]. IEEE Transactions on Knowledge and Data Engineering, 2012, 24(10): 1862-1875.
[4] Plas L, Tiedemann J. Finding synonyms using automatic word alignment and measures of Distributional Similarity[C]//Proceedings of 44th Annual Meeting of the Association for Computational Linguistics. Sydney: Association for Computer Linguistics Press, 2006: 866-873.
[5] 陆勇, 侯汉清. 基于模式匹配的汉语同义词自动识别[J]. 情报学报, 2006, 25(6): 720-724.
[6] 吴云芳, 石静, 金澎. 基于图的同义词集自动获取方法[J]. 计算机研究与发展, 2011, 48(4): 610-616.
[7] Masato H, Yasuhiro O, Katsuhiko T. Supervised synonym acquisition using distributional features and syntactic patterns[J]. Information and Media Technologies, 2009, 4(2): 558-582.
[8] Kaji N, Kitsuregawa M. Using hidden Markov random fields to combine distributional and pattern-based word clustering[C]//Proceedings of the 22nd International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics Press, 2008: 401-408.
[9] Weale T, Brew C, Fosler-Lussier E. Using the Wiktionary graph structure for synonym detection[C]//Proceedings of the 2009 Workshop on the People's Web Meets NLP.Stroudsburg: Association for Computational Linguistics Press, 2009: 28-31.
[10] Blondel V, Senellart P. Automatic extraction of synonyms in a dictionary[C]//Proceedings of the 2009 SIAM Workshop on Text Mining.Arlington: Springer Press, 2002:1-6.
[11] 陆勇,章成志,侯汉清,等.基于百科资源的多策略中文同义词自动抽取研究[J].中国图书馆学报,2010,36(1):56-62.
[12] 刘伟,黄小江,万小军,等.互联网环境下的英文同义术语自动发现研究与系统实现[J].图书情报工作,2012,56(22):26-31.
[13] 刘伟.互联网同义词搜索中的词义聚类问题研究[J].图书情报工作,2013,57(16):15-19.
/
〈 |
|
〉 |