Library and Information Service
A Citation-based Method for Automatic Indexing of Chinese Academic Literatures
Received date: 2013-12-16
Revised date: 2014-01-17
Online published: 2014-02-05
This paper proposes a new automatic indexing method for Chinese academic literature. Exploiting the citation relationships between papers, the method improves a genetic algorithm by using the indexing terms of a paper's references to carry out the indexing task, which avoids the limitations of relying only on internal text features of the body and title. Experiments on real Chinese academic literature demonstrate the effectiveness of the proposed method.
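As a rough illustration of the idea summarized in the abstract, the following Python sketch runs a small genetic algorithm over candidate terms taken from the indexing terms of a paper's references. The abstract does not specify the encoding, fitness function, or genetic operators, so everything here is an assumption made for illustration: the binary-mask encoding, the "reference support" fitness (how many references carry a term), truncation selection, one-point crossover, and all function and parameter names (`candidate_terms`, `fitness`, `evolve`, `max_terms`) are hypothetical and should not be read as the authors' actual method.

```python
import random
from collections import Counter


def candidate_terms(reference_keywords):
    """Count how many references carry each indexing term."""
    counts = Counter()
    for terms in reference_keywords:
        counts.update(set(terms))  # each reference counts a term once
    return counts


def fitness(mask, terms, counts, max_terms):
    """Assumed fitness: total reference support of the selected terms.

    Empty selections and selections longer than max_terms score 0.
    """
    chosen = [t for t, bit in zip(terms, mask) if bit]
    if not chosen or len(chosen) > max_terms:
        return 0
    return sum(counts[t] for t in chosen)


def evolve(reference_keywords, max_terms=3, pop_size=30,
           generations=40, mutation_rate=0.05, seed=0):
    """Return a list of indexing terms selected by a simple GA."""
    rng = random.Random(seed)
    counts = candidate_terms(reference_keywords)
    terms = sorted(counts)
    n = len(terms)
    if n == 0:
        return []

    def score(mask):
        return fitness(mask, terms, counts, max_terms)

    # Start from valid individuals: each selects 1..max_terms random terms.
    population = []
    for _ in range(pop_size):
        k = rng.randint(1, min(max_terms, n))
        picked = set(rng.sample(range(n), k))
        population.append([1 if i in picked else 0 for i in range(n)])

    for _ in range(generations):
        # Truncation selection: the better half survives unchanged (elitism).
        population.sort(key=score, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n) if n > 1 else 0  # one-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation.
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]
            children.append(child)
        population = parents + children

    best = max(population, key=score)
    return [t for t, bit in zip(terms, best) if bit]


# Hypothetical keyword lists for the references of one paper.
refs = [
    ["automatic indexing", "genetic algorithm", "keyword extraction"],
    ["automatic indexing", "conditional random fields"],
    ["keyword extraction", "lexical chains", "automatic indexing"],
    ["genetic algorithm", "encoding"],
]
selected = evolve(refs, max_terms=3)
print(selected)
```

Because the initial population contains only valid selections and the better half of each generation is carried over unchanged, the returned list always holds between one and `max_terms` terms drawn from the references' indexing terms. A realistic fitness function would likely combine this citation-based signal with other evidence, which the abstract does not detail.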
Wang Xing, Liu Wei. A Citation-based Method for Automatic Indexing of Chinese Academic Literatures[J]. Library and Information Service, 2014, 58(03): 106-110, 105. DOI: 10.13266/j.issn.0252-3116.2014.03.017