To Determine the Concept Property of Thesaurus Based on Keyword Frequency

  • Chang Chun
Expand
  • Institute of Scientific and Technical Information of China, Beijing 100038

Received date: 2013-06-26

  Revised date: 2013-08-03

  Online published: 2013-08-20

Abstract

The keyword frequency information plays an important reference value in the construction and application of thesaurus, to improve its practicality. Through research and practice, this paper summarizes three methods to use the keyword frequency information, which are to find the core concepts with keyword frequency and proportion, to determine the preferred words with it, and to give the category for the concept with it. This paper also discusses the level of vocabulary frequency properties, the importance of preferred words, the changes of keyword frequency life-cycle and the limitation of the word frequency.

Cite this article

Chang Chun . To Determine the Concept Property of Thesaurus Based on Keyword Frequency[J]. Library and Information Service, 2013 , 57(16) : 11 -14,24 . DOI: 10.7536/j.issn.0252-3116.2013.16.002

References

[1] 中国科学技术情报研究所, 北京图书馆. 汉语主题词表[M]. 北京: 科学技术文献出版社, 1980.
[2] 中国科学技术情报研究所. 汉语主题词表:自然科学(增订本)[M]. 北京: 科学技术文献出版社, 1991.
[3] 常春. 数字环境下叙词表的变革发展及应用展望[J]. 情报理论与实践, 2009, 32(12):48-50.
[4] 常春, 赖院根. 基于文献标题词汇共现获取词间关系研究[J]. 图书情报工作, 2009, 53(8): 17-20.
[5] Li Changling, Guo Fengjiao, Zhi Ling, et al. Knowledge management research status in China from 2006 to 2010:Based on analysis of the degree theses[J]. Scientometrics, 2013, 94(1): 95-111.
[6] Guo Hanning, Weingart S, Boerner K. Mixed-indicators model for identifying emerging research areas[J]. Scientometrics, 2011, 89(1): 421-435.
[7] Milojevic S, Sugimoto C R, Yan E, et al. The cognitive structure of library and information science:Analysis of article title words[J]. Journal of the American Society for Information Science and Technology, 2011, 62(10): 1933-1953.
[8] 秦玉平, 冷强奎, 王秀坤, 等. 基于局部词频指纹的论文抄袭检测算法[J]. 计算机工程, 2011, 37(6):193-194, 197.
[9] 汤建民, 余丰民. 国内知识图谱研究综述与评估:2004-2010年[J]. 情报资料工作, 2012(1): 16-21.
[10] 薛云, 汤江明, 余丰民. 近10年来图书馆参考咨询服务发展之主题分析:结构与趋势[J]. 图书馆论坛, 2011, 31(3):123-126.
[11] 程智江, 周佩. 国内图书馆参考咨询研究三十年——基于文献计量和词频分析[J]. 图书馆界, 2010(6): 91-94.
[12] 陆华娟. 从"中国期刊全文数据库"相关论文的关键词词频分析看我国读者工作及藏书建设的发展[J]. 图书馆界, 2011(1): 4-6.
[13] 刘艳文, 周朝晖. 自动标引中船舶资料位置权重方案的确定[J]. 科技情报开发与经济, 2012, 22(17): 101-104.
[14] 朱明. 国内图书馆馆员与读者关系的现状研究——基于实证方法的考量[J]. 图书情报工作, 2011, 55(19): 88-91.
[15] 姜远, 周志华. 基于词频分类器集成的文本分类方法[J]. 计算机研究与发展, 2006, 43(10): 1681-1687.
[16] 梁晓娜, 于红, 范丽民, 等. 改进词频分类器集成的文本分类算法[J]. 智能系统学报, 2010, 5(2): 177-180.
[17] Petrova M, Sutcliffe P, Fulford K W M, et al. Search terms and a validated brief search filter to retrieve publications on health-related values in Medline:A word frequency analysis study[J]. Journal of the American Medical Informatics Association, 2012, 19(3):479-488.
[18] Tian T, Chun S A, Geller J. A prediction model for Web search hit counts using word frequencies[J]. Journal of Information Science, 2011, 37(5): 462-475.
[19] Rishel T, Perkins L A, Yenduri S. Determining the context of text using augmented latent semantic indexing[J]. Journal of the American Society for Information Science and Technology, 2007, 58(14): 2197-2204.
[20] 吴芳芳, 王永成, 许一震. 英文文献主题概念的自动提取[J]. 计算机工程, 2001, 27(4): 185-187.
[21] Lloret E, Ferrandez O, Munoz R. A text summarization approach under the influence of textual entailment[EB/OL] [2013-05-20] http://www.dlsi.ua.es/melloret/publications/elloretlCEIS08.pdf.
[22] Alvarado R U,Arango C R. Zipf's law and goffman's transition point in the automatic indexing[J]. Investigacion Bibliotecologica, 2011, 25(54): 71-92.

Outlines

/