情报研究

引文上下文在文献内容分析中的信息价值研究

  • 刘洋 ,
  • 崔雷
展开
  • 1. 中国医科大学附属盛京医院信息科;
    2. 中国医科大学医学信息学系
刘洋,中国医科大学附属盛京医院信息科助理馆员,E-mail:704923089@qq.com;崔雷,中国医科大学医学信息学系教授,副主任,博士生导师。

收稿日期: 2014-02-21

  修回日期: 2014-03-01

  网络出版日期: 2014-03-20

The Information Value of Citation Context in Document Content Analysis

  • Liu Yang ,
  • Cui Lei
Expand
  • 1. The Shengjing Hospital, China Medical University, Shenyang 110004;
    2. Department of Medical Informatics, China Medical University, Shenyang 110001

Received date: 2014-02-21

  Revised date: 2014-03-01

  Online published: 2014-03-20

摘要

以引文上下文为研究对象,探讨来自于引文上下文、目标文献摘要以及目标文献自标医学主题词(下称主题词)三者间的符合程度,定量分析引文上下文在表征目标文献内容特征时的作用。以被Circulation杂志高频引证的5篇研究类论文作为目标文献,提取其施引文献的全部引文上下文,并对其进行分词及主题词匹配;将其结果与目标文献摘要提取的主题词以及文献自标的主题词进行两两比较。结果表明,引文上下文与目标文献摘要具有较高的符合度,而且在表征被引文献内容特征的效果上明显具有优势。

本文引用格式

刘洋 , 崔雷 . 引文上下文在文献内容分析中的信息价值研究[J]. 图书情报工作, 2014 , 58(06) : 101 -104 . DOI: 10.13266/j.issn.0252-3116.2014.06.017

Abstract

In order to investigate the relationships among citation context, the abstract and the cited target papers, and explore quantitatively the effect of the citation contexts in revealing the subject features of the cited target papers, five highly cited target papers by Circulation were screened. The citation contexts were extracted from all citing papers and the medical subject headings (MeSH terms) were matched. The pairwise comparison was performed among MeSH terms extracted from citation context, abstracts and indexed by the target paper itself. The result shows that the MeSH terms matched from the citation context is in highly conformity with that from the abstract of the target paper. The citation context can express the subject features of the target paper even better than the abstract.

参考文献

[1] Nakov P, Schwartz A, Hearst M. Citances:Citation sentences for semantic analysis of bioscience text[EB/OL].[2013-04-30].http://biotext.berkeley.edu/papers/citances-nlpbio04.pdf.

[2] Ritchie A, Robertson S, Teufel S. Comparing citation contexts for information retrieval[C]// Proceedings of the 17th ACM Conference on Information and Knowledge Management(CIKM). Napa Valley:ACM, 2008: 213-222.

[3] 崔雷,刘伟,闫雷,等.文献数据库中书目信息共现挖掘系统的开发[J]. 现代图书情报技术, 2008(8): 70-75.

[4] MetaMap-a tool for recognizing UMLS concepts in text[DB/OL]. [2013-04-30]. http://mmtx.nlm.nih.gov/.

[5] Thomson Reuters. Journal Citation Reports, Science edition, 2011[DB/OL]. [2013-04-30]. https://www.webofknowledge.com/.

[6] Aljaber B,Martinez D, Stokes N,et al. Improving MeSH classification of biomedical articles using citation contexts[J]. Journal of Biomedical Informatics,2011, 44(5):881-896.

[7] 孙枫军. 引文上下文中的概念抽取[D]. 北京:中国科学技术信息研究所,2012.

[8] Ritchie A, Teufel S, Robertson S. How to find better index terms through citations[C]// Proceedings of the Workshop on How Can Computational Linguistics Improve Information Retrieval?(CLIIR). Sydney: Association for Computational Linguistics,2006: 25-32.

[9] Liu Shengbo,Chen Chaomei. The differences between latent topics in abstracts and citation contexts of citing papers[J]. Journal of the American Society for Information Science and Technology,2013, 64(3):627-639.

文章导航

/