Library and Information Service ›› 2022, Vol. 66 ›› Issue (17): 93-105. DOI: 10.13266/j.issn.0252-3116.2022.17.009

• Information Research •

Novelty Evaluation Method of Single Scientific and Technical Literature Considering Time Series

Zhang Jiyu, Zhang Junsheng

  1. Institute of Scientific and Technical Information of China, Beijing 100038
  • Received: 2022-03-28 Revised: 2022-07-02 Online: 2022-09-05 Published: 2022-09-09
  • Corresponding author: Zhang Junsheng, research fellow, PhD, E-mail: zhangjs@istic.ac.cn
  • About the author: Zhang Jiyu, master's student
  • Funding:
    This work is one of the research outcomes of the National Key Research and Development Program project "Disruptive Technology Identification Theory, Methods, and Expert Prediction System" (Grant No. 2019YFA0707201) and the Institute of Scientific and Technical Information of China Innovation Research Fund project "Research on Methods for Evaluating the Originality and Novelty of Scientific and Technical Papers" (Grant No. MS2022-05).

Abstract: [Purpose/Significance] This paper brings the time dimension into the novelty analysis of scientific and technical literature, evaluating a paper's innovativeness from the perspective of historical development in order to support the evaluation of representative works. [Method/Process] This paper proposes a time-aware problem-method matrix for assessing the novelty of a single paper. The method extracts the research problems and research methods from each paper by pattern matching and vocabulary matching, computes text similarity with the Jaccard coefficient to cluster similar texts, and then constructs a problem-method matrix that is used to assess each paper's time-dependent novelty. The assessment results are presented as a visualized matrix. [Result/Conclusion] In the experiment, 222 papers published in 2019-2021 and classified under TP391.1 were used as the dataset to verify the effectiveness and feasibility of the proposed method. The results show that the method can assist novelty assessment in the evaluation of representative works.

Key words: time series, novelty evaluation, problem-method matrix, scientific and technical literature, evaluation of representative works

CLC number:
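
The pipeline outlined in the abstract (Jaccard-based clustering of extracted problems and methods, followed by a time-aware problem-method matrix) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The whitespace tokenization, the greedy single-pass clustering, the 0.5 threshold, the rule that a paper is novel when it is the earliest occupant of its matrix cell, and the sample records are all assumptions made for illustration. The extraction step (pattern and vocabulary matching) is assumed to have already produced the `problem` and `method` fields.

```python
def jaccard(a: str, b: str) -> float:
    """Jaccard coefficient between the token sets of two phrases."""
    set_a, set_b = set(a.lower().split()), set(b.lower().split())
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def cluster(phrases, threshold=0.5):
    """Greedy single-pass clustering (an assumed, simplified scheme).

    A phrase joins the first cluster whose representative (its first
    member) has Jaccard similarity >= threshold; otherwise it starts
    a new cluster. Returns (representatives, phrase -> cluster index).
    """
    reps, labels = [], {}
    for phrase in phrases:
        for i, rep in enumerate(reps):
            if jaccard(phrase, rep) >= threshold:
                labels[phrase] = i
                break
        else:
            reps.append(phrase)
            labels[phrase] = len(reps) - 1
    return reps, labels


def novelty_flags(papers, threshold=0.5):
    """Flag each paper as novel if no earlier paper shares its matrix cell.

    A cell is a (problem cluster, method cluster) pair. Papers published
    in the same year as the first occupant of a cell also count as novel
    (an assumption of this sketch).
    """
    ordered = sorted(papers, key=lambda p: p["year"])
    _, prob_of = cluster([p["problem"] for p in ordered], threshold)
    _, meth_of = cluster([p["method"] for p in ordered], threshold)
    first_year, flags = {}, {}
    for p in ordered:
        cell = (prob_of[p["problem"]], meth_of[p["method"]])
        first_year.setdefault(cell, p["year"])
        flags[p["id"]] = p["year"] == first_year[cell]
    return flags


# Hypothetical records; in the paper these fields come from the extraction step.
papers = [
    {"id": "A", "year": 2019, "problem": "named entity recognition",
     "method": "conditional random field"},
    {"id": "B", "year": 2020, "problem": "chinese named entity recognition",
     "method": "conditional random field model"},
    {"id": "C", "year": 2021, "problem": "named entity recognition",
     "method": "graph neural network"},
]

print(novelty_flags(papers))  # {'A': True, 'B': False, 'C': True}
```

In this toy data, paper B falls into the same cell as paper A (both similarities are 0.75), so it is not flagged, while paper C pairs a known problem with a new method cluster and is flagged as novel.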