情报研究

语义分类的学科主题演化分析方法研究——以我国图书情报领域大数据研究为例

  • 刘自强 ,
  • 王效岳 ,
  • 白如江
展开
  • 山东理工大学科技信息研究所 淄博 255049
刘自强(ORCID:0000-0003-1814-8655),硕士研究生;白如江(ORCID:0000-0003-3822-8484)副教授,博士

收稿日期: 2016-05-23

  修回日期: 2016-07-16

  网络出版日期: 2016-08-05

基金资助

本文系国家社会科学基金项目“未来新兴科学研究前沿识别研究”(项目编号:16BTQ083)和教育部人文社会科学研究青年基金项目“基于引文内容分析的科技创新路径识别研究”(项目编号:16YJC870008)研究成果之一。

Research on the Discipline Topic Evolution Analysis Method of Semantic Classification——A Case Study of Big Data in the Field of Library and Information Science in China

  • Liu Ziqiang ,
  • Wang Xiaoyue ,
  • Bai Rujiang
Expand
  • Institute of Scientific & Technical Information,Shandong University of Technology, Zibo 255049

Received date: 2016-05-23

  Revised date: 2016-07-16

  Online published: 2016-08-05

摘要

[目的/意义] 学科主题演化研究有助于掌握学科发展现状、研究热点、研究前沿和发展趋势等情况,是进行科技创新的基础,是面向科技创新的重要研究方向。[方法/过程] 提出一种语义分类的学科主题演化分析方法:将关键词分为研究问题、研究方法和研究技术3类,构建不同语义分类的共词网络;然后基于Fast Unfolding社区发现算法识别具有语义特征的社区(主题);利用相似度算法计算相邻子时期主题间的相似度,构建学科主题演化图谱,以分析某学科领域研究问题、研究方法和研究技术的变化,实现深度、细致的学科主题演化分析。[结果/结论] 通过对2012-2015年CNKI数据库收录的我国大数据研究领域相关论文数据的处理分析,证明该方法的准确性和有效性。

本文引用格式

刘自强 , 王效岳 , 白如江 . 语义分类的学科主题演化分析方法研究——以我国图书情报领域大数据研究为例[J]. 图书情报工作, 2016 , 60(15) : 76 -85,93 . DOI: 10.13266/j.issn.0252-3116.2016.15.011

Abstract

[Purpose/significance] Topic evolution research is helpful to master the development situation, research focus, research front and development trend in a discipline. It is an important research direction for science and technology innovation.[Method/process] This paper put forward an analysis method of discipline topic evolution based on semantic classification. First, the authors classified keywords into three types of semantic roles——research problems, research methods and research techniques, then constructed co-keywords network of different semantic roles. After this, based on the fast unfolding community discovery algorithm, the authors detected the semantic features of the community (topic); the similarity algorithm was used to calculate similarity between the topics of adjacent periods. The authors also constructed the topic evolution map to analyze changes in a discipline's research problems, research methods and research techniques and achieve an in-depth and meticulous topic evolution analysis. The accuracy and effectiveness of the method are proved according to the analysis of the data of big data in the related research field from 2012-2015 in CNKI.

参考文献

[1] 王效岳,白如江.海量网络学术文献自动分类技术研究[M].北京:人民出版社,2015:40-42.
[2] BUSH V.As we may think[EB/OL].[2016-03-26].http://ccat.sas.upenn.edu/~jod/texts/vannevar.bush.html.
[3] KLEINBERG J.Bursty and hierarchical structure in streams[J].Data mining&knowledge discovery,2003,7(4):373-397.
[4] KONTOSTATHIS A,GALITSKY L M,POTTER NGER W M,et al.A survey of emerging trend in textual data mining[M].New York:Springer Verlag,2004:185-224.
[5] 马费成,张勤.国内外知识管理研究热点——基于词频的统计分析[J].情报学报,2006(2):146-151.
[6] 邱均平,温芳芳.近五年来图书情报学研究热点与前沿的可视化分析——基于13种高影响力外文源刊的计量研究[J].中国图书馆学报,2011(2):51-60.
[7] KOSTOFF R N.EBERHART H J,TOOTHMAN D R,et al.Database tomography for technical intelligence:comparative roadmaps of the research impact assessment literature and the Journal of the american chemical society[J].Scientiometrics,1997,40(1):103-138.
[8] DING Y,CHOWDHURY G G,FOO S.Bibliometric cartography of information retrieval research by using co-word analysis[J].Information processing and management,2001,37(6):817-842.
[9] MANE K K,BORNER K.Mapping topics and topic bursts in PNAS[J].Proceedings of the National Academy of Sciences of the United States of America,2004,101(Suppl 1):5287-5290.
[10] RITZHAUPT A D.An investigation of distance education in North American research literature using co-word analysis[J].International review of research in open&distance learning,2010,11(1):37-60.
[11] 刘晓波.我国图书馆学研究热点及趋势——基于关键词共现和词频统计的可视化研究[J].图书情报工作,2012,56(7):62-67.
[12] GIRVAN M,NEWMAN M E J.Community structure in socia1and biological networks[J].Proceedings of the National Academy of Sciences of the United States of America,2002,99(12):7821-7826.
[13] 杨博,刘大有,LIU J M,等.复杂网络聚类方法[J].软件学报,2009,20(1):54-66.
[14] WALLACE M L,GINGRAS Y,DUHON R.A new approach for detecting scientific specialties from raw cocitation networks[J].Journal of the American Society for Information Science and Technology,2009,60(2):240-246.
[15] 程齐凯,王晓光.一种基于共词网络社区的科研主题演化分析框架[J].图书情报工作,2013,57(8):91-96.
[16] 白如江,冷伏海.k-clique社区知识创新演化方法研究[J].图书情报工作,2013,57(17):94-99.
[17] 刘自强,王效岳,白如江.基于时间序列模型的研究热点分析预测方法研究[J].情报理论与实践,2016,39(5):27-33.
[18] DONOHUE J C.Understanding Scientific Literature[M].Massachusetts:The MIT Press,1974.
[19] NEWMAN M E J,GIRVAN M.Finding and evaluating community structure in networks[J].Physical review,2004,69(2):108-113.
[20] BLONDEL V D,GUILLAUME J L,LAMBIOTTE R,et al.Fast unfolding of communities in large networks[J].Journal of Statistical Mechanics:Theory and Experiment,2008,30(2):155-168.
[21] LIN D.An information theoretic definition of similarity[C].//Fifteenth international conference on machine learning.San Francisco:Morgan Kaufman Publishers,1998:296-304.

文章导航

/