综述述评

学术论文引用预测研究进展

  • 夏琬钧 ,
  • 陈晓红 ,
  • 江艳萍
展开
  • 1. 西南交通大学图书馆 成都 611756;
    2. 西南交通大学信息科学与技术学院 成都 611756
夏琬钧(ORCID:0000-0001-9722-9837),馆员,博士研究生,E-mail:xiawanjun@home.swjtu.edu.cn;陈晓红(ORCID:0000-0003-3277-8725),副研究馆员,硕士;江艳萍(ORCID:0000-0003-3152-0204),馆员,博士。

收稿日期: 2019-07-02

  修回日期: 2019-09-19

  网络出版日期: 2020-03-20

基金资助

本文系四川省文化和旅游厅图书情报学与文献学规划项目"基于学术大数据的潜力学者挖掘研究"(项目编号:WHTTSXM[2018]25)和四川省社会科学重点研究基地-四川学术成果分析与应用研究中心项目"基于在线评论的中文图书影响力研究"(项目编号:SCAA17-006)研究成果之一。

Research on Academic Paper Citation Prediction

  • Xia Wanjun ,
  • Chen Xiaohong ,
  • Jiang Yanping
Expand
  • 1. Library of Southwest Jiaotong University, Chengdu 611756;
    2. School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756

Received date: 2019-07-02

  Revised date: 2019-09-19

  Online published: 2020-03-20

摘要

[目的/意义] 对学术论文引用预测影响因素和预测方法进行梳理,分析现存问题并提出发展方向。[方法/过程] 采用文献调研法,综述国内外研究进展,总结预测影响因素和预测方法的相关内容和特点。[结果/结论] 现有影响因素指标繁多,无统一标准;预测方法理论基础薄弱;引文预测动态性研究不足;预测模型通用性受限。未来应加强引文预测的理论研究、加强传统文献计量和替代计量的结合、加强自然语言处理的深度应用、建立统一的基线标准、构建更加精准的预测模型。

本文引用格式

夏琬钧 , 陈晓红 , 江艳萍 . 学术论文引用预测研究进展[J]. 图书情报工作, 2020 , 64(6) : 138 -145 . DOI: 10.13266/j.issn.0252-3116.2020.06.016

Abstract

[Purpose/significance] This paper summarizes the influencing factors and prediction methods of academic paper citation, analyzes the existing problems and proposes the future development directions.[Method/process] This paper used the literature research method to review the research progress of academic papers at home and abroad, and summarized the relevant content and characteristics of influencing factors and prediction methods.[Result/conclusion] There are many indicators of influencing factors, but there is no unified selection criteria. The theoretical basis of prediction methods is weak. The research on dynamics of citation prediction is insufficient. The generality of prediction models is limited. In the future, we should strengthen the theoretical research of citation prediction methods, the combination of traditional bibliometrics and alternative metrics, the deep application of natural language processing, and establish a unified baseline standard, a more accurate prediction model.

参考文献

[1] 耿骞,景然,靳健,等.学术论文引用预测及影响因素分析[J].图书情报工作,2018,62(14):29-40.
[2] YANG L, ZHANG Z, CAI X, et al. Citation recommendation as edge prediction in heterogeneous bibliographic network:a network representation approach[J]. IEEE Access, 2019,7:23232-23239.
[3] 鲍玉芳,马建霞.科学论文被引频次预测的现状分析与研究[J].情报杂志,2015,34(5):66-71.
[4] STEWART J A. Achievement and ascriptive processes in the recognition of scientific articles[J].Socialforces,1983,62(1):166-189.
[5] WILLIS D L, BAHLER C D, NEUBERGER M M, et al.Predictors of citations in the urological literature[J].BJUinternational,2011,107(12):1876-1880.
[6] KOSTEAS V D.Predicting long-run citation counts for articles in top economics journals[J].Scientometrics,2018,115(3):1395-1412.
[7] CHAKRABORTY T,KUMAR S,GOYAL P,et al.Towards a stratified learning approach to predict future citation counts[C]//2014 IEEE/ACM joint conference on digital libraries (JCDL).London:IEEE computer society,2014:351-360.
[8] ANTONIOU G A,ANTONIOU S A,GEORGAKARAKOS E I,et al.Bibliometric analysis of factors predicting increased citations in the vascular and endovascular literature[J].Annals of vascular surgery,2015,29(2):286-292.
[9] FU L D, ALIFERIS C F. Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature[J]. Scientometrics,2010,85(1):257-270.
[10] YU T, YU G, LI P Y, et al. Citation impact prediction for scientific papers using stepwise regression analysis[J].Scientometrics,2014,101(2):1233-1252.
[11] HASLAM N, BAN L, KAUFMANN L, et al. What makes an article influential? Predicting impact in social and personality psychology[J]. Scientometrics,2008,76(1):169-185.
[12] ROTHC, Wu J, Lozano S. Assessing impact and quality from local dynamics of citation networks[J]. Journal of informetrics,2013,6(1):111-120.
[13] DIDEGAH F, THELWALL M. Which factors help authors produce the highest impact research? Collaboration, journal and document properties[J].Journal of informetrics,2013,7(4):861-873.
[14] SUBOTIC S, MUKHERJEE B. Short and amusing:the relationship between title characteristics, downloads, and citations in psychology articles[J]. Journal of information science, 2014, 40(1):115-124.
[15] SOHRABI B, IRAJ H. The effect of keyword repetition in abstract and keyword frequency per journal in predicting citation counts[J].Scientometrics,2017, 110(1):243-251.
[16] YAN R, TANG J, LIU X, et al. Citation count prediction:learning to estimate future citations for literature.[C]//ACM international conference on information & knowledge management. Glasgow:ACM,2011:1247-1252.
[17] TAHAMTAN I, AFSHAR A S, AHAMDZADEH K. Factors affecting number of citations:a comprehensive review of the literature[J].Scientometrics,2016,107(3):1195-1225.
[18] BORNMANN L, LEYDESDORFF L, WANG J. How to improve the prediction based on citation impact percentiles for years shortly after the publication date?[J].Journal of informetrics,2014,8(1):175-180.
[19] 张美平, 尚明生. 基于持续关注度衰减的重要论文预测[J]. 复杂系统与复杂性科学, 2015, 12(3):77-84.
[20] BORNMANN L, DANIEL H D. Citation speed as a measure to predict the attention an article receives:An investigation of the validity of editorial decisions at AngewandteChemie International Edition[J]. Journal of informetrics, 2010, 4(1):83-88.
[21] 熊泽泉,段宇锋.论文早期下载量可否预测后期被引量?——以图书情报领域期刊为例[J].图书情报知识,2018(4):32-42.
[22] SHEMA H, BAR-ILAN J, THELWALL M. Do blog citations correlate with a higher number of future citations? Research blogs as a potential source for alternative metrics[J].Journal of the Association for Information Science and Technology,2014,65(5):1018-1027.
[23] PEOPLES B K, MIDWAY S R, SACKETT D, et al. Twitter predicts citation rates of ecological research[J].Plos One,2016,11(11):e0166570.
[24] ZOLLER D, DOERFEL S, JÄSCHKE R, et al. Posted, visited, exported:Altmetrics in the social tagging system bibsonomy[J].Journal of informetrics,2016,10(3):732-749.
[25] THELWALL M, NEVILL T. Could scientists use Altmetric.com scores to predict longer term citation counts?[J].Journal of informetrics,2018,12(1):237-248.
[26] WANG D, SONG C, Barabasi A L. Quantifying long-term scientific Impact[J]. Science, 2013, 342(6154):127-132.
[27] ONODERA N, YOSHIKANE F. Factors affecting citation rates of research articles[J].Journal of the Association for Information Science and Technology,2015,66(4):739-764.
[28] 余厚强,邱均平.论替代计量学在图书馆文献服务中的应用[J].情报杂志, 2014(9):163-166.
[30] ABRAMO G, D ANGELO, FELICI G. Predicting publication long-term impact through a combination of early citations and journal impact factor[J].Journal of informetrics,2019,13(1):32-49.
[31] STEGEHUIS C, LITVAK N, WALTMAN L. Predicting the long-term citation impact of recent publications[J].Journal of informetrics,2015, 9(3):642-657.
[32] Newman M E J. The first-mover advantage in scientific publication[J]. Europhysics letters,2009,86(6):68001.
[33] Newman M E J. Prediction of highly cited papers[J].Europhysics letters,2014,105(2):28002.
[34] WANG M, WANG Z, CHEN G. Which can better predict the future success of articles? Bibliometric indices or alternative metrics[J].Scientometrics,2019,119(3):1575-1595.
[35] BHAT H S, HUANG L H, RODRIGUEZ S, et al. Citation prediction using diverse features[C]//IEEE international conference on data mining workshop,USA:IEEE, 2015:589-596.
[36] CAO X, CHEN Y, RAY LIU K J. A data analytic approach to quantifying scientific impact[J].Journal of informetrics,2016,10(2):471-484.
[37] ABRISHAMI A, ALIAKBARY S. Predicting citation counts based on deep neural network learning techniques[J].Journal of informetrics,2019,13(2):485-499.
[38] 吴智勇. 学术论文排序预测算法研究[D].内蒙古:内蒙古大学,2015.
[39] POBIEDINA N, ICHISE R. Citation count prediction as a link prediction problem[J].Applied intelligence,2016,44(2):252-268.
[40] CHEN C. Predictive effects of structural variation on citation counts[J].Journal of the Association for information science and technology,2014,63(3):431-449.
[41] 于志涛,牟晓青.文献科学计量陈氏预测指标及其应用述评[J].图书馆论坛,2013,33(4):32-41.
[42] 白晓梅. 基于社会网络分析的学术影响力评估与预测[D].大连:大连理工大学,2017.
[43] SAYYADI H, GETOOR L. FutureRank:ranking scientific articles by predicting their future pagerank[C]//Proceedings of the SIAM international conference on data mining, USA:Society for industrial and applied mathematics,2009:533-544.
[44] 刘大有,薛锐青,齐红.基于作者权威值的论文价值预测算法[J].自动化学报,2012,38(10):1654-1662.
[45] 樊玮,韩佳宁,张宇翔.基于网络表示学习的论文影响力预测算法[J/OL].计算机工程.[2019-06-15].https://doi.org/10.19678/j.issn.1000-3428.0053395.
[46] 中国人工智能学会.机器学习白皮书.[EB/OL].[2019-05-20].http://www.caai.cn/index.php?s=/home/article/detail/id/49.html.
[47] 沈雷.基于学术网络的新论文影响力预测[D].济南:山东大学,2018.
[48] WANG S, XIE S, ZHANG X, et al. Coranking the future influence of multiobjects in bibliographic network through mutual reinforcement[J].ACM transactions on intelligent systems and technology,2016,7(4):1-28.
[49] 曾玮.文献排名预测算法及作者影响力评估算法研究[D].重庆:西南大学,2014.
[50] 杜建,武夷山."睡美人"文献的重要特征、预测线索与政策启示[J].科学学研究,2018,36(11):1938-1945.
文章导航

/