Research on Academic Paper Citation Prediction

  • Xia Wanjun ,
  • Chen Xiaohong ,
  • Jiang Yanping
Expand
  • 1. Library of Southwest Jiaotong University, Chengdu 611756;
    2. School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756

Received date: 2019-07-02

  Revised date: 2019-09-19

  Online published: 2020-03-20

Abstract

[Purpose/significance] This paper summarizes the influencing factors and prediction methods of academic paper citation, analyzes the existing problems and proposes the future development directions.[Method/process] This paper used the literature research method to review the research progress of academic papers at home and abroad, and summarized the relevant content and characteristics of influencing factors and prediction methods.[Result/conclusion] There are many indicators of influencing factors, but there is no unified selection criteria. The theoretical basis of prediction methods is weak. The research on dynamics of citation prediction is insufficient. The generality of prediction models is limited. In the future, we should strengthen the theoretical research of citation prediction methods, the combination of traditional bibliometrics and alternative metrics, the deep application of natural language processing, and establish a unified baseline standard, a more accurate prediction model.

Cite this article

Xia Wanjun , Chen Xiaohong , Jiang Yanping . Research on Academic Paper Citation Prediction[J]. Library and Information Service, 2020 , 64(6) : 138 -145 . DOI: 10.13266/j.issn.0252-3116.2020.06.016

References

[1] 耿骞,景然,靳健,等.学术论文引用预测及影响因素分析[J].图书情报工作,2018,62(14):29-40.
[2] YANG L, ZHANG Z, CAI X, et al. Citation recommendation as edge prediction in heterogeneous bibliographic network:a network representation approach[J]. IEEE Access, 2019,7:23232-23239.
[3] 鲍玉芳,马建霞.科学论文被引频次预测的现状分析与研究[J].情报杂志,2015,34(5):66-71.
[4] STEWART J A. Achievement and ascriptive processes in the recognition of scientific articles[J].Socialforces,1983,62(1):166-189.
[5] WILLIS D L, BAHLER C D, NEUBERGER M M, et al.Predictors of citations in the urological literature[J].BJUinternational,2011,107(12):1876-1880.
[6] KOSTEAS V D.Predicting long-run citation counts for articles in top economics journals[J].Scientometrics,2018,115(3):1395-1412.
[7] CHAKRABORTY T,KUMAR S,GOYAL P,et al.Towards a stratified learning approach to predict future citation counts[C]//2014 IEEE/ACM joint conference on digital libraries (JCDL).London:IEEE computer society,2014:351-360.
[8] ANTONIOU G A,ANTONIOU S A,GEORGAKARAKOS E I,et al.Bibliometric analysis of factors predicting increased citations in the vascular and endovascular literature[J].Annals of vascular surgery,2015,29(2):286-292.
[9] FU L D, ALIFERIS C F. Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature[J]. Scientometrics,2010,85(1):257-270.
[10] YU T, YU G, LI P Y, et al. Citation impact prediction for scientific papers using stepwise regression analysis[J].Scientometrics,2014,101(2):1233-1252.
[11] HASLAM N, BAN L, KAUFMANN L, et al. What makes an article influential? Predicting impact in social and personality psychology[J]. Scientometrics,2008,76(1):169-185.
[12] ROTHC, Wu J, Lozano S. Assessing impact and quality from local dynamics of citation networks[J]. Journal of informetrics,2013,6(1):111-120.
[13] DIDEGAH F, THELWALL M. Which factors help authors produce the highest impact research? Collaboration, journal and document properties[J].Journal of informetrics,2013,7(4):861-873.
[14] SUBOTIC S, MUKHERJEE B. Short and amusing:the relationship between title characteristics, downloads, and citations in psychology articles[J]. Journal of information science, 2014, 40(1):115-124.
[15] SOHRABI B, IRAJ H. The effect of keyword repetition in abstract and keyword frequency per journal in predicting citation counts[J].Scientometrics,2017, 110(1):243-251.
[16] YAN R, TANG J, LIU X, et al. Citation count prediction:learning to estimate future citations for literature.[C]//ACM international conference on information & knowledge management. Glasgow:ACM,2011:1247-1252.
[17] TAHAMTAN I, AFSHAR A S, AHAMDZADEH K. Factors affecting number of citations:a comprehensive review of the literature[J].Scientometrics,2016,107(3):1195-1225.
[18] BORNMANN L, LEYDESDORFF L, WANG J. How to improve the prediction based on citation impact percentiles for years shortly after the publication date?[J].Journal of informetrics,2014,8(1):175-180.
[19] 张美平, 尚明生. 基于持续关注度衰减的重要论文预测[J]. 复杂系统与复杂性科学, 2015, 12(3):77-84.
[20] BORNMANN L, DANIEL H D. Citation speed as a measure to predict the attention an article receives:An investigation of the validity of editorial decisions at AngewandteChemie International Edition[J]. Journal of informetrics, 2010, 4(1):83-88.
[21] 熊泽泉,段宇锋.论文早期下载量可否预测后期被引量?——以图书情报领域期刊为例[J].图书情报知识,2018(4):32-42.
[22] SHEMA H, BAR-ILAN J, THELWALL M. Do blog citations correlate with a higher number of future citations? Research blogs as a potential source for alternative metrics[J].Journal of the Association for Information Science and Technology,2014,65(5):1018-1027.
[23] PEOPLES B K, MIDWAY S R, SACKETT D, et al. Twitter predicts citation rates of ecological research[J].Plos One,2016,11(11):e0166570.
[24] ZOLLER D, DOERFEL S, JÄSCHKE R, et al. Posted, visited, exported:Altmetrics in the social tagging system bibsonomy[J].Journal of informetrics,2016,10(3):732-749.
[25] THELWALL M, NEVILL T. Could scientists use Altmetric.com scores to predict longer term citation counts?[J].Journal of informetrics,2018,12(1):237-248.
[26] WANG D, SONG C, Barabasi A L. Quantifying long-term scientific Impact[J]. Science, 2013, 342(6154):127-132.
[27] ONODERA N, YOSHIKANE F. Factors affecting citation rates of research articles[J].Journal of the Association for Information Science and Technology,2015,66(4):739-764.
[28] 余厚强,邱均平.论替代计量学在图书馆文献服务中的应用[J].情报杂志, 2014(9):163-166.
[30] ABRAMO G, D ANGELO, FELICI G. Predicting publication long-term impact through a combination of early citations and journal impact factor[J].Journal of informetrics,2019,13(1):32-49.
[31] STEGEHUIS C, LITVAK N, WALTMAN L. Predicting the long-term citation impact of recent publications[J].Journal of informetrics,2015, 9(3):642-657.
[32] Newman M E J. The first-mover advantage in scientific publication[J]. Europhysics letters,2009,86(6):68001.
[33] Newman M E J. Prediction of highly cited papers[J].Europhysics letters,2014,105(2):28002.
[34] WANG M, WANG Z, CHEN G. Which can better predict the future success of articles? Bibliometric indices or alternative metrics[J].Scientometrics,2019,119(3):1575-1595.
[35] BHAT H S, HUANG L H, RODRIGUEZ S, et al. Citation prediction using diverse features[C]//IEEE international conference on data mining workshop,USA:IEEE, 2015:589-596.
[36] CAO X, CHEN Y, RAY LIU K J. A data analytic approach to quantifying scientific impact[J].Journal of informetrics,2016,10(2):471-484.
[37] ABRISHAMI A, ALIAKBARY S. Predicting citation counts based on deep neural network learning techniques[J].Journal of informetrics,2019,13(2):485-499.
[38] 吴智勇. 学术论文排序预测算法研究[D].内蒙古:内蒙古大学,2015.
[39] POBIEDINA N, ICHISE R. Citation count prediction as a link prediction problem[J].Applied intelligence,2016,44(2):252-268.
[40] CHEN C. Predictive effects of structural variation on citation counts[J].Journal of the Association for information science and technology,2014,63(3):431-449.
[41] 于志涛,牟晓青.文献科学计量陈氏预测指标及其应用述评[J].图书馆论坛,2013,33(4):32-41.
[42] 白晓梅. 基于社会网络分析的学术影响力评估与预测[D].大连:大连理工大学,2017.
[43] SAYYADI H, GETOOR L. FutureRank:ranking scientific articles by predicting their future pagerank[C]//Proceedings of the SIAM international conference on data mining, USA:Society for industrial and applied mathematics,2009:533-544.
[44] 刘大有,薛锐青,齐红.基于作者权威值的论文价值预测算法[J].自动化学报,2012,38(10):1654-1662.
[45] 樊玮,韩佳宁,张宇翔.基于网络表示学习的论文影响力预测算法[J/OL].计算机工程.[2019-06-15].https://doi.org/10.19678/j.issn.1000-3428.0053395.
[46] 中国人工智能学会.机器学习白皮书.[EB/OL].[2019-05-20].http://www.caai.cn/index.php?s=/home/article/detail/id/49.html.
[47] 沈雷.基于学术网络的新论文影响力预测[D].济南:山东大学,2018.
[48] WANG S, XIE S, ZHANG X, et al. Coranking the future influence of multiobjects in bibliographic network through mutual reinforcement[J].ACM transactions on intelligent systems and technology,2016,7(4):1-28.
[49] 曾玮.文献排名预测算法及作者影响力评估算法研究[D].重庆:西南大学,2014.
[50] 杜建,武夷山."睡美人"文献的重要特征、预测线索与政策启示[J].科学学研究,2018,36(11):1938-1945.
Outlines

/