REVIEW & COMMENTARY

Theoretical Exploration and Practical Progress of Software Citation and Its Specification

  • Ding Jingda,
  • Zheng Qiao,
  • Liu Chao
  • Department of Library, Information and Archives, Shanghai University, Shanghai 200444

Received date: 2020-12-28

Revised date: 2021-03-01

Online published: 2021-06-10

Abstract

[Purpose/significance] This paper surveys the current state of software citation in theory and practice, analyzes the difficulties and problems that remain, and aims to promote the establishment of software citation norms and standards. [Method/process] A literature review shows that software citation faces four challenges: acceptance culture, reward system, citation cognition, and metadata. The paper then analyzes how theoretical and practical work on software citation and its specification, in China and abroad, has responded to these challenges. [Result/conclusion] Several developments have laid a good foundation for establishing software citation norms: stakeholder analysis of software citation; theoretical exploration of software citation principles and metadata standards; citation guidelines from international research organizations and communities; support from related projects and databases; personnel training; implementation of contribution attribution and reward schemes; and the provision of software citation files. Stakeholders nevertheless still need to work together to overcome the remaining difficulties and challenges of software citation.

Cite this article

Ding Jingda, Zheng Qiao, Liu Chao. Theoretical Exploration and Practical Progress of Software Citation and Its Specification[J]. Library and Information Service, 2021, 65(11): 143-152. DOI: 10.13266/j.issn.0252-3116.2021.11.015
