Library and Information Service (图书情报工作) ›› 2022, Vol. 66 ›› Issue (16): 92-104. DOI: 10.13266/j.issn.0252-3116.2022.16.009

• Information Research •

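  • Corresponding author: Yang Bo (杨波), professor, PhD, doctoral supervisor, E-mail: boyang@njau.edu.cn
  • About the authors: Qiu Yuhong (邱玉红), master's student; Jiao Hong (焦红), doctoral student.
  • Funding:
    This paper is one of the research outcomes of the National Social Science Fund of China general project "Research on Self-Organization Modes and Quality Evaluation of Scientific Datasets" (Grant No. 18BTQ077).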

Research on Data Citation Behaviors of Researchers in the Field of Biomedicine Under Multiple Data Publishing Modes

Qiu Yuhong, Jiao Hong, Yang Bo   

  1. School of Information Management, Nanjing Agricultural University, Nanjing 210095
  • Received: 2022-02-14  Revised: 2022-05-12  Online: 2022-08-20  Published: 2022-08-19

Abstract: [Purpose/Significance] By analyzing the characteristics and sources of the datasets that researchers cite under multiple data publishing modes, this paper reveals the current state and development patterns of data citation in the biomedical field, in order to support decisions that promote the publication and efficient use of scientific data. [Method/Process] Taking open access articles in the biomedical database PubMed Central (PMC) as a sample, data citation behaviors were identified from the association between the datasets that researchers mention in a paper and the datasets they cite. The characteristics and trends of these behaviors were then analyzed in depth from two perspectives: the characteristics of the cited datasets and the characteristics of the dataset citation sources. [Result/Conclusion] Data sharing and reuse are common in biomedicine, but formal data citation is relatively rare. Researchers' awareness of data citation is strongly influenced by the data citation policies of data repositories, and researchers favor novel, continuously updated datasets. Citing a regular article is the main way researchers indicate the source of their data, followed by citing a data repository. Data papers emerged late as a data citation source but are growing rapidly. In the future, data repositories and data papers, as two data publishing modes, are expected to become major data citation sources.

Key words: biomedicine, scientific datasets, data publishing, data citation, data paper

CLC number: