KNOWLEDGE ORGANIZATION

Research and Implementation of Linking Services Between Scientific Data and Academic Literature

  • Huang Yongwen ,
  • Sun Tan ,
  • Zhao Ruixue ,
  • Xian Guojian ,
  • Li Jiao ,
  • Luo Tingting
Expand
  • 1. Agricultural Information Institution, Chinese Academy of Agricultural Sciences, Beijing 100081;
    2. Chinese Academy of Agricultural Sciences, Beijing 100081;
    3. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081

Received date: 2021-08-01

  Revised date: 2021-10-21

  Online published: 2021-12-18

Abstract

[Purpose/significance] To meet researchers' increasing demands for scientific data retrieval and discovery, this study conducts a research on the improvement of the metadata of scientific data and further realizes the in-depth linking discovery between scientific data and academic literature.[Method/process] Based on the investigating the methods and experiences of linking service, this study proposed a system architecture of scientific data retrieval and linking services, and realized collection and integration of academic resource metadata, enrichment and enhancement of scientific data metadata, and retrieval and linking discovery services of scientific data.[Result/conclusion] The improvement of the quality of scientific data metadata can support deeper and more fine-grained semantic linking services between scientific data and academic literature, and help users discover scientific data and its associated academic literature.

Cite this article

Huang Yongwen , Sun Tan , Zhao Ruixue , Xian Guojian , Li Jiao , Luo Tingting . Research and Implementation of Linking Services Between Scientific Data and Academic Literature[J]. Library and Information Service, 2021 , 65(23) : 116 -125 . DOI: 10.13266/j.issn.0252-3116.2021.23.013

References

[1] European Open Science Cloud[EB/OL].[2020-10-20].https://eosc-portal.eu/about/eosc.
[2] BURTON A, KOERS H,MANGHI P, et al. The Scholix framework for interoperability in data-literature information exchange[J/OL].[2020-10-20].http://www.dlib.org/dlib/january17/burton/01burton.html.
[3] WILKINSON M D, DUMONTIER M, AALBERSBERG I J J, et al. The FAIR guiding principles for scientific data management and stewardship[J].Scientific data, 2016:160018.https://doi.org/10.1038/sdata.2016.18.
[4] DataCite metadata schema[EB/OL].[2020-10-20].https://schema.datacite.org/.
[5] Elixir. Interoperability platform[EB/OL].[2020-10-20]. https://elixir-europe.org/platforms/interoperability.
[6] Elsevier. Trust in research[EB/OL].[2020-10-20]. https://www.elsevier.com/connect/trust-in-research.
[7] Discovering associated data in PMC[EB/OL].[2020-10-20].https://ncbiinsights.ncbi.nlm.nih.gov/2018/11/15/discovering-associated-data-in-pmc/.
[8] Elsevier. Linking research data and research articles on ScienceDirect[EB/OL].[2020-10-20].https://www.elsevier.com/authors/tools-and-resources/research-data/data-base-linking.
[9] Web of Science. Data Citation Index[EB/OL].[2020-10-20].https://clarivate.com/webofsciencegroup/solutions/webofscience-data-citation-index/.
[10] Scopus. Data linking[EB/OL].[2021-03-10]. https://blog.scopus.com/topics/data-linking.
[11] Dimensions. Linked research data from idea to impact[EB/OL].[2021-03-10].https://www.dimensions.ai/.
[12] Elixir data platform[EB/OL].[2021-03-10].https://elixir-europe.org/platforms/data.
[13] GARCIA-HERNANDEZ M, BERARDINI T Z, CHEN G H, et al. TAIR:a resource for integrated Arabidopsis data. Functional & integrative genomics, 2002,2(6):239-253.
[14] SULLIVAN D. A reintroduction to our knowledge graph and knowledge panels[EB/OL].[2020-10-20].https://www.blog.google/products/search/about-knowledge-graph-and-knowledge-panels/.
[15] OpenAIRE.OpenAIRE-research graph[EB/OL].[2020-10-20].https://graph.openaire.eu.
[16] RD-Switchboard[EB/OL].[2020-10-20].https://www.rd-switchboard.org.
[17] 杨宁,文奕,张鑫,等.高能物理科学数据与科技文献关联研究[J].图书馆学研究,2019(1):47-52.
[18] 姜恩波, 裴玉香.科学文献与科学数据的融合方法与实例研究[J].知识管理论坛,2019,4(2):69-79.
[19] 涂勇,彭洁.基于DOI技术的科学数据与科技文献融合的研究[J].数字图书馆论坛,2007(10):28-31.
[20] 孙文佳,常娥.科学数据与科技文献关联分析[J].图书馆理论与实践,2017(3):49-53.
[21] 朱江,李欣怡,姜恩波,等.基于ISLI标准的科技文献和科学数据的关联[J].图书馆理论与实践,2020(5):80-83,91.
[22] KRAFT A,DREYER B, LOWE P, et al.14 Years of PID services at the German National Library of Science and Technology (TIB):connected frameworks, research data and lessons learned from a national research library perspective[J]. Data science journal, 2017,16(36):1-10.
[23] 郭学武.基于引文的科学数据与科技文献关联研究[J].情报科学,2014,32(4):59-62.
[24] 张鑫,文奕,杨宁,等.基于引文探针的文献与数据的关联算法与应用——以高能物理领域为例[J].情报理论与实践,2019,42(10):151-156.
[25] RIEDEL N,KIP M,BOBROV E. ODDPub- a text-mining algorithm to detect data sharing in biomedical publications[J].Data science journal,2020,19(42):1-14.
[26] HOU L L, ZHANG J, WU O, et al. Method and dataset entity mining in scientific literature:a CNN + Bi-LSTM model with self-attention[EB/OL].[2021-10-08].https://arxiv.org/abs/2010.13583.
[27] GHAVIMI B, MAYR P, LANGE C. et al. A semi-automatic approach for detecting dataset references in social science texts[J]. Information services & use,2016, 36(3/4):171-187.
[28] 孙志茹, 韩涛, 杨文. 生物信息学科学数据与科学文献的关联关系分析[J]. 图书情报工作, 2008,52(2):88-91.
[29] 黄筱瑾.基于元数据的科学数据与科技文献关联研究[J].情报理论与实践,2013,36(7):27-30.
[30] 黄筱瑾.基于内容特征的科学数据与科技文献关联研究[J].现代情报,2018,38(1):56-59.
[31] 贺姝祎,魏韧,吴茂春,等.科技文献与观测数据的关联性在天文领域的应用研究[EB/OL].[2021-09-10]. https://d.wanfangdata.com.cn/conference/8469846.
[32] 卫军朝.科学文献与科学数据关联实践研究——以Elsevier为例[J].国家图书馆学刊,2017,26(3):93-101.
[33] 孙巍.科学数据与科技文献关联发现系统研究与实现[EB/OL].[2021-09-10]. https://d.wanfangdata.com.cn/conference/7611510.
[34] 丁培.科学文献与科学数据细粒度语义关联研究[J].图书馆论坛,2016,36(7):24-33.
[35] CLARK T. Argument graphs:literature-data integration for robust and reproducible science[EB/OL].[2021-01-20]. http://www.isi.edu/ikcap/sciknow2015/papers/Clark.pdf.
[36] COUSIJIN H,HAAK W, KOERS H. Finding better ways to connect research data with scientific literature[EB/OL].[2021-01-20].https://www.elsevier.com/connect/finding-better-ways-to-connect-research-data-with-scientific-literature.
[37] AALBERSBERG I J, KAHLER O. Supporting Science through the Interoperability of Data and Articles[EB/OL].[2021-10-08]. http://www.dlib.org/dlib/january11/aalbersberg/01aalbersberg.html.
[38] DataCite Search[EB/OL].[2021-03-10].https://search.datacite.org/.
[39] Google Dataset Search[EB/OL].[2021-03-10]. https://datasetsearch.research.google.com/.
[40] ScholeXplore[EB/OL].[2021-03-10].https://scholexplorer.openaire.eu/.
[41] OpenAIRE explore[EB/OL].[2021-03-10].https://explore.openaire.eu/.
[42] DRYAD[EB/OL].[2021-03-10].Our platform. https://datadryad.org/stash/our_platform.
[43] Elsevier and PANGAEA Link Contents for easier access to full earth system research[EB/OL].[2021-03-10]. https://www.elsevier.com/about/press-releases/science-and-technology/elsevier-and-pangaea-link-contents-for-easier-access-to-full-earth-system-research.
[44] HEPData[EB/OL].[2021-03-10].https://www.hepdata.net/.
[45] STM. Research data share-link-cite[EB/OL].[2020-10-20]. https://www.stm-researchdata.org/.
[46] 李娇,黄永文,罗婷婷,等.基于多因子算法的自动分类研究[J].数据分析与知识发现,2020,4(11):43-51.
Outlines

/