知识组织

科研实体名称规范的关联数据模型构建

  • 周毅 ,
  • 张建勇 ,
  • 刘峥 ,
  • 刘秀敏
展开
  • 中国科学院文献情报中心, 北京, 100190
周毅(ORCID:0000-0002-1494-6716),馆员,硕士;张建勇(ORCID:0000-0001-7533-1726),研究馆员,硕士;刘峥(ORCID:0000-0002-2494-436X),副研究馆员,博士,通讯作者,E-mail:liuz@mail.las.ac.cn;刘秀敏(ORCID:0000-0001-6014-9614),馆员,硕士。

收稿日期: 2019-11-25

  修回日期: 2020-02-19

  网络出版日期: 2020-05-20

基金资助

本文系国家科技图书文献中心(NSTL)资助项目"名称规范数据库建设"(项目编号:科1817)研究成果之一。

Research on the Construction of Linked Data Model for Research Entity's Name Authority Data

  • Zhou Yi ,
  • Zhang Jianyong ,
  • Liu Zheng ,
  • Liu Xiumin
Expand
  • National Science Library, Chinese Academy of Sciences, Beijing 100190

Received date: 2019-11-25

  Revised date: 2020-02-19

  Online published: 2020-05-20

摘要

[目的/意义] 旨在研究将国家科技图书文献中心(National Science and Technology Library,NSTL)的科研实体名称规范数据发布为关联数据的难点——关联数据的数据模型。科研实体名称规范数据的数据模型研究,有助于NSTL科研实体数据的共享、互联、质量提升,融入到互联网中,同时也为其他机构使用、发布关联数据提供模型参考。[方法/过程] 首先,分析比较国内外关联数据发布项目中所采用的数据模型,发现关联数据发布项目中的数据模型主要分为以Schema.org为核心和多种标准词表组合两类;结合NSTL名称规范数据的特点,设计两种形式的关联数据模型,并从关联数据模型对名称规范数据的表达程度、模型复杂度等角度进行比较,选择较优方案;最后以D2RQ为工具进行实验,将NSTL名称规范的样例数据发布为关联数据。[结果/结论] 分析发现两种方案中以Schema.org为核心标准词表的方案相对于多种标准词表组合的方案有较优的表达完整度、较低的模型复杂度,更易于融入互联网,因此更适合作为NSTL名称规范数据的关联数据模型。

本文引用格式

周毅 , 张建勇 , 刘峥 , 刘秀敏 . 科研实体名称规范的关联数据模型构建[J]. 图书情报工作, 2020 , 64(10) : 109 -117 . DOI: 10.13266/j.issn.0252-3116.2020.10.012

Abstract

[Purpose/significance] The purpose of this paper is to study the linked data model of publishing the NSTL’s research entity name authority data as linked data. After the name authority data is published as linked data, it can be reused as an open linked data set by other system or organization, and also can be better integrated with other linked data sets to improve data quality. In addition, it also provides a model building reference for other organizations to publish authority data as linked data. [Method/process] First, this paper analyzed and compared the data models used in the linked data publishing projects at home and abroad. It showed that the data models in the linked data publishing projects were mainly divided into two categories. Then, combined with the characteristics of NSTL name authority data, two forms of linked data models were designed. It compared the two models from the expression level of the NSTL’s data and the complexity of the models. The better one was selected. Finally, it used D2RQ as tool to publish the sample data as linked data. [Result/conclusion] The analysis found that the model with Schema.org as the core standard vocabulary has better performance. So it is more suitable as a linked data model for NSTL’s name authority data.

参考文献

[1] 曾建勋.基于海量数字资源的科研关系网络构建探究[J].情报学报,2013,32(9):929-935.
[2] 刘炜, 张春景, 夏翠娟. 万维网时代的规范控制[J]. 中国图书馆学报, 2015(3):2-33.
[3] VIAF[EB/OL].[2019-06-25].https://www.oclc.org/zh-Hans/viaf.html.
[4] THOMAS B H, JEFFREY A Y. Description of the VIAF (virtual international authority file) dataset[EB/OL].[2019-06-25]. http://www.semantic-web-journal.net/sites/default/files/swj294.pdf.
[5] VIAF. RDF changes[EB/OL].[2019-06-25].https://outgoing.typepad.com/outgoing/2015/04/viaf-rdf-changes.html.
[6] OCLC adds linked data to WorldCat.org[EB/OL].[2019-06-25].http://www.oclc.org/news/releases/2012/201238.en.html.
[7] Versionshistorie des linked-data-servicE[EB/OL].[2019-11-22].https://www.dnb.de/DE/Professionell/Metadatendienste/Datenbezug/LDS/lds_versionshistorie.html?nn=250612.
[8] GND ontology[EB/OL].[2019-06-25].https://d-nb.info/standards/elementset/gnd#CharactersOrMorphemes.
[9] Gemeinsame normdatei (GND)[EB/OL].[2019-06-25].https://lod-cloud.net/dataset/dnb-gemeinsame-normdatei.
[10] XIA C J, LIU W. Name authority control in digital humanities:building a name authority database of Shanghai Library[J]. International journal of libraryship 2018,3(1):21-35.
[11] 上海图书馆.人名规范库本体 (shlnames)[EB/OL].[2019-06-25].http://data.library.sh.cn/ont/ontology/tree?g=http://ont.library.sh.cn/graph/shlnames.
[12] WOODS A.Source ontologies for VIVO[EB/OL].[2019-06-25].https://wiki.duraspace.org/display/VIVODOC110x/Source+ontologies+for+VIVO.
[13] NATIONAL LIBRARY OF HUNGARY. National széchényi library (national library of Hungary) on the semantic Web[EB/OL].[2019-06-25].http://nektar.oszk.hu/wiki/Semantic_web.
[14] HANSON E M.A beginner"s guide to creating library linked data:lessons from ncsüs organization name linked data project[J].Serials review,2014,40(4):251-258.
[15] 胡小菁. 文献编目:从数字化到数据化[J]. 中国图书馆学报, 2019, 45(3):49-61.
[16] GOOGLE,YAHOO,MICROSOFT,et al.Schema.org[EB/OL].[2019-10-10].https://schema.org/.
[17] 张雪松, 谈海蓉, 姚湘中. 网络书目资源描述规范SchemaBibEx及其应用[J]. 图书馆杂志, 2016 (5):67-75.
[18] 贾君枝, 石燕青. 中文个人名称规范文档的关联数据化研究[J]. 情报学报, 2016, 35(7):696-703.
[19] 李柏炀.基于关联数据的科研关系揭示研究[D].长春:东北师范大学,2016.
[20] NOGALES A M, ELENA G B.Measuring vocabulary use in the linked data cloud[J].Online information review,2017,41(2):252-271.
[21] IANNELLA R, MCKINNEY J, vCard ontology-for describing people and organizations[EB/OL].[2019-06-25].https://www.w3.org/TR/vcard-rdf/.
[22] D'ARCUS B, GIASSON F. Bibliographic ontology specification[EB/OL].[2019-06-25]. http://bibliontology.com/.
[23] HYLAND B, ATEMEZING G, TERRAZAS V B. Best practices for publishing linked data[EB/OL].[2019-05-23].http://www.w3.org/TR/ld-bp/.
[24] 伍德,扎伊德曼,鲁思,等.关联数据:万维网上的结构化数据[M].蒋楠,译.北京:人民邮电出版社,2018:3.
[25] BIZER C,CYGANIAK R. D2R server-publishing relational databases on the semantic[EB/OL].[2019-07-13].http://richard.cyganiak.de/2008/papers/d2r-server-iswc2006.pdf.
文章导航

/