收稿日期: 2015-05-21
修回日期: 2015-06-20
网络出版日期: 2015-07-05
Research on Construction of Metadata Model for Scientific Experimental Data: An Example as Gene Expression Experiment of Botany
Received date: 2015-05-21
Revised date: 2015-06-20
Online published: 2015-07-05
[目的/意义] 科学实验数据组织现状混乱、丢失现象频繁,严重阻碍科学数据的保存、复用以及公开获取,因此构建规范的科学实验元数据模型对实验数据的组织、保存、检索、复用等有重大的现实意义。[方法/过程] 首先通过文献调研对现有科学实验元数据集进行总结;其次以植物学基因表达实验为例,通过用户访谈对实验室数据组织现状以及实验操作流程、特点等进行调查总结,初步构建植物学基因表达实验元数据描述方案;最后通过德尔菲法对元数据元素集进行打分、评估、筛选、确立。[结果/结论] 构建基于科学实验数据生命周期的植物学基因表达实验元数据模型,能够完整描述包含实验设计、实验数据等在内的科学实验基础信息,同时包括科研成果、数据访问等信息;基于该元数据模型不仅便于科学实验数据的组织,还有利于科学实验数据公开获取以及科研成果的追溯,为不同类型科学数据语义化关联提供支撑。
常颖聪 , 何琳 . 科学实验数据元数据模型构建研究——以植物学基因表达实验为例[J]. 图书情报工作, 2015 , 59(13) : 117 -125 . DOI: 10.13266/j.issn.0252-3116.2015.13.017
[Purpose/significance] The scientific experimental data lacks of suitable and effective organization and loses easily and frequently, which seriously hinders preserving, reusing and publicly accessing the scientific data. So to build the normative scientific experimental data metadata model is of great significance to organize, preserve, retrieval and reuse experimental data.[Method/process] Firstly, this paper summarizes the existing scientific experimental metadata models by literature investigation. Secondly, it establishes a metadata description scheme of botany gene expression experiment, by investigating and summarizing organization condition of experimental data, operating process and characteristic of experiment through expert interviews. Lastly, it scores, evaluates, selects and determines the metadata factors by Delphi method.[Result/conclusion] This paper builds a metadata model for botany gene expression experiment based on scientific experimental data lifecycle. The model can completely describe the basic information such as experiment design and data, and other information such as scientific achievements and data access. The metadata model not only can help organize the scientific experimental data, but also publicly access experimental data and review achievements, to provide support for semantic association of different types of scientific data.
[1] Besiki S, Hinnant C C, Wu Shuheng, et al. Research project tasks,data, and perceptions of data quality in a condensed matter physics community[J].Journal of The Association For Information Science And Technology,2015,66(2):246-263.
[2] Online scientific data curation, publication and archiving[EB/OL].[2015-04-13].http://arxiv.org/ftp/cs/papers/0208/0208012.pdf.
[3] Data curation for e-Science in the UK:An audit to establish requirements for future curation and provision[EB/OL].[2015-04-13]. http://www.jisc.ac.uk/uploaded_documents/e-ScienceReportFinal.pdf.
[4] DCMI metadata terms[EB/OL].[2015-04-13]. http://dublincore.org/documents/dcmi-terms/#terms-abstract.
[5] Requirements specification for the sharing sensitive scientific data test bed[EB/OL].[2015-04-13].http://www.consequence-project.eu/Deliverables_Y1/D6.1.pdf.
[6] The Ontology for biomedical investigations[EB/OL].[2015-04-14]. http://obi-ontology.org/page/Main_Page.
[7] eBank UK: Linking research data, scholarly communication and learning[EB/OL].[2015-04-14].http://eprints.soton.ac.uk/8183/1/eBank_AHM.pdf.
[8] EXPO[EB/OL].[2015-05-13]. http://expo.sourceforge.net/.
[9] Provenance of microarray experiments[EB/OL].[2015-05-13].http://biordfmicroarray.googlecode.com/hg/sparql_endpoint.html.
[10] Lord P, Macdonald A.e-Science curation report: Data curation for e-Science in the UK: An audit to establish requirements for future curation and provision[M].London:Digital Archiving Consultancy Limited,2003.
[11] EPSRC.Research funding policies[EB/OL].[2015-04-13]. http://www.dcc.ac.uk/resources/policy-and-legal/research-funding-policies/epsrc.
[12] Shreeves S, Cragin M. Introduction: Institutonal repositories: Current state and future[J]. Library Trends,2008(2): 89- 97.
[13] The bibliographic ontology[EB/OL].[2015-05-13]. http://bibliontology.com/specification.
[14] 陈慧子,束漫.公共图书馆志愿者服务机制研究——基于德尔菲法的调查分析[J].图书馆建设,2012(4):82-86.
[15] 樊长军,张馨,连宇江,等.基于德尔菲法的高校图书馆公共服务能力指标体系构建[J].情报杂志,2011,30(3):97-100.
[16] 苏学.期刊论文学术水平定量评价指标体系的初步设计[J].情报探索,2010(5):7-9.
[17] White H C.Descriptive metadata for scientific data repositories: A comparison of information scientist and scientist organizing behaviors[J]. Journal of Library Metadata.2014(14):24-51.
[18] Greenberg J, Spurgin K, Crystal A. Functionalities for automatic metadata generation applications: A survey of metadata experts' opinions[J]. International Journal of Metadata, Semantics, and Ontologies,2006, 1(1):3-20.
[19] 欧石燕.面向关联数据的语义数字图书馆资源描述与组织框架设计与实现[J].中国图书馆学报,2012,38(6):58-71.
[20] Heath T, Bizer C. Linked data: Evolving the Web into a global data space[M].San Rafael:Morgan & Claypool,2011.
/
| 〈 |
|
〉 |