[目的/意义]调研和分析国外Data Commons(数据共享空间)的数据管理模式,为建设我国的数据共享空间提供借鉴。[方法/过程]通过梳理、归纳国内外数据共享空间的发展现状,对比和分析二者之间差距,并以美国INRG数据共享空间为例,从原则与协议、数据库与用户接口以及数据标识与关联等方面剖析其数据空间管理模式,为我国数据共享空间的建设及发展提出策略。[结果/结论]结合案例和我国数据共享平台现状,从总体规划、建设目标、要解决的问题、DC总体架构和用户服务等方面提出具体建议。
[Purpose/significance] This paper investigated and analyzed the data management mode of data commons in foreign countries, to promote the research and practice of data management services in China.[Method/process] By combing and summarizing the development state of data commons at home and abroad, comparing and analyzing the gap between the two and taking the US-INRG data commons as an example, from the principle and protocol mode, database and user interface and data identification and association. Analyze its data space management model and propose strategies for construction and development of China data commons.[Result/conclusion] Combining the case and the state of China's data management platform, the paper puts forward specific suggestions such as overall plan, construction goals, problems to be solved, data commons overall architecture and user service so on.
[1] GROSSMAN R L,HEATH A,MURPHY M,et al.A case for data commons:towards data science as a service[J]. Computing in science & engineering,2016,18(5):10-20.
[2] 数据共享空间启动项目[EB/OL].[2019-01-09].http://www.bio-itworld.com/2017/11/07/nih-launches-data-commons-pilot-with-9-projects.
[3] 张先恩.国家科学数据共享工程[J].科学中国人,2004(9):11-13.
[4] 国务院.国务院关于印发促进大数据发展行动纲要的通知[EB/OL].[2019-01-09].http://www.gov.cn/zhengce/content/2015-09/05/content_10137.htm.
[5] National Institutes of Health.Newly launched genomic datacommons to facilitate datavand clinical information sharing[EB/OL].[2019-01-09].http://www.nih.gov/news-events/news-releases/newly-launched-genomic-data-commons-facilitate-data-clinical-information-sharing.
[6] 吴雅威,魏来.国外Data Commons的发展及其构建初探[J].情报资料工作,2017,38(6):41-48.
[7] MOLINARI F,MORELLI N,TORNTOFT L K,et al.OpenDataLabs:new infrastructures for open datacommons[EB/OL].[2019-01-09].https://www.forskningsdatabasen.dk/en/catalog/2372370890.
[8] New Zealand Project.Data commons blueprint[EB/OL].[2019-01-09].http://datacommons.org.nz.
[9] GROSSMAN R L. Data-Commons-Guidelines[EB/OL].[2019-01-09].https://www.healthra.org/wp-content/uploads/2018/08/Data-Commons-Guidelines_Grossman_8_2017.pdf.
[10] FRENCH S P,BARCHERS C V.Designing a data commons for urban big data[EB/OL].[2019-01-09].https://www.rd-alliance.org/final-report-income-streams-data-repositories.html.
[11] VOLCHENBOUM S,HAWKINS D,FRAZIER L,et al.Building pediatric cancer data commons[EB/OL].[2019-01-09].https://ascopubs.org/doi/full/10.1200/EDBK_175029.
[12] VOLCHENBOUM S L,COX S M,HEATH A,et al.Data commons to support pediatric cancer research[J].American Society of Clinical Oncology Educational Book,2017,37(24):746-752.
[13] SANSONE S A,MCQUITON P,ROCCA-SERRA P,et al.FAIR sharing_working_with_and_for_the_community_to_describe_and_link_data_standards_repositories_and_policies[EB/OL].[2019-01-09].https://www.researchgate.net/publication/326462185.
[14] BIZER C,MEUSEL R,PRIMPELI A.The web data commons microdata, RDFa and microformat dataset series[EB/OL].[2019-01-09].https://link.springer.com/chapter/10.1007/978-3-319-11964-918.
[15] PURTOVA N.Health data for common good:defining the boundaries and social dilemmas of data commons[EB/OL].[2019-01-09].http://link.springer.com/chapter/10.1007/978-3-319-48342-910.
[16] MORGAN M,DAVIS S R.Genomic data commons:a bioconductor interface to the NCI genomic data commons[EB/OL].[2019-01-09].https://github.com/seandavi/GenomicDataCommons.
[17] SU Z,BERTAGNOLLI M M,SARTOR A O,et al.A novel,open-access data commons for improved disease management in patients(pts)with Merkel cell carcinoma(MCC)[J].Journal of clinical oncology,2018,36(15):215-255.
[18] SCOTT C,WALTER S,EDDIE S,et al.VDJServer:a cloud-based analysis portal and data commons for immune repertoire sequences and rearrangements[J].Frontiers in immunology, 2018, 9(39):976-1002.
[19] HALPHIN P N,READ A J, BEST B D,et al. OBIS-SEAMAP:developing a biogeographic research data commons for the ecological studies of marine mammals,seabirds,and sea turtles[J].Marine ecology progress series,2006,77(316):239-246.
[20] EVANS B J.Barbarians at the gate:consumer-driven health data commons and the transformation of citizen science[J].American journal of law & medicine,2016,42(4):651-685.
[21] BAMBAUER J Y.Tragedy of the data commons[J].SSRN electronic journal,2011,62(25):120-135.
[22] 宋秀芬,邓仲华.基于数据监护的机构知识库研究[J].图书馆学研究,2016,31(2):44-48.
[23] 覃丹.英美社会科学数据管理与共享服务平台调查分析[J].图书情报工作,2014,58(8):67-75.
[24] 完颜邓邓.澳大利亚高校科学数据管理与共享政策研究[J].信息资源管理学报,2016(1):30-37.
[25] 杨鹤林.从数据监护看美国高校图书馆的机构库建设新思路-来自Data Star的启示[J].大学图书馆学报,2012(2):23-28,73.
[26] 殷沈琴,张计龙,张莹等.社会科学数据管理服务平台系统选型研究——以复旦大学社会科学数据平台为例[J].图书情报工作,2013,57(19):92-96.
[27] 朱玲,聂华,崔海媛,等.北京大学开放研究数据平台建设:探索与实践[J].图书情报工作,2016,60(4):44-51.
[28] 殷沈琴,张计龙,张莹,等.社会科学数据管理服务平台系统选型研究——以复旦大学社会科学数据平台为例[J].图书情报工作,2013,57(19):92-96.
[29] 邓仲华,黄雅婷."互联网+"环境下我国科学数据共享平台发展研究[J].情报理论与实践,2017,40(2):128-132.
[30] 刘兹恒,曾丽莹.我国高校科研数据管理与共享平台调研与比较分析[J].情报资料工作,2017(6):90-95.
[31] 刘桂锋,张裕,刘琼.科研数据开放平台评价指标体系构建及案例研究[J].图书情报知识,2019(1):21-31.
[32] 美国开放数据云联盟[EB/OL].[2019-03-09].www.opensciencedatacloud.org.
[33] 复旦大学数据中心[EB/OL].[2019-03-09].https://dvn.fudan.edu.cn/home/static/profile.jsp.
[34] 北京航空航天大学数据共享平台[EB/OL].[2019-03-09].http://etc.xzit.edu.cn/01/19/c56a281/page.htm.
[35] 清华大学数据共享平台[EB/OL].[2019-03-09].http://www.chinaz.com/news/2016/0105/492077.shtml.
[36] 中国科学院计算机网络信息中心[EB/OL].[2019-03-09].http://www.nsdata.cn/resource/list?code=1803710.
[37] 中国科学院数据共享平台[EB/OL].[2019-03-09].http://www.geodata.cn/.
[38] 武汉大学图书馆数据共享中心[EB/OL].[2019-03-09].http://www.lib.whu.edu.cn/kxsj/aboutus.htm.
[39] 华中科技大学科学数据中心[EB/OL].[2019-03-09].https://cmis.csdc.info/to about.action.
[40] 清华大学经济社会数据中心[EB/OL].[2019-03-09].http://www.sem.tsinghua.edu.cn/sercent/jjshsjzx.html.
[41] 国际神经母细胞瘤风险组.INRG data commons[EB/OL].[2019-03-09].http://europepmc.org/abstract/MED/28561664.
[42] HEATH A P, GREENWAY M, POWELL R,et al. Bionimbus:a cloud for managing, analyzing and sharing large genomics datasets[J]. Journal of the American Medical Informatics Association,2014, 21(6):969-975.
[43] 魏来,高希然.大数据背景下高校数据馆员的角色定位[J].情报资料工作,2015(5):90-94.
[44] MANSELL J,LAKING R,MATHESON B,et al.Data commons blueprint:a high trust,lower cost alternative to enable data integration and reuse[EB/OL].[2019-03-09].http://datacommons.org.nz,2017.