Library and Information Service
The Application and Research of Data Provenance Technology within Long-term Data Preservation
Received date: 2015-03-03
Revised date: 2015-04-08
Online published: 2015-04-20
[Purpose/significance] This paper combines the concept of provenance with the characteristics of data preservation, studies comprehensively how provenance is applied in long-term data preservation, and provides a reference for data preservation information systems that need to organize and manage provenance. [Method/process] It analyzes how relevant standards such as OAIS, PREMIS, and TRAC define provenance, and compares how existing long-term preservation systems apply it. [Result/conclusion] The result is a provenance application framework for data preservation, which summarizes the content of provenance and the methods used to capture, organize, store, and encapsulate it.
Key words: provenance; event; long-term preservation; preservation cycle; practice
Wu Zhenxin, Li Wenyan. The Application and Research of Data Provenance Technology within Long-term Data Preservation[J]. Library and Information Service, 2015, 59(8): 118-125. DOI: 10.13266/j.issn.0252-3116.2015.08.017