综述述评

网络日志存档研究现状分析

  • 郭红梅 ,
  • 张智雄 ,
  • 刘振
展开
  • 1. 中国科学院大学、中国科学院国家科学图书馆;
    2. 中国科学院国家科学图书馆
郭红梅,中国科学院大学、中国科学院国家科学图书馆博士研究生,E-mail:guohm@mail.las.ac.cn;张智雄,中国科学院国家科学图书馆研究馆员,馆长助理,信息系统部主任;刘振,中国科学院大学、中国科学院国家科学图书馆博士研究生。

收稿日期: 2013-04-25

  修回日期: 2013-06-13

  网络出版日期: 2013-06-20

基金资助

本文系国家社会科学基金后期资助项目"数字资源长期保存的技术研究与实践"(项目编号:09FTQ005)研究成果之一。

Research Status Analysis on Weblog Preservation

  • Guo Hongmei ,
  • Zhang Zhixiong ,
  • Liu Zhen
Expand
  • 1. National Science Library of CAS, Beijing 100190;
    2. University of Chinese Academy of Sciences, Beijing 100190

Received date: 2013-04-25

  Revised date: 2013-06-13

  Online published: 2013-06-20

摘要

指出网络日志作为特殊的数字化资源受到长期保存界的关注,从网络日志归档项目出发对网络日志的研究现状进行概述,对网络日志归档过程中技术、方法以及在采集、保存和管理中存在的局限性进行分析,并对现有网页归档项目进行描述。最后从目标、重要平台、保存策略以及所产生的社会影响几个角度对网络日志归档项目BlogForever进行说明,针对网络日志归档的现状,认为在未来研究中仍需要对技术和方法作进一步改进。

本文引用格式

郭红梅 , 张智雄 , 刘振 . 网络日志存档研究现状分析[J]. 图书情报工作, 2013 , 57(12) : 143 -148 . DOI: 10.7536/j.issn.0252-3116.2013.12.027

Abstract

As the special digital resource, weblog gains the attention from the digital preservation field. This paper concludes the research status of weblog preservation from the view of the weblog archiving projects. It also analyzes the limitations in the arching technology and method as well as in the process of aggregation, preservation and management. Them it introduces the existing weblog archive projects. Finally, this paper analyzes Blogforever project from its objective, important platform, preservation strategy and social effects. According to its research status, it is necessary to improve the quality of weblog preservation from technology and method in the future.

参考文献

[1] Hank C, Choemprayong S, Sheble L. Blogger perceptions on digital preservation[C]//Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries.New York:The Association for Computing Machinery,2007:477-477.
[2] Banos V, Baltas N, Manolopoulos Y. Trends in blog preservation[C]//Proceedings of the 14th International Conference on Enterprise Information Systems (ICEIS).Poland:Springer-Vertag,2012:168-172.
[3] Maureen P, Richard D. ArchivePress: A really simple solution to archiving blog content[C]//The Sixth International Conference on Preservation of Digital Objects( iPRES).San-Francissco:The California Digital Library,2009:264-268.
[4] Agarwal N, Liu H. Blogosphere: Research issues, tools and applications[J]. ACM SIGKDD Explorations, 2008,10(1):18-31.
[5] Ashley K, Davis R, Guy M, et al. A guide to Web preservation[C]//PoWR project.University of London Conputer Centre(ULCC),2010:14-18.
[6] Tumblr numbers: The rapid rise of social blogging[EB/OL].[2013-03-10].http://mashable.com/2011/11/14/tumblr-infographic/.
[7] Sroka T N. Understanding the political influence of blogs: A study of the growing importance of the blogosphere in the US Congress, Institute for Politics, Democracy and the Internet[EB/OL].[2013-03-10].http://www.ipdi.org.
[8] Strodl S, Petrov P, Rauber A. Research on digital preservation within projects co-funded by the European Union in the ICT programme[J].Vienna University of Technology, Tech Rep,2011(5):245-251.
[9] Edelstein O, Factor M, King R, et al.Evolving domains,problems and solutions for long term[C]//The Eighth Proceedings International Conference Preservation of Digital Objects(iPRES).Singapore.The National Library Board,2011:271-274.
[10] LiWA.Living Web Archives Project[EB/OL].[2013-03-26].http://liwa-project.eu.
[11] ARCOMEM. From Collect-All Archives to Community Memories. [EB/OL].[2013-03-26].http://www.arcomem.eu.
[12] SCAPE.Scalable Preservation Environments[EB/OL].[2013-03-26].http://www.scape-project.eu.
[13] LAWA. Longitudinal Analytics of Web Archive Data Project[EB/OL].[2013-03-26].http://www.lawa-project.eu/.
[14] Stepanyan K, Gkotsis G, Kalb H.Blogs as objects of preservation: Advancing the discussion on significant properties[C]//The Proceedings International Conference Preservation of Digital Objects (iPRES).Toronto:University of Toron to iSchool,2012:295-297.
[15] Nardi B A,Schiano D J,Gumbrecht M,et al.Why we blog[J].Communications of the ACM,2004,47(12):41-46.
[16] Pluenpavarn P,Panteli N.Building social identity through blogging[J].Palgrave Macmillan,2008(5):195-198.
[17] Banos V, Stepanyan K, Manolopoulos Y, et al.Technological foundations of the current blogosphere[C]//The International Conference on Web Intelligence, Mining and Semantics (WIMS).Romania:Vniversity of Cralova,2012:125-127.
[18] Heritrix. IA Web Crawler[EB/OL].[2013-03-20].https://webarchive.jira.com/wiki/display/Heritrix/.
[19] Archive-it. Web Archiving Services[EB/OL].[2013-03-20].http://www.archive-it.org/.
[20] DPE.Digital presentation Europe[EB/OL].[2013-03-26].http://www.digitalpresentationeurope.eu.
[21] CASPAR.CASPAR project[EB/OL].[2013-03-26].http://casparpreserves.eu.
[22] PANDORA.PANDORA project[EB/OL].http://pandora.nla.gov.au.
[23] Ronallo J. HTML 5 Microdata and Schema.org[J]. Code 4 Lib Journal,2012(6):296-209.
[24] BlogForever. BlogForever project[EB/OL].[2013-03-26].http://blogforever.eu.
[25] Kim Y, Ross S. BlogForever: D2.5 Weblog spam filtering report and associated methodology[EB/OL].[2013-03-26].http://blogtorever.ell/deliretables/.
[26] Kalb H, Kasioumis N, García Llopis J, et al. BlogForever:D4.1 User Requirements and Platform Specifications Report[EB/OL].[2013-03-26].http://blogforever.ell/deliverables/.

文章导航

/