[Purpose/significance] This paper aims to research the integration of heterogeneous information in the process of data integration. Based on the accuracy requirements of literature information extraction in literature Meta-synthesis system, we proposed a heterogeneous information standardized approach by the case of resource and environment subject.[Method/process] Using the self-discipline ontology of resources and environment subject, we put forward the idea of heterogeneous information standardization process to guide to putting up literature comprehensive integration platform supporting comprehensive integration of human-computer interaction and information integration, through the analysis of standardizing of heterogeneous information in the geographic space, time and attribute extraction in resources and environment subject.[Result/conclusion] Finally, the paper realized the task of data preparation phase in literature Meta-synthesis, according to knowledge extraction and processing of documents from different sources in different data formats. The standardization of heterogeneous information is only the starting point of knowledge discovery process, and we will focus on the statistical analysis and visual display of standardized information to completely implement knowledge discovery process of literature meta-synthesis.
Qu Jiansheng
,
Liu Hongxu
. The Standardization of Heterogeneous Information in Knowledge Discovery: A Case Study of Resources and Environment Literature[J]. Library and Information Service, 2016
, 60(6)
: 84
-90
.
DOI: 10.13266/j.issn.0252-3116.2016.06.013
[1] 邱均平,李小涛,董克. 图情领域可视化研究的发展、演化与创新[J]. 图书情报工作,2014,58(13):125-131.
[2] SWANSON D R. Undiscovered public knowledge[J]. The library quarterly, 1986,56(2):103-118.
[3] LI D, HAN J, SHI X, et al. Knowledge representation and discovery based on linguistic atoms[J]. Knowledge-based systems, 1998, 10(7):431-440.
[4] 波普尔.猜想与反驳——科学知识的增长[M].傅季重, 纪树立, 周昌忠,等译.上海:上海译文出版社,1986.
[5] 荣毅虹,梁战平. 基于文献的发现[J]. 情报学报,2002,21(4):386-390.
[6] 崔瑞琴,孟连生. 数字信息资源整合问题研究[J]. 图书情报工作,2007,51(7):35-37.
[7] 牛奉高. 数字文献资源高维聚合模型研究[D].武汉:武汉大学,2014.
[8] 窦天芳,姜爱蓉,张成昱,等. WEB环境下多源数据的集成服务——以清华大学新期刊导航为例[J]. 大学图书馆学报,2010(3):80-84.
[9] 肖希明,田蓉. 国外公共数字文化资源整合的现状与发展趋势[J]. 国家图书馆学刊,2014(5):48-56.
[10] CALISTRU C, RIBEIRO C, DAVID G. Multimedia in cultural heritage manuscripts:integrating description,transcription, and image content[J]. Eurasip Journal on image & video processing, 2009, 7238(3):347-386.
[11] HERNÁNDEX F, WERT C, RECIO I, et al. XML for libraries, archives, and museums:the project COVAX[J]. Applied artificial intelligence, 2003, 17(8/9):797-816.
[12] RAYWARD W B. Electronic information and the functional integration of libraries, museums, and archives[M]. E Higgs History & Electronic Artefacts. Oxford:Clarendon Press,1998:207-226.
[13] 马文峰,杜小勇,卢晓惠. 基于知识的资源整合[J]. 情报资料工作,2007(1):51-56.
[14] Simfinder:a flexible clustering tool for summarization[EB/OL].[2016-02-22].https://www0.comp.nus.edu/~kanmy/papers/simfinder.pdf.
[15] 赵新勇.基于多源异构数据的高速公路交通安全评估方法[D].哈尔滨:哈尔滨工业大学,2013.
[16] 刘亚东,彭舰,张达平. 基于智能的网页信息提取系统的研究与设计[J]. 四川大学学报(自然科学版),2009(4):957-962.
[17] 郑建明. 数字文献资源的整合与服务——以江苏省高校文献资源保障体系建设为原型的个案研究[J]. 大学图书馆学报,2007(5):6-9.
[18] 智立方.知识发现系统[EB/OL].[2016-02-28].http://zlf.cqvip.com/help/about.html.
[19] 廖崇粮. Web信息自动抽取技术的研究[D].成都:电子科技大学,2012.
[20] 和延立,杨海成,何卫平,等.信息集成与知识集成[J].计算机工程与应用,2003(4):38-41.
[21] BUCCELLA A, CECHICH A. An ontology approach to data integration[J].Journal of Computer Science and Technology, 2003,3(2):62-68.
[22] STUDER R, BENJAMINS V R, FENSEL D. Knowledge engineering:principles and methods[J]. Data and knowledge engineering, 1998, 25(1/2):161-197.
[23] 徐少坤. 地理空间元数据可视化研究与实践[D].郑州:解放军信息工程大学,2013.
[24] 柯青.网络环境下异构信息检索标准体系研究[D].武汉:武汉大学,2004.