[目的/意义] 数字人文研究的图像资源中蕴含大量信息但利用率极低,不能在异构数据库和不同的应用程序中得到有效的共享与重用,国际图像互操作框架打破了图像资源交换和共享的障碍。[方法/过程] 研究结合国际图像互操作框架和语义知识图谱(关联数据技术)进行图像资源的整合、共享与知识发现,对资源之间的关系进行揭示和知识推理,并通过CNNs算法对图像特征的提取与识别实现基于图像特征的语义检索辅助知识发现。[结果/结论] 提出一套数字人文图像资源整合与知识发现解决方案,并以印章图像资源为应用对象构建"印章知识中心"对以上解决方案的可行性和实践性进行实证检验。
[Purpose/significance] The image resources of digital humanities research contain a lot of information but the utilization rate is extremely low, so it cannot be effectively shared and reused in heterogeneous databases and different applications. The International Image Semantic Interoperability Framework (IIIF) breaks the barriers to image resource exchange and sharing. [Method/process] This study combined IIIF and semantic knowledge graph (linked data technology) to integrate, share and discover knowledge of image resources, reveal the relationship between resources and knowledge reasoning, and it realized semantic retrieval based on image features to assist knowledge discovery by the feature extraction and recognition of image features through CNNs algorithm. [Result/conclusion] Finally, a set of digital human image resource integration and knowledge discovery solutions was proposed, and the "Seal Knowledge Center" was constructed with the seal image resources as the application object to empirically test the feasibility and practicality of the above solutions.
[1] 刘炜,叶鹰. 数字人文的技术体系与理论结构探讨[J]. 中国图书馆学报,2017,43(5):32-41.
[2] IIIF Consortium. IIIF presentation API 2.0[EB/OL].[2019-12-26]. http://iiif.io/api/presentation/2.0/.
[3] Open Annotation Community Group. Open annotation data model.[EB/OL].[2019-12-26]. http://www.openannotation.org/spec/core/.
[4] 曾蕾,王晓光,范炜. 图档博领域的智慧数据及其在数字人文研究中的角色[J]. 中国图书馆学报,2018,44(1):17-34.
[5] 陈涛,刘炜,单蓉蓉,等. 知识图谱在数字人文中的应用研究[J]. 中国图书馆学报, 2019,45(6):1-19.
[6] Linked canvas, engaging people, art and ideas[EB/OL].[2019-10-26]. https://www.synaptica.com/wp-content/uploads/2015/03/Linked_Canvas_Factsheet.pdf.
[7] ALISON A. The ‘time machine’ reconstructing ancient Venice's social networks[J]. Nature, 2017, 7658(546):341-344.
[8] 夏翠娟,张磊,贺晨芝. 面向知识服务的图书馆数字人文项目建设:方法、流程与技术[J].图书馆论坛,2018,38(1):1-9.
[9] 曾子明,秦思琪. 面向数字人文的移动视觉搜索模型研究[J]. 情报资料工作,2018,39(6):21-28.
[10] 侯西龙,谈国新,庄文杰,等基于关联数据的非物质文化遗产知识管理研究[J].中国图书馆学报,2019,45(2):88-108.
[11] 中国历代人物传记资料库(CBDB)[EB/OL].[2019-10-26].http://cbdb.library.sh.cn/.
[12] IIIF Presentation API 1.0[EB/OL].[2019-10-26]. http://iiif.io/api/search/1.0/.
[13] TOUSCH A M, HERBIN S, AUDIBERT J Y. Semantic hierarchies for image annotation:a survey[J]. Pattern recognition, 2012, 45(1):333-345.
[14] 陈涛,刘炜,朱庆华. 中文百科概念术语服务平台SinoPedia的构建研究[J]. 中国图书馆学报,2018,44(4):4-18.
[15] 陈涛,张永娟,刘炜,等. 关联数据发布的若干规范及建议[J].中国图书馆学报,2019,45(1):34-46.
[16] CHEN T, ZHANG Y, WANG Z, et al. (2019) SinoPedia-A linked data services platform for decentralized knowledge base[J]. PLOS ONE, 2019, 14(8):e0219992.
[17] 印章知识中心[EB/OL].[2019-10-26].http://sinopedia.library.sh.cn:8180/seal/seal/search.