图书情报工作 ›› 2022, Vol. 66 ›› Issue (19): 4-14.DOI: 10.13266/j.issn.0252-3116.2022.19.001

所属专题: 面向数字人文研究的稷下学文献资料数据库建设研究

• 专题:面向数字人文研究的稷下学文献资料数据库建设研究 • 上一篇    下一篇

数字人文视域下古籍数据库建设关键技术研究——兼评稷下学文献资料数据库的建设思路

鞠孜涵1, 白如江1, 张玉洁1, 王志民2   

  1. 1 山东理工大学信息管理研究院 淄博 255049;
    2 山东理工大学齐文化研究院 淄博 255049
  • 收稿日期:2022-04-21 修回日期:2022-07-09 出版日期:2022-10-05 发布日期:2022-10-25
  • 通讯作者: 鞠孜涵,硕士研究生;白如江,教授,博士生导师,通信作者,E-mail:brj@sdut.edu.cn。
  • 作者简介:张玉洁,硕士研究生;王志民,教授
  • 基金资助:
    本文系教育部哲学社会科学研究重大课题攻关项目"稷下学派文献整理与数据库建设研究"(项目编号:19JZD011)研究成果之一。

Research on Key Technologies of Ancient Books Database Construction from the Perspective of Digital Humanities——Also Comment on the Construction Idea of Jixia Literature Database

Ju Zihan1, Bai Rujiang1, Zhang Yujie1, Wang Zhimin2   

  1. 1 Institute of Information Management, Shandong University of Technology, Zibo 255049;
    2 Qiculture Research Institute, Shandong University of Technology, Zibo 255049
  • Received:2022-04-21 Revised:2022-07-09 Online:2022-10-05 Published:2022-10-25

摘要: [目的/意义] 随着数字人文的迅速发展,用户对知识服务的需求日益增长,对承载着中国优秀传统文化的古籍进行数字化转型,建设能够支撑起人文计算的古籍文献数据库迫在眉睫。[方法/过程] 数字人文视域下古籍的数据库建设需要依靠先进的计算机技术,在深度调研数据库建设过程中依赖的关键技术基础上,将古籍文献数据库的建设过程划分为数字化、文本化、知识化和图谱化4个阶段,详细论述古籍汉字识别技术、命名实体识别、关联数据以及GIS技术等,深入阐述相关技术细节和指标。[结果/结论] 提出稷下学文献资料数据库建设的整体思路。最后,通过分析与总结,指出古籍数据库建设仍需解决的问题和未来的发展方向。

关键词: 数字人文, 古籍数据库, 数字化, 文本化, 知识化, 图谱化

Abstract: [Purpose/Significance] With the rapid development of digital humanities, users' demand for knowledge services is increasing day by day. It is extremely urgent to carry out digital transformation of ancient books carrying excellent traditional Chinese culture and build ancient books literature database that can support humanistic computing.[Method/Process] The construction of the database of ancient books from the perspective of digital humanities needed to rely on advanced computer technology. This paper deeply investigated the key technologies relied on in the process of database construction, and divided the construction process of the database of ancient books into four stages:digitization, textuality, knowledgeable and map. It also discussed in detail the Chinese character recognition technology, named entity recognition, associated data and GIS technology of ancient books, and expounded the relevant technical details and indicators.[Result/Conclusion] The whole idea of constructing jixia literature database is put forward. Finally, the paper analyzes and summarizes the problems still to be solved in the construction of ancient books database and points out the future development direction.

Key words: digital humanities, database of ancient books, digitization, textuality, knowledgeable, map

中图分类号: