图书情报工作 ›› 2020, Vol. 64 ›› Issue (7): 30-38.DOI: 10.13266/j.issn.0252-3116.2020.07.004

• 专题:先秦典籍的语义组织与挖掘研究 • 上一篇    下一篇

春秋时期社会发展的主题挖掘与演变分析——以《左传》为例

何琳, 乔粤, 刘雪琪   

  1. 南京农业大学信息管理系 南京 210095
  • 收稿日期:2019-07-10 修回日期:2019-10-26 出版日期:2020-04-05 发布日期:2020-04-05
  • 作者简介:何琳(ORCID:0000-0002-4207-3588),教授,博士,博士生导师,E-mail:helin@njau.edu.cn;乔粤(ORCID:0000-0002-1968-9608),硕士研究生;刘雪琪(ORCID:0000-0002-5346-7291),硕士研究生。
  • 基金资助:
    本文系国家社会科学基金项目"基于典籍的中华传统文化知识表达体系自动构建方法"(项目编号:18BTQ063)研究成果之一。

Topic Mining and Evolution Analysis of Social Development in Spring and Autumn Period——A Case of Studying Zuo Zhuan

He Lin, Qiao Yue, Liu Xueqi   

  1. College of Information Science & Technology, Nanjing Agricultural University, Nanjing 210095
  • Received:2019-07-10 Revised:2019-10-26 Online:2020-04-05 Published:2020-04-05

摘要: [目的/意义] 在人文计算迅速发展的背景下,利用文本挖掘技术对《左传》进行聚类计算,为春秋时期社会发展状况的主题挖掘等定量分析提供参考,同时对典籍文本多维度重组和分析也具有一定的借鉴意义。[方法/过程] 采用文本聚类方法对《左传》进行多维度的定量分析,打破《左传》线性的编年体记载顺序,先运用词匹配算法从《左传》特征词语料中得到各个诸侯国语料,再将LDA主题模型先后用于处理《左传》特征词语料和选取的诸侯国语料,最后结合时间信息进行主题强度计算。[结果/结论] 实验结果表明,根据主题-词分布可以挖掘出春秋时期社会和诸侯国各方面的发展内容,通过主题强度变化曲线可以总结出春秋时期社会和各诸侯国的各方面发展态势。通过LDA主题聚类方法最终展现出了春秋时期整个社会以及不同诸侯国在战争、政治及外交等的发展变迁。

关键词: 《左传》, 主题挖掘, LDA主题模型, 主题演化, 春秋时期社会变迁

Abstract: [Purpose/significance] In the context of the rapid development of humanistic computing, this paper uses text mining technology to cluster Zuo Zhuan, which provides a reference for quantitative analysis such as topic mining in Spring and Autumn Period, and has a certain reference significance for multi-dimensional reorganization and analysis of classical texts. [Method/process] This paper uses text clustering method to analyze Zuo Zhuan quantitatively in many dimensions, breaking the linear and chronological record order of Zuo Zhuan. Firstly, using the word matching algorithm, the corpus of each vassal state is obtained from the characteristic words of Zuo Zhuan. Then the LDA topic model is used to process the characteristic words of Zuo Zhuan and the corpuses of selected vassal states. Finally, the topic strength calculation is performed in combination with the time information. [Result/conclusion] The experimental results show that the development of the Spring and Autumn Society and the vassal states can be explored according to the theme-word distribution. The development trend of the Spring and Autumn Society and various vassal states can be summarized through the theme intensity curve. Through the LDA topic clustering method, the development of war, politics and diplomacy in the whole society and different vassal states in the Spring and Autumn Period is finally revealed.

Key words: Zuo Zhuan, topic mining, LDA topic model, topic evolution, social changes in the Spring and Autumn Period

中图分类号: