图书情报工作 ›› 2021, Vol. 65 ›› Issue (9): 41-50.DOI: 10.13266/j.issn.0252-3116.2021.09.005

• 情报研究 • 上一篇    下一篇

融合句法特征和句法相似度的网络舆情突发事件识别方法研究

陈健瑶, 翟姗姗, 夏立新, 刘德印   

  1. 华中师范大学信息管理学院 武汉 430079
  • 收稿日期:2020-12-09 修回日期:2021-02-23 出版日期:2021-05-05 发布日期:2021-06-02
  • 作者简介:陈健瑶(ORCID:0000-0003-1890-7404),硕士研究生,E-mail:1041403539@qq.com;翟姗姗(ORCID:0000-0002-2787-0183),副教授,博士;夏立新(ORCID:0000-0002-4162-2282),教授,博士,博士生导师;刘德印(ORCID:0000-0002-1769-3160),硕士研究生。

Research on Network Public Opinion Emergency Recognition Method Based on Syntactic Features and Syntactic Similarity

Chen Jianyao, Zhai Shanshan, Xia Lixin, Liu Deyin   

  1. School of Information Management, Central China Normal University, Wuhan 430079
  • Received:2020-12-09 Revised:2021-02-23 Online:2021-05-05 Published:2021-06-02

摘要: [目的/意义] 快速、准确地从突发网络舆情文本中识别事件。[方法/过程] 提出一种融合句法特征和句法相似度的网络舆情突发事件识别方法。结合句法特征提出面向事件的句法特征提取方法,利用事件语义标注和句法特征提取方法构造事件句法特征库,通过计算待测文本与句法库的句法相似度来识别网络舆情突发事件。[结果/结论] 以新型冠状病毒肺炎疫情为例,所提出网络舆情突发事件识别方法在该舆情下的最优相似度为0.93,在此相似度下从一段新的文本中识别出160个事件和30个非事件,F1值达到了0.848。通过方法测评证明网络舆情突发事件识别方法在利用句法相似度识别事件和进行相同相邻词性合并等方面创新的有效性。

关键词: 网络舆情, 事件识别, 句法特征, 句法相似度

Abstract: [Purpose/significance] This study aims to identify events from the text of sudden network public opinion quickly and accurately.[Method/process] This paper proposed a method to identify network public opinion emergencies by integrating syntactic features and syntactic similarity. An event oriented syntactic feature extraction method was proposed based on syntactic features. Event syntactic feature database was constructed by using event semantic annotation and syntactic feature extraction methods. The network public opinion emergencies were identified by calculating the syntactic similarity between the text to be tested and the syntax database.[Result/conclusion] Taking the novel coronavirus pneumonia epidemic as an example, the optimal similarity of the network public opinion emergency identification method proposed by the author is 0.93 in this public opinion. 160 events and 30 non events are identified from a new text under this similarity, and the F1 value reaches 0.848. Through the method evaluation, it is proved that the proposed method is effective in using syntactic similarity to identify events and merge the same adjacent parts of speech.

Key words: internet public opinion, event identification, syntax features, syntactic similarity

中图分类号: