图书情报工作 ›› 2021, Vol. 65 ›› Issue (13): 87-95.DOI: 10.13266/j.issn.0252-3116.2021.13.009

• 情报研究 • 上一篇    下一篇

突发公共卫生事件中微博谣言的识别

石锴文, 刘勘   

  1. 中南财经政法大学信息与安全工程学院 武汉 430073
  • 收稿日期:2020-12-21 修回日期:2021-04-13 出版日期:2021-07-05 发布日期:2021-07-10
  • 通讯作者: 刘勘(OCRID:0000-0001-9339-7315),教授,博士,通讯作者,E-mail:liukan@zuel.edu.cn
  • 作者简介:石锴文(ORCID:0000-0002-3563-982X),本科生。
  • 基金资助:
    本文系中央高校基本科研业务费交叉学科创新研究项目"大数据支持下网络谣言的智能消解机制研究"(项目编号:2722021EK016)研究成果之一。

Weibo Rumor Identification in Public Health Emergencies

Shi Kaiwen, Liu Kan   

  1. School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073
  • Received:2020-12-21 Revised:2021-04-13 Online:2021-07-05 Published:2021-07-10

摘要: [目的/意义] 在"新冠"疫情这类突发公共卫生事件中,网络社交媒体上迅速产生大量关于疫情的言论,其中包含不少蓄意传播的谣言,不仅危害公众心理健康,而且会影响应对公共卫生事件的方案实施。识别突发公共卫生事件的谣言能够使民众正确面对危机,为社会安定、网络治理起到积极的维护作用。[方法/过程] 首先对采集到的疫情期间已被证实的谣言进行深度分析,提取谣言文本的主要特征,包括上下文特征、话题类别特征、情感程度特征、关键词特征等;然后针对文本分类模型中的文本特征表达较为单一的问题,利用不同的模型对提取的谣言文本特征进行向量化,并对各类文本特征进行加强和融合。其中通过TF-IDF计算的词向量权重在捕获上下文特征的同时,能够加强词粒度的关键词特征信息。最后,使用BiLSTM+DNN模型对融合的特征向量进行分类判别。[结果/结论] 实验结果表明,话题类别、情感程度等特征对谣言识别均有贡献,特别是经过强化后的词向量与其他特征融合后对识别准确率有明显提升,召回率、F1值等指标均达到90%以上,效果超过其他的谣言识别模型,说明笔者所构建的方法能够很好地实现对突发公共卫生事件背景下的谣言识别。

关键词: 公共卫生事件, 谣言识别, 微博, 多特征融合

Abstract: [Purpose/significance] In public health emergencies such as the COVID-19 epidemic, a large number of statements about the epidemic have quickly been generated on social media on the Internet, including many rumors that endanger public mental health and affect the implementation of national policies. Detecting these remarks and identifying the rumors can enable the people to respond to public health emergencies correctly, and play a positive role in maintaining social stability and network governance.[Method/process] Firstly, the confirmed rumors during the epidemic were collected for in-depth analysis, and the main features of the rumor text were extracted, including context features, topic category features, sentiment level features, keyword features, etc.; then aiming at the problem that the text feature expression in the text classification model was relatively single, different models were used to vectorize the extracted rumor text features, and then a rumor recognition model based on multi-feature fusion was constructed. In the construction of this model, TF-IDF was used to strengthen the word vector, so that the word vector can merge the keyword feature information of the word granularity while capturing the context feature. Finally, this paper used the BiLSTM+DNN model to classify the fused feature vectors.[Result/conclusion] The experimental results show that features such as topic category and emotional level all contribute to the recognition of rumors, especially the fusion of the strengthened word vector and other features to significantly improve the recognition accuracy, recall rate, F1 measure, etc. The indicators all reached more than 90%, and the effect surpassed other rumor recognition models, indicating that the method constructed in this article can respond well to the task of rumor recognition in the context of public health emergencies.

Key words: public health emergencies, rumor recognition, Weibo, multi-feature fusion

中图分类号: