图书情报工作 ›› 2018, Vol. 62 ›› Issue (19): 77-85.DOI: 10.13266/j.issn.0252-3116.2018.19.010

• 情报研究 • 上一篇    下一篇

微博舆情传播周期中不同传播者的主题挖掘与观点识别

廖海涵1, 王曰芬1,2, 关鹏1   

  1. 1. 南京理工大学经济管理学院 南京 210094;
    2. 江苏省社会公共安全科技协同创新中心 南京 210094
  • 收稿日期:2018-03-12 修回日期:2018-06-11 出版日期:2018-10-05 发布日期:2018-10-05
  • 通讯作者: 王曰芬(ORCID:0000-0002-7143-7766),教授,博士生导师,通讯作者,E-mail:yuefen163@163.com
  • 作者简介:廖海涵(ORCID:0000-0003-4953-1075),博士研究生;关鹏(ORCID:0000-0002-2308-3019),副教授,博士。
  • 基金资助:
    本文系国家社会科学基金重点项目"大数据环境下社会舆情与决策支持方法体系研究"(项目编号:14AZD084)研究成果之一。

Topic Mining and Viewpoint Recognition of Different Communicators in the Transmission Cycle of Micro-blog Public Opinion

Liao Haihan1, Wang Yuefen1,2, Guan Peng1   

  1. 1. School of economics and management, Nanjing University of Science and Technology, Nanjing 210094;
    2. Social Public Safety Science and Technology Co-Innovation Center, Jiang Su Province, Nanjing 210094
  • Received:2018-03-12 Revised:2018-06-11 Online:2018-10-05 Published:2018-10-05

摘要: [目的/意义] 探索微博舆情传播周期中不同传播者关注的舆情热点和传播内容的主要观点,进而发现舆情传播的特点和规律,为舆情分析与决策提供依据。[方法/过程] 以特定舆情事件的事实文本数据为来源,以生命周期理论和LDA方法为指导,设计研究流程与构建研究模型,对微博舆情事件中不同传播者的话题进行主题研究,其中包括主题抽取和结果语义标注、各阶段的不同传播者主题的语义分析、基于时间维度的舆情主题观点识别与刻画。[结果/结论] 研究发现,论文所提出的研究模型能够挖掘出舆情传播周期中不同传播者的主题结构、观点脉络以及特征,研判出分布在文字当中有关联性的、代表性的、重要的词语。同时,结论中还发现微博中的官媒、大众媒体发布信息中的话题和用户谈论的热点话题具有明显的差异性。

关键词: 微博舆情, 不同传播者, 主题挖掘, 观点识别, 生命周期理论, LDA主题模型

Abstract: [Purpose/significance] This paper aims to explore the hot spot of public opinion and the main point view of the communication of different communicators in the transmission cycle of micro-blog public opinion and to discover the characteristics and laws of public opinion transmission, which can provide the basis for public opinion analysis and decision making.[Method/process] This study is based on the text data of a true public opinion event. It adopted life cycle theory and LDA method to design research process and construct research model, and researched topics of different communicators in micro-blog public opinion events, including topic extraction and semantic annotation, semantic analysis of different communicators at various stages, recognition and characterization of theme views of public opinion based on time dimension.[Result/conclusion] It is found that the research model proposed in this paper can excavate topic theme structure, view and characteristics of different communicators in the communication cycle of public opinion. And the words with actual meaning and irritating function are related, representative and important. At the same time, the conclusion also found a hot topic in the mass media or the official micro-blog is totally different from micro-blog users.

Key words: micro-blog public opinion, different communicators, topic mining, view identification, life cycle theory, LDA theme model

中图分类号: