知识组织

基于信息分类的网络书评内容挖掘与整合研究

  • 祝振媛
展开
  • 北京大学信息管理系 北京 100871
祝振媛(ORCID:ORCID:0000-0002-2391-7706),博士研究生,E-mail:zhuzhenyuan68@pku.edu.cn。

收稿日期: 2015-11-23

  修回日期: 2015-12-20

  网络出版日期: 2016-01-05

Content Mining and Integration Study of Online Book Reviews Based on Information Classification

  • Zhu Zhenyuan
Expand
  • Department of Information Management, Peking University, Beijing 100871

Received date: 2015-11-23

  Revised date: 2015-12-20

  Online published: 2016-01-05

摘要

[目的/意义]从定量分析和定性分析两个方面对英文网络书评进行内容挖掘,形成一套基于信息分类的英文网络书评的内容挖掘方法体系,实现多文本书评的信息整合。[方法/过程]对书评文本中句子的分类方法、关键信息的提取方法、情感分类的方法以及内容的呈现方式等几方面进行实验和改进。[结果/结论]用户评价结果表明,本文所设计的内容挖掘方法所生成的书评信息摘要在生成质量和有用性两方面都有较好的表现。

本文引用格式

祝振媛 . 基于信息分类的网络书评内容挖掘与整合研究[J]. 图书情报工作, 2016 , 60(1) : 114 -124 . DOI: 10.13266/j.issn.0252-3116.2016.01.016

Abstract

[Purpose/significance] This paper focuses on the study of content mining of English online book reviews from the quantitative and qualitative analysis, and gives the content mining methods for English online book reviews based on information classification in order to achieve information integration of multiple-text book reviews.[Method/process] This paper also does research of sentence classification method, extraction method of key information, sentiment classification method and presentation of the abstract content of book reviews.[Result/conclusion] Analysis of the evaluation shows that the results which are based on content mining method of this paper have better performance in writing quality and usefulness.

参考文献

[1] 姚天昉,程希文,徐飞玉,等.文本意见挖掘综述[J].中文信息学报,2008,22(3):71-80.
[2] SentiWordNet[EB/OL].[2015-08-18].http://sentiwordnet.isti.cnr.it.
[3] LIU B,HU M,CHENG J.Opinion observer:analyzing and comparing opinions on the Web[ED/OL].[2015-11-16].http://ccc.inaoep.mx/~villasen/index_archivos/cursoTATⅡ/ClasificacionOpiniones/Liu-OpinionObserver05.pdf.
[4] The Micro-WNOp Corpus[EB/OL].[2015-08-18].http://www-3.unipv.it/wnop/.
[5] BALAHUR A,MONTOYO A.Applying a culture dependent Emotion Triggers database for text valence and emotion classification[J].Procesamiento del lenguaje natural,2008,40:107-114.
[6] DAVE K,LAWRENCE S,PENNOCK D M.Mining the peanut gallery:opinion extraction and semantic classification of product reviews[ED/OL].[2015-11-16].http://www.kushaldave.com/p451-dave.pdf.
[7] 李纲,王忠义.基于语义的情感挖掘系统的设计与实现[J].现代图书情报技术,2011(7):97-103.
[8] RILOFF E,WIEBE J,WILSON T.Learning subjective nouns using extraction pattern bootstrapping[ED/OL].[2015-11-16].http://www.aclweb.org/anthology/W03-0404.
[9] LENHART A FOX SUSANNAH.Bloggers:a portrait of the Internet's new storytellers[J/OL].[2015-11-16].http://www.pewinternet.org/files/old-media/Files/Reports/2006/PIP%20Bloggers%20Report%20July%2019%202006.pdf.pdf.
[10] 侯锋,王传廷,李国辉.网络意见挖掘、摘要与检索研究综述[J].计算机科学,2009,36(7):15-19.
[11] 邓凯英,彭超.网络舆情监测系统的研究与实现[J].现代情报,2013(33):38-41.
[12] 林达真,李绍滋,曹冬林.基于时间分布特征的博客突发事件检测[J].计算机工程与科学,2010(10):145-149.
[13] 马彦.大数据环境下微博舆情热点话题挖掘方法研究[J].现代情报,2014(11):29-33.
[14] CHAOVALIT P,ZHOU L.Movie reviews mining:a comparison between supervised and unsupervised classification approaches[ED/OL].[2015-11-16].http://www.computer.org/csdl/proceedings/hicss/2005/2268/04/22680112c.pdf.
[15] KO M,KIM H W,YI M Y,et al.Movie commenter:aspect-based collaborative filtering by utilizing user comments[ED/OL].[2015-11-16].https://www.researchgate.net/publication/221391559_MovieCommenter_Aspect-based_collaborative_filtering_by_utilizing_user_comments.
[16] The Stanford Natural Language Processing Group[EB/OL].[2015-08-16].http://nlp.stanford.edu/software/corenlp.shtml.
[17] MURTHY S K.Automatic construction of decision trees from data:a multi-disciplinary Survey[J].Data mining & knowledge discovery,2000,2(4):345-389.
[18] LANGLEY P,SAGE S.Induction of selective Bayesian classifiers[ED/OL].[2015-11-16].http://www.isle.org/~langley/papers/select.uai94.pdf.
[19] SentiWordNet[EB/OL].[2015-08-12].http://sentiwordnet.isti.cnr.it.
[20] About Wordnet[EB/OL].[2015-08-12].http://wordnet.princeton.edu/.
[21] LIN C Y,HOVY E.Automatic evaluation of summaries using n-gram co-occurrence statistics[ED/OL].[2015-11-16].http://www.aclweb.org/anthology/N03-1020.pdf.
[22] Textcompactor[EB/OL].[2015-08-12].http://www.textcompactor.com/about.
[23] MANI I.Summarization evaluation:An overview[J/OL].[2015-11-16].http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings2/sum-mani.pdf].2001.
[24] SAGGION H,RADEV D,TEUFEL S,et al.Developing infrastructure for the evaluation of single and multi-document summarization systems in a cross-lingual environment[ED/OL].[2015-11-16].http://www.researchgate.net/profile/Wai_Lam/publication/228761623_Developing_infrastructure_for_the_evaluation_of_single_and_multi-document_summarization_systems_in_a_cross-lingual_environment/links/0046351a49fa4e5115000000.pdf.
文章导航

/