[Purpose/significance] This paper studies content mining of English online book reviews through quantitative and qualitative analysis, and proposes content mining methods based on information classification in order to integrate information across multiple book review texts. [Method/process] It examines methods for sentence classification, key information extraction, and sentiment classification, as well as the presentation of the summarized content of book reviews. [Result/conclusion] The evaluation shows that the results produced by the content mining method proposed in this paper perform better in terms of writing quality and usefulness.
Zhu Zhenyuan. Content Mining and Integration Study of Online Book Reviews Based on Information Classification[J]. Library and Information Service, 2016, 60(1): 114-124.
DOI: 10.13266/j.issn.0252-3116.2016.01.016