图书情报工作 ›› 2012, Vol. 56 ›› Issue (08): 26-55.

• 专题一 • 上一篇    下一篇

文本挖掘工具述评

张雯雯,许鑫   

  1. 华东师范大学商学院信息学系
  • 收稿日期:2011-11-18 修回日期:2012-01-04 出版日期:2012-04-20 发布日期:2012-04-20
  • 通讯作者: 许鑫

Review of Text Mining Tools

Zhang Wenwen ,Xu Xin   

  1. Department of Informatics, Business School, East China Normal University,
  • Received:2011-11-18 Revised:2012-01-04 Online:2012-04-20 Published:2012-04-20
  • Contact: Xu Xin

摘要:

简要介绍一些商业文本挖掘工具和开源文本挖掘工具,针对其中四款典型的开源工具进行详细的比较,包括数据格式、功能模块和用户体验三个方面;选取三种各具特色的工具就其文本分类功能进行测评。最后,针对开源文本挖掘工具的现状,提出几点建议。

关键词: 文本挖掘, 文本挖掘工具, 开源文本挖掘工具

Abstract:

The authors briefly describe some commercial text mining tools and open source text mining tools, coupled with detailed comparisons of four typical open source tools concerning data format, functional module and user experience firstly. Then, the authors realize the testing of text classification function for three kinds of distinctive tool design. Finally, the authors offer some suggestions for the status of open source text mining tools.

Key words: text mining, text mining tools, open source text mining tools