图书情报工作 ›› 2020, Vol. 64 ›› Issue (17): 103-113.DOI: 10.13266/j.issn.0252-3116.2020.17.011

• 情报研究 • 上一篇    下一篇

多维特征下社会化问答社区答案排序研究

易明, 张婷婷, 李梓   

  1. 华中师范大学信息管理学院 武汉 430079
  • 收稿日期:2020-01-13 修回日期:2020-04-09 出版日期:2020-09-05 发布日期:2020-09-05
  • 通讯作者: 李梓奇(ORCID:0000-0003-1880-2426),博士研究生,通讯作者,E-mail:lzq911015@qq.com
  • 作者简介:易明(ORCID:0000-0002-4864-6025),教授,博士生导师;张婷婷(ORCID:0000-0002-5068-8232),硕士研究生.
  • 基金资助:
    本文系国家社会科学基金项目"基于人类动力学的信息网络信息交流行为研究"(项目编号:16BTQ076)和中央高校基本科研业务费重大培育项目"智慧图书馆系统关键技术与应用研究"(项目编号CCNU18JCXK04)研究成果之一。

Research on the Ranking of Social Q&A Community Answers Based on Multidimensional Features

Yi Ming, Zhang Tingting, Li Ziqi   

  1. School of Information Management, Central China Normal University, Wuhan 430079
  • Received:2020-01-13 Revised:2020-04-09 Online:2020-09-05 Published:2020-09-05

摘要: [目的/意义] 研究多维特征对社会化问答社区答案排序的影响,以提高问答社区服务质量并尽可能优化用户体验。[方法/过程] 从答案特征、回答者特征和投票者特征多个维度构建社会化问答社区答案排序特征体系,比较基于深度学习、树、神经网络、支持向量机等11种排序学习算法在问答社区数据集上的适用性,并训练随机森林分类算法,得到每个特征的重要程度。[结果/结论] 实验结果表明,基于深度学习的排序学习算法在NDCG@k和MRR指标上的性能均优于其他排序算法,投票者的影响力特征最为重要,其次是答案内容特征,最后是回答者的专业度特征,可以考虑从增加答案排序方式的多样性和提高答案排序算法的综合性两个维度进一步优化答案排序。

关键词: 社会化问答社区, 答案质量, 排序学习算法, 深度学习算法

Abstract: [Purpose/significance] This paper studies the impact of multi-dimensional characteristics on Social Q&A Communities answer ranking, which can improve the service quality in Social Q&A Communities and optimize the user experience.[Method/process] This paper constructed a Social Q&A Communities answer ranking feature system from the answer feature, respondent feature and voter feature dimensions, and then we compared the applicability of 11 ranking learning algorithms based on deep learning, tree, neural network and support vector machine in Social Q&A Communities data set, and train random forest classification algorithm to get the importance of each feature.[Result/conclusion] The experimental results show that the sorting learning algorithm based on deep learning performs better than other sorting algorithms in NDCG@k and MRR indexes, and the influence characteristics of voters are very important, followed by the content characteristics of the answers, and finally the professional characteristics of the respondents. From the two dimensions of increasing the diversity of the answer ranking method and improving the comprehensiveness of the answer ranking algorithm, we provide some suggestions for the optimization of community answer ranking.

Key words: Social Q&A Community, answer quality, ranking learning algorithm, deep learning algorithm

中图分类号: