图书情报工作 ›› 2021, Vol. 65 ›› Issue (4): 73-80.DOI: 10.13266/j.issn.0252-3116.2021.04.008

• 知识组织 • 上一篇    下一篇

学术查询意图类目体系构建与分析:百度学术查询日志的实证

王瑞雪1, 方婧1, 李信1, 陆伟1,2, 张显3   

  1. 1 武汉大学信息管理学院, 武汉 430072;
    2 信息检索与知识挖掘研究所, 武汉 430072;
    3 百度时代网络技术(北京)有限公司, 北京 100085
  • 收稿日期:2020-06-12 修回日期:2020-10-12 出版日期:2021-02-20 发布日期:2021-04-14
  • 作者简介:王瑞雪(ORCID:0000-0001-5932-9036),博士研究生,E-mail:ruixue_wang@whu.edu.cn;方婧(ORCID:0000-0002-9538-7812),硕士;李信(ORCID:0000-0002-8169-6059),博士研究生;陆伟(ORCID:0000-0002-0929-7416),院长,教授,博士;张显(ORCID:0000-0002-8274-9523),硕士。
  • 基金资助:
    本文系国家社会科学基金青年项目“面向学术搜索的查询意图研究”(项目编号:19CTQ023)研究成果之一。

The Construction and Analysis of Academic Query Intent Taxonomy: An Empirical Study of Baidu's Academic Search Query Log

Wang Ruixue1, Fang Jing1, Li Xin1, Lu Wei1,2, Zhang Xian3   

  1. 1 School of Information Management, Wuhan University, Wuhan 430072;
    2 Information Retrieval and Knowledge Mining Laboratory, Wuhan University, Wuhan 430072;
    3 Baidu Times Network Technology (Beijing) Co., Ltd. Beijing 100085
  • Received:2020-06-12 Revised:2020-10-12 Online:2021-02-20 Published:2021-04-14

摘要: [目的/意义] 了解、分析和识别用户学术搜索时所表达的信息需求是优化查询结果、提高学术搜索引擎用户体验的首要步骤,而用户进行学术搜索时通过查询表达式所表达的用户表意信息需求及潜在信息需求可称之为学术查询意图。本文总结学术查询意图类目体系有助于学术查询意图识别和检索结果页面的呈现。[方法/过程] 在A.Broder的查询意图类目体系的基础上,结合百度学术搜索查询日志中查询表达式实例,构建学术查询意图的类目体系。以此为基础,总结不同类别的学术查询意图,并分析不同类别学术查询意图下查询表达式的特点。[结果/结论] 学术查询意图主要分为学术文献类、学术实体类、学术探索类、知识问答类和非学术文献类五大类;得出不同类别学术查询意图在学术搜索中的大致比例;给出每类学术查询意图的查询表达式特征、查询情景和查询结果页。

关键词: 学术搜索, 查询意图, 类目体系, 查询日志, 百度学术

Abstract: [Purpose/significance] During academic search, understanding, analyzing and identifying the information needs expressed by users is the first step to optimize query results and improve the user experience of academic search engines. In this paper, we called it the academic query intent, which refers to the user's ideographic information needs and potential information expressed through query. Summarizing the academic query intent taxonomy is helpful for the identification of academic query intent and the presentation of search result pages.[Method/process] Based on the A.Broder's taxonomy of query intent, this study combined with the Baidu's academic search query log to construct a taxonomy of query intent in academic search. On this basis, this paper identified the different academic query categories manually and analyzed the characteristics of query intent in different types of academic queries.[Result/conclusion] The user's academic query intentions are divided into five categories:academic literature, academic entities, academic exploration, knowledge quiz and non-academic literature. For different types of academic query intent, the study draw the approximate proportions and given the characteristics, scenarios and result pages of the query.

Key words: academic search, query intent, taxonomy, query log, Baidu Academic

中图分类号: