情报研究

基于日志挖掘的用户健康信息检索行为研究

  • 王若佳 ,
  • 李培
展开
  • 1. 南开大学商学院, 天津, 300100;
    2. 天津图书馆, 天津, 300201
王若佳(ORCID:0000-0003-1806-0688),E-mail:wrjvswxl@qq.com;李培,馆长。

收稿日期: 2015-05-05

  修回日期: 2015-05-20

  网络出版日期: 2015-06-05

A Study on Health Information Search Behavior Based on Log Mining

  • Wang Ruojia ,
  • Li Pei
Expand
  • 1. Business School, Nankai University, Tianjin 300100;
    2. Tianjin Library, Tianjin 300201

Received date: 2015-05-05

  Revised date: 2015-05-20

  Online published: 2015-06-05

摘要

[目的/意义] 针对当前我国网络用户的健康信息检索行为, 探索利用中文搜索引擎的健康信息检索规律, 为完善健康搜索引擎和网站建设提供参考。[方法/过程] 基于搜狗搜索引擎的大规模查询日志, 采用日志挖掘的方法, 从查询行为和点击行为两个角度对网络用户的健康信息检索行为进行研究。查询行为的研究指标包括会话层(会话长度、用户重复查询), 查询串层(查询串长度、重复查询)和词项层(高频词汇, 主题分类);点击行为的研究指标为点击位置和点击内容。[结果/结论] 健康相关查询的重复率较高, 提示相关网站可缓存高重复率查询串的返回结果;大众关注的热点领域为疾病、保健、母婴、医疗机构与美容整形, 提示网站的导航设计注意导航方向;用户更偏爱使用问答型平台, 提示网站设计者应更加关注与用户间问答型的互动模式。

本文引用格式

王若佳 , 李培 . 基于日志挖掘的用户健康信息检索行为研究[J]. 图书情报工作, 2015 , 59(11) : 111 -118 . DOI: 10.13266/j.issn.0252-3116.2015.11.016

Abstract

[Purpose/significance] This paper studied the Chinese internet users'search behaviors of health information, explored the search rules of Chinese search engines, to provide reference for health search engines and websites.[Method/process] With the methods of log mining from the health queries in Sogou search engine,itstudied online health search behaviors from the respects of inquiring behavior and clicking behavior. The research indicators of inquiring behavior consist of session level, query level and term level; clicking behavior's research indicators include the click distribution and high frequency URLs. [Result/conclusion] Results show that (1)the repetition rate of health-related searches is relatively higher which suggests that search engines should buffer the returned results of these queries, (2) the top topics are about diseases, health care, pregnancy/baby, medical institutions and cosmetic plastics which provides a direction on website navigation, (3) the users prefer question-answer platforms which means that the website operators should focus on such pattern of interaction.

参考文献

[1] 中国互联网络信息中心(CNNIC).第16次中国互联网络发展状况统计报告[EB/OL]. [2015-04-16].http://www.cnnic.net.cn/.
[2] 黄成. 基于非医学专业信息用户需求的我国医学健康网站可用性评价研究[D]. 重庆:西南大学, 2008.
[3] Hersh W R, Hickam D H. How well do physicians use electronic information retrieval systems?: A framework for investigation and systematic review[J]. Jama, 1998, 280(15): 1347-1352.
[4] 张馨遥, 曹锦丹. 网络环境下用户健康信息需求的影响因素分析[J]. 医学与社会, 2010, 23(9): 25-27.
[5] Shuyler K S, Knight K M. What are patients seeking when they turn to the Internet? Qualitative content analysis of questions asked by visitors to an orthopaedics Web site[J]. Journal of Medical Internet Research, 2003, 5(4):e24.
[6] 张洪武, 冯思佳, 赵文龙, 等. 基于网络用户搜索行为的健康信息需求分析[J]. 医学信息学杂志, 2011, 32(5): 13-18.
[7] Spink A, Yang Yin, Jansen J, et al. A study of medical and health queries to Web search engines[J]. Health Information & Libraries Journal, 2004, 21(1): 44-51.
[8] Zeng Qing, Kogan S, Ash N, et al. Characteristics of consumer terminology for health information retrieval[J]. Methods of Information in Medicine, 2002, 41(4): 289-298.
[9] 李菲, 张嘉熙, 李宁. 医生网络信息检索行为与医学图书馆信息服务策略探究——以山西省为例[J]. 图书馆理论与实践, 2014 (4): 40-43.
[10] Wildemuth B M. The effects of domain knowledge on search tactic formulation[J]. Journal of the American Society for Information Science and Technology, 2004, 55(3): 246-258.
[11] Sillence E, Briggs P, Fishwick L, et al. Trust and mistrust of online health sites[C]//Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Vienna:ACM, 2004: 663-670.
[12] Eysenbach G, Kohler C. How do consumers search for and appraise health information on the world wide Web? Qualitative study using focus groups, usability tests, and in-depth interviews[J]. BMJ, 2002, 324(7337): 573-577.
[13] Toms E G, Latter C. How consumers search for health information[J]. Health Informatics Journal, 2007, 13(3): 223-235.
[14] 吴丹,李一喆. 老年人网络健康信息检索行为实验研究[J]. 图书情报工作,2014,58(12):102-108.
[15] Rice R E. Influences, usage, and outcomes of Internet health information searching: Multivariate results from the Pew surveys[J]. International Journal of Medical Informatics, 2006, 75(1): 8-28.
[16] Zeng Qing, Kogan S, Ash N, et al. Patient and clinician vocabulary: how different are they?[J]. Studies in Health Technology and Informatics, 2001 (1): 399-403.
[17] 王继民, 李雷明子, 孟涛. Web 搜索引擎日志挖掘研究框架[J]. 数字图书馆论坛, 2011(8):25-31.
[18] Spink A, Wolfram D, Jansen M B J, et al. Searching the Web: The public and their queries[J]. Journal of the American Society for Information Science and Technology, 2001, 52(3): 226-234.
[19] 中国互联网络信息中心(CNNIC).第35次中国互联网络发展状况统计报告[EB/OL]. [2015-02-03].http://www.cnnic.net.cn/.
[20] 搜狗实验室.用户查询日志(SogouQ)[EB/OL]. [2015-02-03].http://www.sogou.com/labs/dl/q.html.
[21] Eysenbach G, Kohler C. What is the prevalence of health-related searches on the World Wide Web? Qualitative and quantitative analysis of search engine queries on the Internet[OL].[2015-02-03]. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1480194/.
[22] 窦志成, 袁晓洁, 何松柏. 大规模中文搜索日志中查询重复性分析[J]. 计算机工程, 2008, 34(21):40-41.
[23] 王浩, 姚长利, 郭琳, 等. 基于中文搜索引擎网络信息用户行为研究[J]. 计算机应用研究, 2009, 26(12):4665-4668.
[24] 董志安, 吕学强. 基于百度搜索日志的用户行为分析[J]. 计算机应用与软件, 2013, 30(7): 17-20.
[25] White R W, Dumais S, Teevan J. How medical expertise influences web search interaction.[C]//Proceedings of the 31st Annual international ACM SIGIR Conference on Research and Development in Information retrieval. Singapore:ACM, 2008:791-792.

文章导航

/