Library and Information Service (图书情报工作), 2022, Vol. 66, Issue 7: 88-98. DOI: 10.13266/j.issn.0252-3116.2022.07.009

• Work Research •

A Comparative Study on Open Data Platforms of Provincial Government in China Based on Text Mining

Chen Mei (陈美)1,2, He Qi (何祺)1,2

  1. School of Public Administration, Zhongnan University of Economics and Law, Wuhan 430073;
  2. National Governance and Public Policy Research Center, Zhongnan University of Economics and Law, Wuhan 430073
  • Received: 2021-11-11 Revised: 2022-01-17 Online: 2022-04-05 Published: 2022-04-15
  • About the authors: Chen Mei, associate professor, PhD, E-mail: chenmei672236@163.com; He Qi, master's candidate.
  • Funding:
    This paper is one of the research outcomes of the National Natural Science Foundation of China project "Research on the User-Oriented Mechanism of Open Government Data Usage Behavior and Privacy Risk Control" (Grant No. 72004056) and of the Zhongnan University of Economics and Law project "Research on the Optimization of Open Government Data Policy" (Grant No. 2722022BQ039), supported by the Fundamental Research Funds for the Central Universities.

Abstract: [Purpose/Significance] Taking 14 provincial government open data platforms in China as the research object, this paper compares them along multiple dimensions and offers suggestions for the development of government open data platforms in China. [Method/Process] Data were collected with a web crawler, analyzed descriptively, and mined with a TF-IDF model. Starting from the data layer and the platform layer, qualitative and quantitative methods were used to compare the platforms on the granularity of data resources, domain distribution, timeliness, format types, retrieval types, access conversion rate, and user feedback. [Result/Conclusion] Provincial open data platforms are currently at different stages of development and leave room for improvement. For example, the release plan for data sets should take into account each province's characteristics and the number of data sets, and platform construction should pay attention to data retrieval methods, training, and user feedback.

Key words: open data, government open data, open government, comparison
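
The abstract names a TF-IDF model as its text-mining step without giving details. As a rough illustration only, the sketch below computes TF-IDF weights in plain Python. It is not the authors' implementation: the function name `tf_idf`, the toy `corpus`, and the weighting variant (term count divided by document length, times the natural log of N over document frequency) are all assumptions, and real Chinese text would first need word segmentation.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document.

    Uses tf = count / document length and idf = ln(N / df), a common
    textbook variant; the paper does not state which variant it used.
    """
    n_docs = len(documents)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical, already-tokenized data set descriptions (not from the paper).
corpus = [
    ["traffic", "bus", "route", "data"],
    ["hospital", "health", "data"],
    ["school", "education", "data"],
]

for doc_weights in tf_idf(corpus):
    # Print the highest-weighted term of each document.
    top_term = max(doc_weights, key=doc_weights.get)
    print(top_term, round(doc_weights[top_term], 3))
```

A term that appears in every document, such as "data" in this toy corpus, receives a weight of zero, which is why TF-IDF tends to surface the terms that distinguish one platform's data set descriptions from another's.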

CLC Number: