图书情报工作 ›› 2020, Vol. 64 ›› Issue (24): 63-72.DOI: 10.13266/j.issn.0252-3116.2020.24.008

• 纪念中国科学院文献情报中心成立70周年专辑 • 上一篇    下一篇

面向智慧知识服务的科技文献大数据体系建设

吴振新1,2, 钱力1,2, 谢靖1,2, 常志军1,2, 许丽媛1, 赵艳1,2   

  1. 1. 中国科学院文献情报中心 北京 100190;
    2. 中国科学院大学经济与管理学院图书情报与档案管理系 北京 100190
  • 收稿日期:2020-11-05 修回日期:2020-12-20 出版日期:2020-12-20 发布日期:2020-12-20
  • 通讯作者: 许丽媛(ORCID:0000-0002-8326-4372),馆员,通讯作者,E-mail:xuly@mail.las.ac.cn
  • 作者简介:吴振新(ORCID:0000-0003-4966-1961),研究馆员,博士生导师,;钱力(ORCID:0000-0002-0931-2882),研究馆员,硕士生导师;谢靖(ORCID:0000-0001-6698-1786),副研究馆员,硕士生导师;常志军(ORCID:0000-0001-9211-8599),副研究馆员,硕士生导师;赵艳(ORCID:0000-0002-0515-1954),研究馆员,博士,硕士生导师。
  • 基金资助:
    本文受"中国科学院文献情报中心成立七十周年主题论坛与纪念文集出版"项目资助出版。

Construction of Sci-Tech Big Data System oriented to Intelligent Knowledge Service

Wu Zhenxin1,2, Qian Li1,2, Xie Jing1,2, Chang Zhijun1,2, Xu Liyuan1, Zhao Yan1,2   

  1. 1. National Science Library, Chinese Academy Sciences, Beijing 100190;
    2. Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190
  • Received:2020-11-05 Revised:2020-12-20 Online:2020-12-20 Published:2020-12-20

摘要: [目的/意义] 探索构建文献情报大数据知识资源体系,支撑面向多领域的智慧知识服务。[方法/过程] 基于AI应用需求,借鉴业界经验,梳理现有资源体系的问题,从多层次多维度扩展资源体系;构建可靠数据处理流程和计算平台,支持高效数据采集和处理;研发智能化数据治理工具,实现知识资源的有效治理,确保提供高质量数据资源。[结果/结论] 已初步形成覆盖多类型、多学科的科技文献大数据知识资源体系,构建完成高度自动化的数据采集治理流程,实施多重数据质量控制,积累数亿高质量数据,且为多个知识服务提供数据支撑。

关键词: 科技大数据, 知识资源体系, 数据汇聚, 智慧知识服务

Abstract: [Purpose/significance] The paper explores the construction of literature intelligence big data knowledge resource system, which supports multi-domain intelligent knowledge service.[Method/process] Based on the AI application requirements, drawing on the industry experience, combing the problems of existing resource system, the paper expanded the resource system from multi-level and multi-dimensional, built a reliable data processing process and computing platform to support efficient data collection and processing, and developed intelligent data governance tools to achieve effective governance of knowledge resources and ensure the provision of high-quality data resources.[Result/conclusion] It has initially formed a knowledge resource system covering multiple types and disciplines of sci-tech literature, constructed and completed a highly automated data collection and governance process, implemented multiple data quality control, and accumulated hundreds of millions of high-quality data. At present, it has provided data support for multiple knowledge services.

Key words: science and technological big data, knowledge resource architecture, data aggregation, intelligent knowledge services

中图分类号: