SPECIAL TOPIC: Research on the Preservation and Utilization of Web Information Resources

Construction of Subject Knowledge Base of Government Website Pages Based on Domain Ontology:
Taking the Subject of “COVID-19 Vaccine Science Popularization” as an Example

  • Huang Xinping
  • Pan Rongzhuang
  • Mao Yinghao
  • Zhu Siyuan
  • Xu Songqin
  • School of Business and Management, Jilin University, Changchun 130012

Received date: 2022-04-09

Revised date: 2022-06-19

  Online published: 2022-09-09

Abstract

[Purpose/Significance] From a knowledge management perspective, this paper takes the large number of isolated, scattered government Web pages collected for selected topics as the knowledge source and constructs a corresponding subject knowledge base. The aim is to help the public quickly and efficiently obtain key information and precise knowledge from massive Web archive resources. [Method/Process] Drawing on topic-focused Web crawling, natural language processing, domain ontology, and knowledge reasoning, the paper proposes a method for constructing a "COVID-19 vaccine science popularization" knowledge base that comprises five parts: subject knowledge source, knowledge acquisition, knowledge representation, knowledge reasoning, and knowledge service. First, a Web crawler was designed to obtain text data from the thematic Web pages, and a hybrid method was used to extract domain concept knowledge from that text. Then, the domain ontology was constructed by defining ontology classes and the hierarchy among them, defining object properties and data properties, and adding instances; the knowledge rules were also formalized, completing the construction of the subject knowledge base. Finally, Protégé and its plug-ins, together with knowledge reasoning and other methods, were used to implement semantic knowledge retrieval, visual ontology querying, and a knowledge Q&A service for the "COVID-19 vaccine science popularization" knowledge base. [Result/Conclusion] The results show that the subject knowledge base has good reasoning and analysis functions and can effectively support accurate acquisition of knowledge on COVID-19 vaccine science popularization. Its application is of practical significance for improving the effectiveness of COVID-19 vaccine science popularization.
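
The ontology-construction and reasoning steps named in the abstract (classes and their hierarchy, object properties, data properties, instances, and a formalized rule) can be illustrated with a minimal Python sketch. This is not the authors' implementation, which relies on Protégé and its plug-ins; it is a toy in-memory triple store with a single forward-chaining rule. Every class, property, and instance name below (Vaccine, InactivatedVaccine, ExampleVaccineA, hasTargetGroup, doseCount) is a hypothetical placeholder, and the values are not taken from the paper.

```python
# Toy illustration only: a tiny triple store with one inference rule.
# All names and values are hypothetical placeholders, not data from the paper.

SUBCLASS_OF = "subClassOf"
TYPE = "type"


def build_ontology():
    """Return a set of (subject, predicate, object) triples."""
    triples = set()
    # Class hierarchy: InactivatedVaccine is a kind of Vaccine.
    triples.add(("InactivatedVaccine", SUBCLASS_OF, "Vaccine"))
    # Instance of a class.
    triples.add(("ExampleVaccineA", TYPE, "InactivatedVaccine"))
    # Object property: links an instance to another individual.
    triples.add(("ExampleVaccineA", "hasTargetGroup", "Adults"))
    # Data property: links an instance to a literal value (placeholder).
    triples.add(("ExampleVaccineA", "doseCount", 2))
    return triples


def infer_types(triples):
    """Apply the rule (x type C) and (C subClassOf D) => (x type D) to a fixpoint."""
    inferred = set(triples)
    changed = True
    while changed:
        changed = False
        new = set()
        for s, p, o in inferred:
            if p != TYPE:
                continue
            for c, p2, d in inferred:
                if p2 == SUBCLASS_OF and c == o and (s, TYPE, d) not in inferred:
                    new.add((s, TYPE, d))
        if new:
            inferred |= new
            changed = True
    return inferred


def query(triples, s=None, p=None, o=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    matches = [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]
    return sorted(matches, key=str)


if __name__ == "__main__":
    kb = infer_types(build_ontology())
    # Which individuals are vaccines, including inferred memberships?
    for subject, _, _ in query(kb, p=TYPE, o="Vaccine"):
        print(subject)
```

The query for individuals of type Vaccine succeeds only because the rule has been applied first, which is the kind of implicit knowledge a reasoner adds on top of the asserted ontology. A production system would more plausibly use OWL with a dedicated reasoner than a hand-written loop like this one.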

Cite this article

Huang Xinping, Pan Rongzhuang, Mao Yinghao, Zhu Siyuan, Xu Songqin. Construction of Subject Knowledge Base of Government Website Pages Based on Domain Ontology: Taking the Subject of “COVID-19 Vaccine Science Popularization” as an Example[J]. Library and Information Service, 2022, 66(17): 35-46. DOI: 10.13266/j.issn.0252-3116.2022.17.004
