知识组织

网络学术文档细粒度聚合本体构建研究

  • 马翠嫦 ,
  • 曹树金
展开
  • 1. 中山大学图书馆 广州 510275;
    2. 中山大学资讯管理学院 广州 510006
马翠嫦(ORCID:0000-0002-2478-4714),副研究馆员,博士,E-mail:xx00217@163.com;曹树金(ORCID:0000-0003-1855-4522),教授,博士生导师。

收稿日期: 2019-07-30

  修回日期: 2019-10-28

  网络出版日期: 2019-12-20

基金资助

本文系中央高校基本科研业务费项目"支持跨学科知识发现的学术论文信息单元识别与聚合研究"(项目编号:17wkpy56)和国家社会科学基金重大项目"基于特定领域的网络资源知识组织与导航机制研究"(项目编号:12&ZD222)研究成果之一。

Study on the Construction of Fine-grained Aggregation Ontology for Academic Documents in the Internet Environment

  • Ma Cuichang ,
  • Cao Shujin
Expand
  • 1. Sun Yat-sen University Librariy, Guangzhou 510275;
    2. School of Information Management, Sun Yat-sen University, Guangzhou 510006

Received date: 2019-07-30

  Revised date: 2019-10-28

  Online published: 2019-12-20

摘要

[目的/意义] 旨在探索网络学术文档细粒度聚合本体构建的理论和方法。[方法/过程] 在梳理相关理论与方法的基础上,首先明晰细粒度聚合本体概念的基本类型、粒度特征和定义等基本理论问题,然后以网络环境下图书情报学领域"引文分析"主题语料为数据来源,从概念、属性和关系、实例等方面对细粒度聚合单元本体构建进行逐一探讨,并对本体进行评估和讨论。[结果/结论] 首次提出基于聚合单元知识体系构建细粒度聚合本体的思路与方法,可为基于聚合单元的细粒度组织、检索和导航中知识组织系统工具的构建提供参考。

本文引用格式

马翠嫦 , 曹树金 . 网络学术文档细粒度聚合本体构建研究[J]. 图书情报工作, 2019 , 63(24) : 107 -118 . DOI: 10.13266/j.issn.0252-3116.2019.24.012

Abstract

[Purpose/significance] Fine-grained information aggregation has become the focus in the field of knowledge organization. This paper aims at exploring the construction of fine-grained aggregation ontology for academic documents in the Internet environment.[Method/process] This study clarified the types, granularity characteristics and definitions of the concepts of the fine-grained aggregation ontology. Then, with the corpus of "citation analysis" documents in the field of library and information science in the Internet environment, the ontology was built through the concepts, attributes and relationships. At last, the ontology was evaluated and discussed.[Result/conclusion] This paper is among the first to propose the idea of the fine-grained aggregated ontology construction by using the concept of aggregation unit. This paper can inform the construction of knowledge organization systems for fine-grained organization, retrieval and navigation based on aggregation unit.

参考文献

[1] 文庭孝,罗贤春,刘晓英,等. 知识单元研究述评[J]. 中国图书馆学报,2011, 37(5):75-86.
[2] 温有奎. 焦玉英. 基于知识元的知识发现[M]. 西安:西安电子科技大学出版社, 2011.
[3] 温有奎.基于"知识元"的知识组织与检索[J].计算机工程与应用, 2005, 41(1): 55-57,91.
[4] 温有奎,徐国华. 知识元链接理论[J]. 情报学报, 2003, 22(6): 665-670.
[5] 温有奎,温浩,徐端颐,等. 基于知识元的文本知识标引[J]. 情报学报, 2006, 25(3): 282-288.
[6] 王燕,温有奎.文本单元向知识单元转化的研究[J].情报理论与实践, 2007, 30(3): 409-411,362.
[7] 温有奎,焦玉英. 基于范畴论的知识单元组织与检索研究[J]. 情报学报, 2010, 29(3): 387-392.
[8] 温有奎,焦玉英. Wiki知识元语义图研究[J]. 情报学报,2009, 28(6): 870-876.
[9] 温有奎,焦玉英.知识元语义链接模型研究[J]. 图书情报工作, 2010, 54(12): 27-31.
[10] 周秀会.知识元搜索引擎:CNKI知识搜索平台[J].现代情报,2007, 27(5):220-222.
[11] 陶善菊,刘清堂,王凡,等. 基于知识元的教育技术学科资源库构建[J]. 现代教育技术, 2011, 21(5): 115-120.
[12] ZOU J,LIU Q. A knowledge element model for knowledge abstract and fusion system[C]//2009 International conference on new trends in information and service science.Washington, DC: IEEE Computer Society, 2009: 23-26.
[13] TRACE C B, DILLON A. The evolution of the finding aid in the United States-from physical to digital document genre[J]. Archival science, 2012, 12(4): 501-519.
[14] SWALES J M. Aspects of article introductions[M]. Birmingham: the University of Aston in Birmingham, 1981.
[15] CROOKES G. Towards a validated analysis of scientific text structures[J]. Applied linguistics, 1986, 7(1): 57-70.
[16] HOPKINS A, DUDLEY-EVANS T. A genre-based investigation of the discussion sections in articles and dissertations[J]. English for specific purposes, 1988, 7(2): 113-121.
[17] SAMRAJ B. Introductions in research articles: variations across disciplines[J]. English for specific purposes, 2002, 21(1): 1-17.
[18] POSTEGUILLO S. The schematic structure of computer science research articles[J]. English for specific purposes, 1999, 18(2): 139-160.
[19] BRUCE I. Cognitive genre structures in methods sections of research articles: a corpus study[J]. Journal of English for academic purposes, 2008, 7(1): 38-54.
[20] BRETT P. A genre analysis of the results section of sociology articles[J]. English for specific purposes, 1994, 13(1): 47-59.
[21] KANOKSILAPATHAM B. Rhetorical structure of biochemistry research articles[J]. English for specific purposes, 2005,24(5):269-292.
[22] NWOGU K N. The medical research paper: structure and functions[J]. English for specific purposes, 1997, 16(2):119-138.
[23] LEWIN B A, FINE J, YOUNG L. Expository discourse: a genre based approach to social science research texts[M]. London: Continuum, 2001.
[24] 赵福利. 英语电视新闻导语的语步结构分析[J]. 外语教学与研究,2001,33(2):99-104.
[25] 葛冬梅, 杨瑞英. 学术论文摘要的体裁分析[J]. 现代外语, 2005, 28(2): 138-146,219.
[26] 崔艳嫣, 王同顺. 英语学术讲座的宏观结构与微观结构——体裁分析在学术语篇分析中的应用[J]. 山东外语教学, 2004(5): 27-30.
[27] 杨瑞英. 体裁分析的应用:应用语言学学术文章结构分析[J]. 外语与外语教学,2006(10):29-34.
[28] BISHOP A P. Document structure and digital libraries: how researchers mobilize information in journal articles[J]. Information processing & management, 1999,35,(3): 255-279.
[29] DILLON A. Designing usable electronic text[M]. Boca Raton FL: CRC Press, 2004.
[30] DILLON A, SCHAAP D. Expertise and the perception of shape in information[J]. Journal of the American Society for Information Science and Technology, 1996, 47(10): 786-788.
[31] VAUGHAN M W. Identifying regularities in users’ conceptions of information spaces: designing for structural genre conventions and mental representations of structure for Web-based newspapers[D]. Indiana: Indiana University, 1999.
[32] VAUGHAN M W, DILLON A. Learning the shape of information: a longitudinal study of Web-news reading[C]//Proceedings of the fifth ACM conference on digital libraries. New York:ACM, 2000: 236-237.
[33] DILLON A, SCHAAP D. Expertise and the perception of shape in information[J]. Journal of the American Society for Information Science and Technology, 1996, 47(10): 786-788.
[34] DILLON A. Spatial-Semantics: how users derive shape from information space[J]. Journal of the American Society for Information Science, 2000, 51(6): 521-528.
[35] ZHANG L, KOPAK L R, FREUND L, et al. A taxonomy of functional units for information use of scholarly journal articles[C]//Proceedings of the American Society for Information Science and Technology. MD, USA: American Society for Information Science Silver Springs, 2010,47(1): 1-10.
[36] ZHANG L, KOPAK L R, FREUND L, et al. Making functional units functional- the role of rhetorical structure in use of scholarly journal articles[J]. International journal of information management, 2011,31(1): 21-29.
[37] ZHANG L. Grasping the structure of journal articles: utilizing the functions of information units[J]. Journal of the American Society for Information Science and Technology, 2012, 63(3): 469-480.
[38] MA C-C,CAO S-J. Identifying structural genre conventions across academic web documents for information use[C]//Proceedings of the Association for Information Science & Technology. Somerset, NJ: John Wiley & Sons, 2017: 260-267.
[39] 马雨萌, 刘凤红,黄金霞. STKOS中领域本体模型框架研究[J]. 图书情报工作, 2015. 59(3): 119-125, 139.
[40] 邱均平, 杨强,楼雯. 资源本体构建理论与实证研究[J]. 情报理论与实践, 2014,37(5): 1-6.
[41] MIZOGUCHI R. YAMATO: Yet another more advanced top-level ontology[EB/OL]. [2019-10-28].http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=00B4895D3EF153E0F74DC6B248D307FB?doi=10.1.1.221.1614&rep=rep1&type=pdf.
[42] 张囡囡. 面向语义网的领域本体半自动构建方法的研究[D]. 大连:大连海事大学,2008.
[43] GUARINO N. Semantic matching: formal ontological distinctions for information organization, extractiong and integration[C]//PAZIENZA M T. Information extraction: a multidisciplinary approach to an emerging information technology. Berlin: Springer Verlag, 1997: 139-170.
[44] 郭嘉琦. 领域本体的构建及其在信息检索中的应用研究[D]. 北京:北京邮电大学,2007.
[45] 王向前, 张宝隆,李慧宗. 本体研究综述[J]. 情报杂志, 2016,35(6): 163-170.
[46] GRUBER T. Towards principles for the design of ontologies used for knowledge sharing[J]. International journal of human-computer studies, 1995, 43(5/6):907-928.
[47] 李景,孟宪血,苏晓路. 领域本体的构建方法与应用研究[M].北京:中国农业科学技术出版社,2009.
[48] DANIEL L R,NATALYA F N, MARK A M.Protégé: a tool for managing and using terminology in radiology applications[J].Journal of digital imaging,2007,20(S1): 34-46.
[49] FREUND L. A cross-domain analysis of task and genre effects on perceptions of usefulness[J]. Information processing and management, 2013, 49(5): 1108-1121.
[50] 朱嘉贤,白伟华,李吉桂. Web资源的多粒度语义标注及其应用技术研究[J]. 2011,38(8):83-87.
[51] 岳丽欣,刘文云. 国内外领域本体构建方法的比较研究[J]. 情报理论与实践,2016. 39(8): 119-125.
[52] LI Y, BELKIN N J. A faceted approach to conceptualizing tasks in information seeking [J]. Information processing and management, 2008, 44(6): 1822-1837.
文章导航

/