知识组织

受控词表中多维坐标系统构建——以公共数字文化资源整合为例

  • 张芳源 ,
  • 司莉
展开
  • 武汉大学信息管理学院 武汉 430072
张芳源(ORCID:0000-0002-8670-7589),博士研究生,E-mail:fyzhang@whu.edu.cn;司莉(ORCID: 0000-0003-1028-8338),教授,博士生导师。

收稿日期: 2015-02-06

  修回日期: 2015-03-09

  网络出版日期: 2015-03-20

基金资助

本文系国家社会科学基金重点项目"公共数字文化服务中的资源整合研究"(项目编号:13ATQ001)研究成果之一。

Building Multi-Dimensional Coordinate System in Controlled Vocabulary: Taking Digital Resource Integration of Public cultural institutes for an Example

  • Zhang Fangyuan ,
  • Si Li
Expand
  • School of Information Management, Wuhan University, Wuhan 430072

Received date: 2015-02-06

  Revised date: 2015-03-09

  Online published: 2015-03-20

摘要

[目的/意义]以公共图书馆、博物馆、美术馆和群众艺术馆数字资源整合为例,探讨通过赋予受控词汇"身份",提高资源检索的效率的方法。[方法/过程]定义多维坐标系统空间面、主题坐标轴和坐标点;通过为词汇概念赋予标识符,建立概念与词汇的关联,按一定规则为词汇赋予"身份",以概念优选机制、关联数据技术与索引表构建作为其辅助。此外,通过解析用户检索词语义,构建语义标识符,并对概念标识符进行拆分、组合,利用测算标识符点距的方法建立语义标识符与概念标识符之间的映射关系,实现检索维度优选。[结果/结论]多维坐标系统的坐标关系模型以"面-线-点"的坐标关系处理层次为基础,以"概念定位-词汇定位-资源定位"的检索层次为依据,并结合优选、关联与索引,拆分、组合与点距等相关实现机制,通过量化方法来处理词汇关系,能够提高机器对词汇的理解。

本文引用格式

张芳源 , 司莉 . 受控词表中多维坐标系统构建——以公共数字文化资源整合为例[J]. 图书情报工作, 2015 , 59(6) : 97 -103 . DOI: 10.13266/j.issn.0252-3116.2015.06.015

Abstract

[Purpose/significance] Taking digital resource integration of libraries, museums, art museums and mass art museums as examples, this paper discusses a way to improve retrieval efficiency by assigning the controlled vocabularies with their "identifiers".[Method/process] A coordinate system is defined, which consists of three parts: space surfaces,coordinate axes and coordinate points that represent different vocabulary concepts. Following certain regulations, the concepts are assigned "identifiers" to establish relationship between vocabularies. Meanwhile, the system is supported by the "concept optimization" mechanism, the data-linking technology and the index list construction mechanism. For search engine, words inputted by users are firstly assigned with semantic "identifiers", then the "identifiers" of concepts will be analyzed in parts, before the mapping relationship of the two identifiers is generated. With these processes, retrieval dimensionality optimization is achieved.[Result/conclusion] The coordinate relationship model of the multi-dimensional coordinate system is based on the hierarchical model of "plane-line-point", and is in accordance with the "concept locating, vocabulary locating, and resource locating" retrieval hierarchy. With the help of implementation mechanisms like optimization, association and indexing, split and merging, it processes the relationship of words numerically, and the understandability of vocabulary is enhanced.

参考文献

[1] 范炜. 受控词表的术语服务研究[J]. 图书情报工作, 2012, (14): 34-39,97.
[2] Baca M. Practical issues in applying metadata schemas and controlled vocabularies to cultural heritage information[J]. Cataloging & Classification Quarterly, 2003, 36(3/4): 47-55.
[3] 吴雯娜, 王星. 叙词表融合方法研究[J]. 中国图书馆学报, 2012(4): 110-118.
[4] National Information Standards Organization. Guidelines for the construction, format, and management of monolingual controlled vocabularies[M].Baltimore:NISO Press, 2005.
[5] 张继宏. 专利标准化视角的多维集成创新研究[D].武汉:华中科技大学,2011.
[6] Volker G, Günther O. Multidimensional access methods[J]. ACM Computing Surveys (CSUR), 1998, 30(2):170-231.
[7] UMLS® Reference Manual[EB/OL].[2015-01-12]. http://www.ncbi.nlm.nih.gov/books/NBK9684/.
[8] 李丹亚, 胡铁军, 李亚子,等. UMLS多词表整合机制研究[J]. 数字图书馆论坛, 2012(4): 28-36.
[9] Isaac A. Europeana Data Model Primer[EB/OL].[2015-01-12]. http://pro.europeana.eu/files/Europeana_Professional/Share_your_data/Technical_requirements/EDM_Documentation/EDM_Primer_130714.pdf.
[10] 李永兵, 陈旭瑞, 胡俊峰, 等. 基于GIS的地质数据库系统:研究现状和发展趋势[J]. 地球物理学进展, 2002, 17(3): 532-539,558.
[11] Guenther O, Buchmann A. Research issues in spatial databases[J]. ACM Sigmod Record, 1990, 19(4): 61-68.
[12] Zeng Marcia Lei. Construction of controlled vocabularies, a primer (based on Z39.19)[EB/OL].[2015-01-12].http://www.slis.kent.edu/~mzeng/Z3919/index.htm.
[13] Zhou Bing, Yao Yiyu. Evaluating information retrieval system performance based on user preference[J]. Journal of Intelligent Information Systems, 2010, 34(3): 227-248.
[14] Yao Y Y. Measuring retrieval effectiveness based on user preference of documents[J]. Journal of the American Society for Information Science, 1995, 46(2): 133-145.
[15] Cole T W, Han M K, Vannoy J A. Descriptive metadata, iconclass, and digitized emblem literature[C]//Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries.New York:ACM, 2012: 111-120.
[16] 葛本仪. 论词汇静态、动态形式的结合研究[J]. 山东大学学报(哲学社会科学版), 2004(6): 37-41.
[17] 司莉, 李鑫. 图书馆应用关联数据的策略分析[J]. 图书馆工作与研究, 2013(10): 32-35.
[18] 张宇, 刘雨东, 计钊. 向量相似度测度方法[J]. 声学技术, 2009, 28(4): 532-536.
[19] 郑仕辉, 周傲英, 张龙. XML文档的相似测度和结构索引研究[J]. 计算机学报, 2003, 26(9): 1116-1122.

文章导航

/