知识组织

面向数字人文应用的高校特藏通用标引方案探索与建议

  • 李彦霖 ,
  • 王乐
展开
  • 1 复旦大学图书馆 上海 200433;
    2 复旦大学国家智能评价与治理实验基地 上海 200433
李彦霖,馆员,档案编码标准技术委员会(Technical Subcommittee on Encoded Archival Standards,TS-EAS)委员,硕士。

收稿日期: 2022-12-09

  修回日期: 2023-02-05

  网络出版日期: 2024-01-06

Exploration and Proposals on Universal Cataloging Scheme for Special Collections in University Libraries for Digital Humanities Application

  • Li Yanlin ,
  • Wang Le
Expand
  • 1 Fudan University Library, Shanghai 200433;
    2 Fudan University National Experiment Base for the Intelligent Evaluation and Governance, Shanghai 200433

Received date: 2022-12-09

  Revised date: 2023-02-05

  Online published: 2024-01-06

摘要

[目的/意义]随着文献形式日趋多样化,数字人文技术要求的不断提高,对文献标引的深度和标引标准化需求也越来越高,迫切需要更利于数据交流和复用的标引方案以满足当前数据需求,形成顺畅的工作流及数据交换方式。[方法/过程]提出以多种XML嵌套或配合使用的特藏标引方案,统一数据格式,实行树形结构管理,进行高度控制语言的深度著录,从而实现较为通用的文献信息数据化。同时结合复旦大学图书馆的实际应用对实践的成效和局限加以讨论。[结果/结论]该特藏标引方案成效显著:采用开源软件进行标引项目管理,降低了标引成本和难度;标引方案具有开放性,生成符合数字人文需求的数据。但在实践过程中也暴露出国内数据基础建设薄弱、关联数据应用受限等问题。

本文引用格式

李彦霖 , 王乐 . 面向数字人文应用的高校特藏通用标引方案探索与建议[J]. 图书情报工作, 2023 , 67(24) : 111 -121 . DOI: 10.13266/j.issn.0252-3116.2023.24.010

Abstract

[Purpose/Significance] As the forms of literature become increasingly diverse and the demand for digital humanities technology continues to rise, there is a growing need for deeper and more standardized literature indexing. There is an urgent demand for indexing schemes that facilitate data exchange and reuse to meet current data needs, establishing a smooth workflow and data exchange method. [Method/Process] A specialized indexing scheme is proposed, employing various XML schemas nesting or combination methods, unifying data formats, implementing tree-structured management, and conducting in-depth recording in a highly controlled language. This allows for the more general digitization of document information data. Simultaneously, the article discussed the practical application, effectiveness, and limitations of this theory combining the practice of Fudan University Library. [Result/Conclusion] The specialized indexing scheme has shown significant effectiveness. The use of open-source software for indexing project management has reduced indexing costs and difficulty. The indexing scheme is open, generating data that meets the needs of digital humanities. However, in practice, it has also revealed issues such as weak domestic data infrastructure and limited application of associated data.

参考文献

[1] 王乐.略论高校图书馆特色馆藏建设的价值与发展方向[J].大学图书馆学报, 2020, 38(3):12-17. (WANG L.The unique role and trends on special collection of university libraries[J].Journal of academic libraries, 2020, 38(3):12-17.)
[2] 龙泉.双一流高校数字人文研究现状与知识图谱构建[J].图书馆学刊, 2021, 43(5):79-89. (LONG Q.Research status and knowledge map construction of digital humanities in double firstclass universities[J].Journal of library science, 2021, 43(5):79-89.)
[3] 陈以敏, 张青青.数字人文下高校图书馆手稿特色数据资源库建设研究[J].图书馆, 2021(6):87-93. (CHEN Y M, ZHANG Q Q.Research on the construction of manuscript characteristic database in university library under digital humanities[J].Library, 2021(6):87-93.)
[4] KIRSCHENBAUM M.What is digital humanities and what's it doing in English departments?[M]//GOLD M K.Debates in the digital humanities.Minneapolis:University of Minnesota Press, 2012:3-11.
[5] Museum:real, virtual, and argumented[M]//BROWN K.The routledge companion to digital humanities and art history.New York:Routledge, 2020:189-286.
[6] 欧阳剑, 蔡迎春, 王健.数字人文项目可持续性研究[J].图书馆杂志, 2021, 40(11):90-98, 116. (OUYANG J, CAI Y C, WANG J.Research on the sustainability of digital humanities projects[J].Library journal, 2021, 40(11):90-98, 116.)
[7] 罗婷予, NUNES M B.面向智能资源发现服务的城市记忆资源元数据方案构建[J].图书馆建设, 2021(5):98-106. (LUO T Y, NUNES M B.Construction and application of metadata schema of city memory resources[J].Library development, 2021(5):98-106.)
[8] GIANNETTI F.'So near while apart':correspondence editions as critical library pedagogy and digital humanities methodology[J].Journal of academic librarianship, 2019, 45(5):1-11.
[9] 赵雪芹, 莫长镭, 雷春蓉.美国高校图书馆数字人文项目调研与启示——以美国排名前10位的高校为例[J].图书馆, 2021(1):70-76. (ZHAO X Q, MO C L, LEI C R.Investigation and enlightenment of digital humanities projects in American university libraries:case study of the top 10 universities in the United States[J].Library, 2021(1):70-76.)
[10] 徐彤阳, 王淑怡.多样合作与机构引导:德国数字人文项目特点及启示探析[J].图书馆建设, 2022(4):92-101, 146. (XU T Y, WANG S Y.Diversified cooperation and institutional guidance:characteristics and enlightenment of German digital humanities projects[J].Library development, 2022(4):92-101, 146.)
[11] 饶梓欣, 许鑫.基于数据基础设施建设视角的全球图书馆、档案馆与博物馆机构合作网络研究[J].图书馆学研究, 2022(9):54-64. (RAO Z X, XU X.Research on the institutional collaboration network of global libraries, archives and museums from the perspective of data infrastructure construction[J].Research on library science, 2022(9):54-64.)
[12] 贾君枝.LAM馆藏资源的元数据整合方法比较分析[J].档案学研究, 2022(1):79-84. (JIA J Z.Matadata integration methods for LAM collection resource[J].Archives science study, 2022(1):79-84.)
[13] 卢彤, 李明杰.中文古籍数字化成果辅助人文学术研究功能的调查[J].图书与情报, 2019(1):70-79. (LU T, LI M J.Investigation on functions of digital productions of Chinese ancient books in assisting humanities research[J].Library & information, 2019(1):70-79.)
[14] 蒋鸿标.三大中文期刊全文数据库质量述评[J].现代情报, 2015, 35(9):84-88, 170. (JIANG H B.Review on the quality of 3 chinese full-text journal databases[J].Journal of modern information, 2015, 35(9):84-88, 170.)
[15] 夏翠娟, 张磊, 贺晨芝.面向知识服务的图书馆数字人文项目建设:方法、流程与技术[J].图书馆论坛, 2018, 38(1):1-9. (XIA C J, ZHANG L, HE C Z.Construction of library digital humanities projects for knowledge services:method, process and technology[J].Library tribune, 2018, 38(1):1-9.)
[16] 鲁丹, 李欣, 陈金传.基于API技术的数字人文基础设施的构建[J].图书馆学研究, 2019(13):42-46, 57. (LU D, LI X, CHEN J C.The construction of infrastructure for digital humanities based on API technology[J].Library science study, 2019(13):42-46, 57.)
[17] 汤萌, 孙翌, 刘宁静, 等.徽州文书特色资源的主题设计与标引方法研究[J].图书馆杂志, 2019, 38(4):61-68. (TANG M, SUN Y, LIU N J, et al.Subject design and indexing for special collections on huizhou documents[J].Library journal, 2019, 38(4):61-68.)
[18] 陈博, 陈建龙.基于文本挖掘和可视化技术的主题自动标引方法——以《英雄格萨尔》为例[J].现代情报, 2019, 39(8):45-51, 102. (CHEN B, CHEN J L.Subject automatic indexing method based on text mining and visualization technology:take the hero gesar as an example[J].Journal of modern information, 2019, 39(8):45-51, 102.)
[19] 夏翠娟.面向人文研究的"数据基础设施" 建设——试论图书馆学对数字人文的方法论贡献[J].中国图书馆学报, 2020, 46(3):24-37. (XIA C J.The construction of "data infrastructure" for humanities research:the methodological contribution of library science to digital humanities[J].Journal of library science in China, 2020, 46(3):24-37.)
[20] 李欣, 张毅, 汪志莉.图书馆异构特藏资源整合的数字人文研究需求[J].数字图书馆论坛, 2017(11):48-53. (LI X, ZHANG Y, WANG Z L.Digital humanities research demand of library's hereogeneous special resource integration[J].Digital library forum, 2017(11):48-53.)
[21] 贾君枝.基于关联数据的LAM馆藏资源整合[J].晋图学刊, 2022(3):28-33. (JIA J Z.Integration of LAM collection resources based on linked data[J].Shanxi library journal, 2022(3):28-33.)
[22] 王铮, 曾丽军, 周春霞, 等.CASHL特藏++平台调研与框架设计[EB/OL].[2023-07-18].http://www.cashl.edu.cn/sites/default/files/2019-11/CASHL%E7%89%B9%E8%97%8F%2B%2B%E5%B9%B3%E5%8F%B0%E8%B0%83%E7%A0%94%E4%B8%8E%E6%A1%86%E6%9E%B6%E8%AE%BE%E 8%AE%A1-%E5%8C%97%E5%A4%A7. (WANG Z, ZENG L J, ZHOU C X, et al.Research and framework design of CASHL special collection ++ platform[EB/OL].[2023-07-18].http://www.cashl.edu.cn/sites/default/files/2019-11/CASHL%E7%89% B9%E8%97%8F%2B%2B%E5%B9%B3%E5%8F%B0%E8%B 0%83%E7%A0%94%E4%B8%8E%E6%A1%86%E6%9E%B6%E8%AE%BE%E8%AE%A1-%E5%8C%97%E5%A4%A7.)
[23] TENNANT R.MARC must die[J].Library journal, 2002:26-28.
[24] 李然, 张云霞, 汪卫, 等.数字图书馆中XML元数据的存储模式及其优化[J].计算机科学, 2002, 29(11):93-97. (LI R, ZHANG Y X, WANG W, et al.The storage optimization of XML metadata in digital libraries[J].Computer science, 2002, 29(11):93-97.)
[25] FREIRE N, ISAAC A, ROBSON G, et al.A survey of web technology for metadata aggregation in cultural heritage[J].Information services & use, 2017, 37(4):425-436.
[26] 魏清华, 孙林, 胡文静.高校图书馆特藏文献数字化建设研究——以CASHL为例[J].大学图书情报学刊, 2020, 38(1):81-86. (WEI Q H, SUN L, HU W J.Research on digital construction of special collection literature in university libraries:a case study of CASHL[J].Journal of academic libraries, 2020, 38(1):81-86.)
[27] 王树锋.XML数据集中挖掘关联规划算法的比较[J].常州工学院学报, 2009, 22(6):55-59. (WANG S F.A comparison between mining association rule algorithms in XML datasets[J].Journal of Changzhou Institute of Technology, 2009, 22(6):55-59.)
[28] 仝召娟, 许鑫, 钱佳轶.基于关联数据的非遗数字资源聚合研究[J].图书情报工作, 2014, 58(21):21-26. (TONG Z J, XU X, QIAN J Y.Research on aggregation of intangible cultural heritage digital resources based on linked data[J].Library and information service, 2014, 58(21):21-26.)
[29] 谢明亮.MODS在图书馆元数据整合中的应用[J].河北科技图苑, 2015, 28(3):27-29. (XIE M L.Application of MODS in library metadata integration[J].Hebei library journal of science and technology, 2015, 28(3):27-29.)
[30] 张娟.描述性元数据MODS特性及应用[J].现代情报, 2011, 31(8):69-72. (ZHANG J.Properties and applications of MODS metadata[J].Modern information, 2011, 31(8):69-72.)
[31] CIULA A, SPENCE P, VIEIRA J M.Expressing complex associations in medieval historical documents:the Henry IIIfine rolls project[J].Journal of modern information, 2008, 23(3):311-325.
[32] The technical subcommittee for encoded archival standards of the society of American archivists.Encoded archival description tag library version EAD31.1.1[M].Chicago:Society of American Archivists, 2019.
[33] BROWN G, HARVEY K.Adding archival finding aids to the library catalogue:simple crosswalk or data traffic jam?[J].The Canadian journal of library and information practice and research, 2007, 2(2):1-18.
[34] WACKER M, HAN M J, DARTT J.Testing resource description and access (RDA) with non-MARCmetadata standards[J].Cataloging & classification quarterly, 2011, 49(7/8):655-675.
[35] 李彦霖, 王乐.利用开源平台ArchivesSpace进行大型特藏深度揭示与服务研究[J].大学图书馆学报, 2020, 38(6):89-95. (LI Y L, WANG L.Conducting in-depth cataloging and services on grand special collections using the opensource platform ArchivesSpace[J].Journal of academic libraries, 2020, 38(6):89-95.)
[36] TILLMAN R K.Opportunities for encoding EAD for linked data extraction and publication[J].Journal of archival organization, 2016, 13(1/2):19-36.
[37] CANTARA L.METS:the metadata encoding and transmission standard[J].Cataloging & classification quarterly, 2005, 40(3/4):237-253.
[38] 丁亮.数字资源长期保存元数据:Premis以及和METS的结合使用[J].无线互联科技, 2013(11):177-178, 188. (DING L.Metadata to support long-term preservation of digital assets:PREMIS and its use with METS[J].Wireless Internet technology, 2013(11):177-178, 188.)
[39] Network Development and MARC Standards Office of the Library of Congress.Metadata encoding and transmission standard:primer and reference manual:before there is a METS document[EB/OL].[2023-07-18].https://www.loc.gov/standards/mets/METSPrimer.pdf.
[40] NELLHAUS T.XML, TEI, and digital libraries in the humanities[J].Portal:libraries and the academy, 2001, 1(3):257-277.
[41] 王萍, 宋雪雁.EAD、DC、TEI著录实例及其比较分析[J].图书情报工作, 2006, 50(12):79-82. (WANG P, SONG X Y.Comparatives analysis on description examples of EAD, DC, and TEI cataloging[J].Library and information service, 2006, 50(12):79-82.)
[42] O'DELL A J.Book artists unbound:providing access to creator metadata with EAC-CPF[J].Art documentation:journal of the Art Libraries Society of North America, 2014, 33(2):267-278.
[43] TIAN T, COLE T W, YU K.Name and subject heading reconciliation to linked open data authorities using virtual international authority file and library of congress linked data service APIs:a case study featuring emblematica online[J].Library resources & technical services, 2021, 65(4):132-41.
[44] Technical Subcommittee for Encoded Archival Context of the Society of American Archivists, Staatsbibliothek Zu Berlin.档案脉络编码-组织, 人物, 及家族(Eac-Cpf)元素字典2010年更新版[M].李彦霖, 侯鑫鑫, 苗青, 等, 译.2010修订版.Chicago:Sosiety of American archivists, 2018. (Technical Subcommittee for Encoded Archival Context of the Society of American Archivists, Staatsbibliothek Zu Berlin.Encoded archival context-corporate bodies, persons, and families (EAC-CPF) tag library[M].LI Y L, HOU X X, MIAO Q, et al, trans.2010 Revised ed.Chicago:Sosiety of American archivists, 2018.)
[45] MARK A, MATIENZO K K.ArchivesSpace:a next-generation archives management system in museums and the web[EB/OL].[2023-07-18].https://mw2013.museumsandtheweb.com/paper/archivesspace-a-next-generation-archives-management-system/index.html.
文章导航

/