RESEARCH PAPERS

Research on Logical Relationship Mining Between Academic Papers from the Perspective of Structural Element Semantic Relations

  • Cong Tianshi ,
  • Zheng Dejun ,
  • Cheng Wei
Expand
  • College of Information Management, Nanjing Agricultural University, Nanjing 210095
Cong Tianshi, doctoral candidate; Zheng Dejun, professor, doctoral supervisor, corresponding author, E-mail: zdejun@njau.edu.cn;Cheng Wei, doctoral candidate.

Received date: 2024-08-07

  Revised date: 2024-11-19

  Online published: 2025-07-21

Abstract

[Purpose/Significance] This study aims to conduct a deep analysis of the semantic associations between structural elements of academic full-texts, uncover the logical relationships between papers, provide a comprehensive overview of research trends for academic innovation, and offer new perspectives for knowledge management and academic evaluation. [Method/Process] Firstly, academic paper structuring was modeled based on bibliographic data and chapter structure, combining deep learning and rule-matching techniques to perform vector representation and content extraction. Secondly, methods such as semantic similarity and word co-occurrence were employed to analyze the combination, comparison, and association of different paper structural elements, inductively identifying progressive and supplementary logical relationships between academic papers. Finally, a two-layer event evolutionary graph was constructed based on these relationships to extract the logical relationship network of academic papers, providing an in-depth analysis of two types of logical relationships from multidimensional perspectives dominated by different structural elements, thereby clarifying the knowledge evolution pathway and accelerating the knowledge innovation process. [Result/Conclusion] Empirical results show that the proposed method accurately identifies logical associations between papers, addressing the limitations of citation analysis in terms of relationship dimensions and accuracy. This provides a new approach for determining relationships between academic papers.

Cite this article

Cong Tianshi , Zheng Dejun , Cheng Wei . Research on Logical Relationship Mining Between Academic Papers from the Perspective of Structural Element Semantic Relations[J]. Library and Information Service, 2025 , 69(15) : 124 -136 . DOI: 10.13266/j.issn.0252-3116.2025.15.011

References

[1] KONG X J, SHI Y J, YU S, et al. Academic social networks: modeling, analysis, mining and applications[J]. Journal of network and computer applications, 2019, 132: 86-103.
[2] 张昱, 张均胜, 姚长青. 科技文献语义关联网络及其应用探析[J]. 中国科技资源导刊, 2020, 52(1): 40-47. (ZHANG Y, ZHANG J S, YAO C Q. Semantic link network of scientific literature and its application[J]. China science & technology resources review, 2020, 52(1): 40-47.)
[3] 刘耀, 穗志方, 胡永伟, 等. 基于内容与形式交互的图书馆资源组织语义化方法研究[J]. 情报理论与实践, 2010, 33(10): 105-107, 112. (LIU Y, SUI Z F, HU Y W, et al. Research on the method of library resources organization semantization based on content-format interaction[J]. Information studies: theory & application, 2010, 33(10): 105-107, 112.)
[4] 赵京胜, 张丽, 朱巧明, 等. 中文文学作品中的社会网路抽取与分析[J]. 中文信息学报, 2017, 31(2): 99-106. (ZHAO J S, ZHANG L, ZHU Q, et al. Extracting and analyzing social networks from Chinese literary[J]. Journal of Chinese information processing, 2017, 31(2): 99-106)
[5] CAMERON D, KAVULURU R, RINDFLESCH T C, et al. Context-driven automatic subgraph creation for literature-based discovery[J]. Journal of biomedical informatics, 2015, 54: 141-157.
[6] 曹树金, 曹茹烨. 基于知识图谱支持科研创新的跨学科知识发现研究[J]. 情报理论与实践, 2022, 45(11): 10-20. (CAO S J, CAO R H. Research on interdisciplinary knowledge discovery based on knowledge graph to support scientific research innovation[J]. Information studies: theory & application, 2022, 45(11): 10)
[7] CHENG Q, WANG J, LU W, et al. Keyword-citation-keyword network: a new perspective of discipline knowledge structure analysis[J]. Scientometrics, 2020, 124: 1923-1943.
[8] 刘忠宝, 王宇飞, 赵文娟. 基于章节段落的学术文献结构功能识别方法研究[J]. 文献与数据学报, 2020, 2(3): 37-47. (LIU Z B, WANG Y F, ZHAO W J. Research on the recognition method of academic literature structure function based on paragraph[J]. Journal of library and data, 2020, 2(3): 37-47.)
[9] 郑梦悦, 秦春秀, 马续补. 面向中文科技文献非结构化摘要的知识元表示与抽取研究——基于知识元本体理论[J]. 情报理论与实践, 2020, 43(2): 157-163. (ZHENG M Y, QIN C X, MA J B. Research on knowledge unit representation and extraction for unstructured abstracts of Chinese scientific and technical literature: ontology theory based on knowledge unit[J]. Information studies: theory & application, 2020, 43(2): 157-163.)
[10] HAMID Z, KHAFAJI H K. A general algorithm of association rule-based machine learning dedicated for text classification[J]. Journal of physics conference series, 2021, 1773(1): 11-12.
[11] 余丽, 钱力, 付常雷, 等. 基于深度学习的文本中细粒度知识元抽取方法研究[J]. 数据分析与知识发现, 2019, 3(1): 38-45. (YU L, QIAN L, FU C L, et al. Extracting fine-grained knowledge units from texts with deep learning[J]. Data analysis and knowledge discovery, 2019, 3(1): 38-45.)
[12] 罗卓然, 蔡乐, 钱佳佳, 等. 学术论文创新贡献句识别研究[J]. 图书情报工作, 2021, 65(12): 93-100. (LUO Z R, CAI L, QIAN J L, et al. Research on the recognition of innovative contribution sentences of academic papers[J]. Library and information service, 2021, 65(12): 93-100.)
[13] 周海晨, 郑德俊, 郦天宇. 学术全文本的学术创新贡献识别探索[J]. 情报学报, 2020, 39(8): 845-851. (ZHOU H C, ZHENG D J, LI T Y. Research on the identification of academic innovation contributions of full academic texts[J]. Journal of the China Society for Scientific and Technical Information, 2020, 39(8): 845-851.)
[14]张颖怡, 章成志. 基于学术论文全文的研究方法句自动抽取研究[J]. 情报学报, 2020, 39(6): 640-650. (ZHANG Y Y, ZHANG C Z. Methodological and automatic sentence extraction from academic article.s full-text[J]. Journal of the China Society for Scientific and Technical Information, 2020, 39(6): 640-650.)
[15] 张颖怡, 章成志, DAQING H. 学术论文中问题与方法识别及其关系抽取研究综述[J]. 图书情报工作, 2022, 66(12): 125-138. (ZHANG Y Y, ZHANG C Z, DAQING H. A review of problem and method recognition and relation extraction in academic papers[J]. Library and information service, 2022, 66(12): 125-138.)
[16] 章成志, 谢雨欣, 张恒. 学术文献全文内容中的方法实体细粒度抽取及演化分析研究[J]. 情报学报, 2023, 42(8): 952-966. (ZHANG C Z, XIE Y X, ZHANG H. Extraction and evolution analysis of fine-grained method entities from full text of academic articles[J]. Journal of the China Society for Scientific and Technical Information, 2023, 42(8): 952-966.)
[17] 齐小英, 李昕尉, 杨海平. 基于意图和情感的南海学术论文引用特征研究[J]. 数据分析与知识发现, 2022, 6(12): 53-69. (QI X Y, LI X W, YANG H P. Citation characteristics of scholarly articles on south china sea based on intentions and sentiments[J]. Data analysis and knowledge discovery, 2022, 6(12): 53-69.)
[18] 王佳敏, 陆伟, 程齐凯, 等. 基于细粒度关键词引用网络的领域知识多维分析[J]. 情报学报, 2022, 41(7): 733-744. (WANG J M, LU W, CHENG Q K, et al. Multi-dimensional analysis of domain knowledge based on a fine-grained keyword citation network[J]. Journal of the China Society for Scientific and Technical Information, 2022, 41(7): 733-744.)
[19] 戎军涛, 索传军, 周彦廷, 等. 基于创新知识元谱系的学术论文新颖性测度研究[J]. 图书情报工作, 2024, 68(1): 27-38. (RONG J T, SUO C J, ZHOU Y T, et al. Novelty measurement of academic literature based on the innovative knowledge elements genealogy[J]. Library and information service, 2024, 68(1): 27-38.)
[20] 黄晓捷, 熊回香, 肖兵, 等. 基于“问题——方法”知识元挖掘的学科知识流动研究[J]. 图书情报工作, 2024, 68(8): 80-96. (HAUNG X J, XIONG H X, XIAO B, et al. Research on subject knowledge flow based on “problem-method” knowledge units mining[J]. Library and information service, 2024, 68(8): 80-96.)
[21] 杨金庆, 庞业佳, 刘智锋, 等. “问题—方法”关联视角下领域知识创新网络演化机制研究——以信息资源管理学科群为例[J]. 图书情报工作, 2024, 68(10): 97-108. (YANG J Q, PANG J Y, LIU Z F, et al. Research on the evolution mechanism of domain knowledge innovation network from the perspective of “lssue-method" correlation: taking the discipline group of information resources management as an example[J]. Library and information service, 2024, 68(10): 97-108.)
[22] 曹树金, 赵浜. 创新视角的论文间关联挖掘研究[J]. 现代情报, 2022, 42(12): 17-27. (CAO S J, ZHANG B. Research on correlation mining between papers from the perspective of innovation[J]. Journal of modern information, 2022, 42(12): 17-27.)
[23] LI Z, DING X, LIU T. Constructing narrative event evolutionary graph for script event extraction[C]//Proceedings of the 27th international joint conference on artificial intelligence. Stockholm: ACM, 2018: 4201-4207.
[24] 徐海玲. 基于事理图谱的多维特征网络舆情演化路径研究[J]. 情报科学, 2022, 40(7): 48-54. (XU H L. The evolution path of multi-dimensional feature network public opinion based on the event evolutionary graph[J]. Information science, 2022, 40(7): 48-54.)
[25] 张海涛, 李佳玮, 周红磊, 等. 重大突发事件演变机制:认知框架与理论方法[J]. 情报学报, 2021, 40(9): 914-923. (ZHANG H T, LI JIA W, ZHOU H L, et al. The evolution mechanism of major emergencies:cognitive framework and theoretical methods[J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(9): 914-923.)
[26] 王晓光, 宋宁远. 语义出版物的内容组织架构研究——基于纳米出版物和微型出版物的比较分析[J]. 出版科学, 2017, 25(4): 20-27. (WANG X G, SONG N Y. Content schema of semantic publication: a comparative analysis based on nano publication and micro publication[J]. Publishing journal, 2017, 25(4): 20-27.)
[27] 单晓红, 庞世红, 刘晓燕, 等. 基于事理图谱的网络舆情事件预测方法研究[J]. 情报理论与实践, 2020, 43(10): 165-170, 156. (SHAN X H, PANG S H, LIU X Y, et al. Research on Internet public opinion event prediction method based on event evolution graph[J]. Information studies:theory & application, 2020, 43(10): 165-170, 156.)
[28] SOLLACI L B, PEREIRA M G. The introduction, methods, results, and discussion (IMRAD) structure: a fifty-year survey[J]. Journal of the Medical Library Association, 2004, 92(3): 364-367.
[29] 赵丽莹, 苗秀芝, 国荣. 中文科技期刊采用结构式长摘要的建议[J]. 编辑学报, 2017, 29(S1): 59-61. (ZHANG L Y, MIAO X Z, GUO R. Suggestions on extended structured abstract of chinese-language sci-tech journal[J]. Acta editologica, 2017, 29(S1): 59-61.)
[30] TOULMIN S E. The uses of argument[M]. Cambridge: Cambridge University Press, 1958.
[31] 杜新玉, 李宁. 学术论文表述本体的构建与应用探索[J]. 北京信息科技大学学报(自然科学版), 2022, 37(6): 9-18. (DU X Y, LI N. Construction and application exploration of academic paper expression ontology[J]. Journal of Beijing Information Science & Technology University(science and technology edition), 2022, 37(6): 9-18.)
[32] 曲佳彬, 欧石燕. 语义出版驱动的科学论文论证结构语义建模研究[J]. 现代情报, 2021, 41(12): 48-59. (QU J B, OU S Y. Semantic modeling for scientific paper argumentation structure driven by sematic publishing[J]. Journal of modern information, 2021, 41(12): 48-59.)
[33] 石湘, 刘萍. 基于知识元语义描述模型的领域知识抽取与表示研究——以信息检索领域为例[J]. 数据分析与知识发现, 2021, 5(4): 123-133. (SHI X, LIU P. Extraction and representation of domain knowledge with semantic description model and knowledge elements-case study of information retrieval[J]. Data analysis and knowledge discovery, 2021, 5(4): 123-133.)
[34] 王思茗, 孙熊兰, 滕广青, 等. 知识图谱在领域知识多维分析中的应用途径研究[J]. 数字图书馆论坛, 2019(3): 18-27. (WANG S M, SUN X L, TENG G Q, et al. Application of knowledge graph in multidimensional analysis of domain knowledge[J]. Digital library forum, 2019(3): 18-27.)
[35] LIU Y, OTT M, GOYAL N, et al. Roberta: a robustly optimized bert pretraining approach[DB/OL]. [2025-06-01]. https://arxiv.org/abs/1907.11692.
[36] DEVLIN J, CAHNG M W, LEE K, et al. Bert: pre-training of deep bidirectional transformers for language understanding[DB/OL]. [2025-06-01]. https://arxiv.org/abs/1810.04805.
[37] BELTAGY I, LO K, COHAN A. SciBERT: a pretrained language model for scientific text[DB/OL]. [2025-06-01]. https://arxiv.org/abs/1903.10676.
[38] SHAPERE D. The structure of scientific revolutions[J]. The philosophical review, 1964, 73(3): 383-394.
[39] 刘越, 郑德俊, 程为. 面向信息资源管理领域的学术创新特征分析[J]. 图书情报工作, 2024, 68(4): 83-96. (LIU Y, ZHENG D J, CHENG W. Analysis of academic innovation characteristics in the field of information resource management[J]. Library and imformation service, 2024, 68(4): 83-96.)
[40] 何玉荣. 补充类话语标记分析及其习得研究[D]. 南宁:广西大学, 2023. (HE Y R. Analysis of supplementary discourse markers and research on its acquisition[D]. Nangning: Guangxi University, 2023.)
[41] GAO T, YAO X, CHEN D. Simcse: simple contrastive learning of sentence embeddings[DB/OL]. [2025-06-01]. https://arxiv.org/abs/2104.08821.
[42] 索传军, 于淼, 牌艳欣, 等. 数据驱动的学术评价理论框架研究[J]. 图书情报工作, 2024, 68(1): 5-12. (SUO C J, YU M, PAI Y X, et al. Theoretical framework of data-driven academic evaluation[J]. Library and information service, 2024, 68(1): 5-12.)
[43] 金鑫, 张润莲. 递进关联标记“进一步”及其相关结构的句法语义分析[J]. 语言研究, 2024, 44(2): 11-16. (JIN X, ZHANG R L.A syntactic and semantic analysis of the progressive relation marker jinyibu and its related structures[J]. Studies in language and linguistics, 2024, 44(2): 11-16.)
[44] 李秀霞, 邵作运. 内容信息与引文信息融合的作者影响力评价研究[J]. 情报理论与实践, 2020, 43(2): 72-77. (LI X X, SHAO Y Z. The evaluation research on author’s influence by integrating the content information and citation information[J]. Information studies: theory & application, 2020, 43(2): 72-77.)
[45] 陈亮, 张志强. 技术演化研究方法进展分析[J]. 图书情报工作, 2012, 56(17): 59-66. (CHEN L, ZHANG Z Q. Development of technological evolution research methods[J]. Library and imformation service, 2012, 56(17): 59-66.)
Outlines

/