北京大学信息管理系成立75周年学术专辑

学者画像研究综述

  • 王世奇 ,
  • 刘智锋 ,
  • 王继民
展开
  • 1. 北京大学信息管理系 北京 100871;
    2. 北京大学大数据分析与应用技术国家工程实验室 北京 100871
王世奇,博士研究生; 刘智锋,博士研究生。

收稿日期: 2022-07-29

  修回日期: 2022-08-24

  网络出版日期: 2022-11-17

基金资助

本文系国家社会科学基金重点项目"开放科学数据集统一发现的关键问题与平台构建研究"(项目编号:20ATQ007)和北京大学重庆大数据研究院北京基地项目研究成果之一。

A Review of Scholar Profiling Research

  • Wang Shiqi ,
  • Liu Zhifeng ,
  • Wang Jimin
Expand
  • 1 Department of Information Management, Peking University, Beijing 100871;
    2 National Engineering Laboratory for Big Data Analysis and Applications, Peking University, Beijing 100871

Received date: 2022-07-29

  Revised date: 2022-08-24

  Online published: 2022-11-17

摘要

[目的/意义] 对学者画像研究进行梳理,为其相关研究提供参考。[方法/过程] 通过文献调研与分析,对学者画像及其相关概念进行辨析,归纳总结学者画像的构建流程、关键技术以及主要的应用,并分析目前研究面临的挑战。[结果/结论] 学者画像的构建流程包含数据搜集、数据预处理、学者标签构造与可视化分析,主要实践应用包括专家推荐、学术资源推荐和科研能力评价。当前相关研究面临多源数据获取与融合难度大、学者画像动态更新研究困难以及有效评价机制缺乏等挑战。

本文引用格式

王世奇 , 刘智锋 , 王继民 . 学者画像研究综述[J]. 图书情报工作, 2022 , 66(20) : 73 -81 . DOI: 10.13266/j.issn.0252-3116.2022.20.008

Abstract

[Purpose/Significance] This paper summarizes the research on scholar profile, and provides a reference for the related research. [Method/Process] Through literature research and analysis, this paper discriminated the scholar profile and its related concepts, summarized the construction process, key technologies and main applications of the scholar profile, and analyzed the challenges faced by the current research. [Result/Conclusion] The construction process of scholar profile includes data collection, data preprocessing, scholar label construction and visual analysis. The main practical applications include expert recommendation, academic resource recommendation and scientific research ability evaluation. At present, there are still some challenges in related research, such as the difficulty of multi-source data acquisition and fusion, difficulties in research on dynamic update of the scholar profile and the lack of effective evaluation mechanism.

参考文献

[1] 许鹏程,毕强,张晗,等.数据驱动下数字图书馆用户画像模型构建[J].图书情报工作,2019,63(3):30-37.
[2] 李佳慧,赵刚.基于大数据的电子商务用户画像构建研究[J].电子商务,2019(1):46-49.
[3] 滕春娥,何春雨.在线医疗社区用户画像构建与应用[J].图书情报工作,2021,65(12):147-154.
[4] 徐海玲,张海涛,魏明珠,等.社交媒体用户画像的构建及资源聚合模型研究[J].图书情报工作,2019,63(9):109-115.
[5] 王雅娇,路佳,柯晓静.学术画像在科技期刊中的应用研究[J].中国编辑,2021(4):45-49.
[6] 董文慧,熊回香,杜瑾,等.基于学者画像的科研合作者推荐研究[J/OL].数据分析与知识发现 [2022-05-23].http://kns.cnki.net/kcms/detail/10.1478.G2.20220407.1054.002.html.
[7] 范晓玉. 基于多源科技管理数据的重大项目团队成员推荐研究[D].西安:西安电子科技大学,2018.
[8] GENG Q, CHUAI Z, JIN J. Automatic construction of academic profile: a case of information science domain[J]. Journal of information science, 2021(4):016555152199804.
[9] HOLANDA O, ELIAS E, COSTA E, et al. Towards an agent-based approach for automatic generation of researcher profiles using multiple data sources [C]// IEEE/WIC/ACM international joint conferences on Web intelligence. New York:IEEE, 2013:163-166.
[10] TANG J, ZHANG J, YAO L, et al. ArnetMiner: extraction and mining of academic social networks [C]// Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining. Las Vegas.:ACM, 2008.
[11] 陈慧香,邵波.国外图书馆领域用户画像的研究现状及启示[J].图书馆学研究,2017(20):16-20.
[12] 刘海鸥,孙晶晶,苏妍嫄,等.国内外用户画像研究综述[J].情报理论与实践,2018,41(11):155-160.
[13] 宋美琦,陈烨,张瑞.用户画像研究述评[J].情报科学,2019,37(4):171-177.
[14] 张海涛,徐海玲,张枭慧,等.国内外图书情报领域用户画像研究现状及展望[J].图书情报工作,2019,63(7):127-134.
[15] 袁莎,唐杰,顾晓韬.开放互联网中的学者画像技术综述[J].计算机研究与发展,2018,55(9):1903-1919.
[16] COOPER A. The inmates are running the asylum: why high tech products drive us crazy and how to restore the sanity[M]. 2nd ed. New York:Pearson Higher Education, 2004.
[17] COOPER A, REIMANN R, CRONIN D. About face 3: the essentials of interaction design[M]. Hoboken:John Wiley & Sons, 2007.
[18] BAXTER K, COURAGE C, CAINE K. Understanding your users: a practical guide to user research methods[M]. Burlington:Morgan Kaufmann, 2015.
[19] 许棣华,王志坚,林巧民,等.一种基于偏好的个性化标签推荐系统[J].计算机应用研究,2011,28(7):2573-2575,2579.
[20] MALESZKA M, MIANOWSKA B, NGUYEN N T. A method for collaborative recommendation using knowledge integration tools and hierarchical structure of user profiles[J]. Knowledge-based systems, 2013, 47: 1-13.
[21] 张海涛,栾宇,周红磊. 用户画像:向知识迈进[J]. 图书情报知识,2020(5):131-134.
[22] YAO L, TANG J, LI J. A unified approach to researcher profiling [C]//IEEE/WIC/ACM international conference on Web intelligence. Fremont:IEEE, 2007: 359-366.
[23] 范晓玉,窦永香,赵捧未,等.融合多源数据的科研人员画像构建方法研究[J].图书情报工作,2018,62(15):31-40.
[24] 秦成磊,章成志.大数据环境下同行评议面临的问题与对策[J].情报理论与实践,2021,44(4):99-112.
[25] 姚远,张蕙,郝群,等. 基于本体的用户画像构建方法[C]//中国计算机用户协会网络应用分会2018年第二十二届网络新技术与应用年会论文集. 2018:232-238.
[26] 高广尚.用户画像构建方法研究综述[J].数据分析与知识发现,2019,3(3):25-35.
[27] 王锐杰. 基于多源信息融合的科研学者画像及应用研究[D].成都:电子科技大学,2020.
[28] 池雪花. 学者精准画像的自动构建研究[D].南京:南京理工大学,2019.
[29] ZHANG J,TANG J. Name disambiguation in AMiner[J].Science China(information sciences),2021,64(4):214-216.
[30] 牛海波,罗威,尹忠博,等. 一种基于互联网信息的开放学者画像方法:CN108090223B [P].2020-05-12.
[31] 昌宁,窦永香,徐薇.基于多源数据的科技文献作者同名消歧研究[J].情报科学,2021,39(6):108-116.
[32] LIU G, YANG L. Popular research topics in the recent journal publications of library and information science[J]. The Journal of academic librarianship, 2019, 45(3):278-287.
[33] Research Gate[EB/OL].[2022-05-10].https://www.researchgate.net.
[34] Academia.edu - Share research[EB/OL].[2022-05-10]. https://www.academia.edu/.
[35] 学者网-SCHOLAT[EB/OL].[2022-05-10]. https://www.scholat.com/.
[36] BRAVO M, REYES-ORTIZ J A, CRUZ I. Researcher profile ontology for academic environment[C]//Science and information conference. Cham:Springer, 2019: 799-817.
[37] SUN J, XU J G, CEN Z W. Chinese researcher profile annotation based on conditional random fields with semantic rules [C]//World congress on engineering. v.III.:international association of engineers. London:WCE,2011:1818-1822.
[38] 张华平,商建云.NLPIR-Parser:大数据语义智能分析平台[J].语料库语言学,2019,6(1):87-104.
[39] CHE W, LI Z, LIU T. LTP: a Chinese language technology platform [C]// 23rd international conference on computational linguistics. Beijing: Coling 2010, Demonstrations,2010:13-16.
[40] DEMARTINI G. Finding experts using Wikipedia [C]// International conference on finding experts on the Web with semantics. Busan:CEUR-WS.org, 2007.
[41] 曾健荣,张仰森,王思远,等.基于多特征融合的同名专家消歧方法研究[J].北京大学学报(自然科学版),2020,56(4):607-613.
[42] 朱云霞.中文文献题录数据作者重名消解问题研究[J].图书情报工作,2014,58(23):143-148,142.
[43] 温萍梅,叶志炜,丁文健,等.命名实体消歧研究进展综述[J].数据分析与知识发现,2020,4(9):15-25.
[44] 赵刚,姚兴仁.基于用户画像的异常行为检测模型[J].信息网络安全,2017(7):18-24.
[45] 谢鹏.面向学术文献的学者兴趣标签识别方法[J].情报工程,2019,5(3):65-73.
[46] 池雪花,刘丽帆,章成志.基于学术论文的学者研究兴趣标签发现研究[J].情报工程,2019,5(2):28-39.
[47] AMINI B, IBRAHIM R, OTHMAN M S, et al. Capturing scholar’s knowledge from heterogeneous resources for profiling in recommender systems[J]. Expert systems with applications, 2014, 41(17): 7945-7957.
[48] 石湘,刘萍.学者研究兴趣识别综述[J/OL].数据分析与知识发现[2022-05-05].http://kns.cnki.net/kcms/detail/10.1478.G2.20211213.1739.008.html.
[49] HIRSCH J E. An index to quantify an individual’s scientific research output[J]. Proceedings of the National Academy of Sciences of the United States of America, 2005, 102(46):16569-16572.
[50] 熊回香,杨雪萍,蒋武轩,等.基于学术能力及合作关系网络的学者推荐研究[J].情报科学,2019,37(5):71-78.
[51] WordArt.com-Word Cloud Art Creator[EB/OL].[2022-05-05]. https://wordart.com/.
[52] Tagxedo-Word Cloud with Styles[EB/OL].[ 2022-05-05]. http://www.tagxedo.com/.
[53] LIU J, TANG T, WANG W, et al. A survey of scholarly data visualization[J]. IEEE access, 2018, 6: 19205-19221.
[54] THIAGARAJAN R, MANJUNATH G, STUMPTNER M. Finding experts by semantic matching of user profiles[D]. Karlruhe:CEUR-WS, 2008.
[55] 胡承芳,李季,王春芳,等.基于画像技术的澜湄水资源合作领域专家库构建研究[J].长江技术经济,2021,5(6):100-106.
[56] DE CAMPOS L M, FERNNDEZ-LUNA J M, HUETE J F, et al. Automatic construction of multi-faceted user profiles using text clustering and its application to expert recommendation and filtering problems[J]. Knowledge-based systems, 2020, 190: 105337.
[57] DING Y, YAN E, FRAZHO A, et al. PageRank for ranking authors in co-citation networks[J].Journal of the American Society for Information Science and Technology, 2009, 60(11):2229-2243.
[58] 何娟.基于用户个人及群体画像相结合的图书个性化推荐应用研究[J].情报理论与实践,2019,42(1):129-133.160.
[59] 刘海鸥,刘旭,姚苏梅,等.基于大数据深度画像的个性化学习精准服务研究[J].图书馆学研究,2019(15):68-74.
[60] 李宇佳,王益成.基于用户动态画像的学术新媒体信息精准推荐模型研究[J].情报科学,2022,40(1):88-93,101.
[61] 韩旭,李寒,张丽敏,等.基于学术行为的学者排名技术及实现[J].电脑知识与技术,2019,15(26):1-3,5.
[62] 熊回香,杜瑾,代沁泉,等.基于主题与多维计量指标的学者学术影响力评价研究[J].情报理论与实践,2021,44(8):22-27,21.
[63] LEE M,CHO M, JEONG C, et al. Researcher profiling for researcher analysis service[C]// SWCIB2014 workshop, collocated with JIST2014 conference. Chiang Mai:JIST (Workshops & Posters). 2014: 18-23.
[64] ZHAO J P, LIU T W, SHI J Q. Improving academic homepage identification from the Web using neural networks[C]// International conference on computational science. London:Springer, 2019: 551-558.
[65] 沈喆,王毅,姚毅凡,等.面向学术文献的作者名消歧方法研究综述[J].数据分析与知识发现,2020,4(8):15-27.
[66] 胡媛,毛宁.基于用户画像的数字图书馆知识社区用户模型构建[J].图书馆理论与实践,2017(4):82-85,97.
[67] 中国科学院知识服务平台 (las.ac.cn)[EB/OL].[2022-05-18]. https://www.las.ac.cn/.
[68] 粤港澳科技资源大数据服务平台[EB/OL].[2022-05-18]. https://talent.dgut-gba.cn/.
[69] CCKS 2021: AMiner学者画像-Biendata[EB/OL].[2022-05-18]. https://www.biendata.xyz/competition/ccks_aminer_profiling/.
文章导航

/