引用
计算机科学
人工神经网络
领域(数学)
人工智能
机器学习
图层(电子)
特征(语言学)
质量(理念)
构造(python库)
数据挖掘
数据科学
万维网
数学
语言学
化学
哲学
有机化学
认识论
纯数学
程序设计语言
作者
Tianchen Gao,Jingyuan Liu,Rui Pan,Hansheng Wang
标识
DOI:10.1016/j.eswa.2023.121634
摘要
Citation counts is a crucial factor in evaluating the quality of research papers. Therefore, it is vital to accurately predict citation counts and explore the mechanisms underlying citations. In this study, we focus on predicting the citation counts in the field of statistics. We collect 55,024 academic papers published in 43 statistics journals between 2001 and 2018. Furthermore, we collect and clean a high-quality dataset and then construct multi-layer networks from different perspectives, including journal networks, author citation networks, co-citation networks, co-authorship networks, and keyword co-occurrence networks. Additionally, we extract 77 factors for citation counts prediction, including 22 traditional and 55 network-related factors. To address the issues of zero-inflated and over-dispersed citation counts, a neural network model is designed to achieve high prediction accuracy. Furthermore, we adopt a leave-one-feature-out approach to investigate the importance of these factors. The proposed neural network model achieves an MAE value of 7.352, which outperforms other machine learning models in the comparison. Thus, this study provides a useful guide for researchers to predict citation counts and can be easily extended to other research fields.
科研通智能强力驱动
Strongly Powered by AbleSci AI