Revisiting Embedding Based Graph Analyses: Hyperparameters Matter!

超参数 计算机科学 嵌入 图形 功率图分析 机器学习 启发式 图嵌入 理论计算机科学 人工智能 操作系统
作者
Dingqi Yang,Bingqing Qu,Rana Hussein,Paolo Rosso,Philippe Cudre-Mauroux,Jie Liu
出处
期刊:IEEE Transactions on Knowledge and Data Engineering [Institute of Electrical and Electronics Engineers]
卷期号:35 (11): 11830-11845
标识
DOI:10.1109/tkde.2022.3230743
摘要

Graph embeddings have been widely used for many graph analysis tasks. Mainstream factorization-based and graph-sampling-based embedding learning schemes both involve many hyperparameters and design choices. However, existing techniques often adopt some heuristics for these hyperparameters and design choices with little investigation into their impact, making it unclear what is the exact performance gains of these techniques on graph analysis tasks. Against this background, this paper presents a systematic study on the impact of an extensive list of hyperparameters for both factorization-based and graph-sampling-based graph embedding techniques for homogeneous graphs. We design generalized factorization-based and graph-sampling-based techniques involving these hyperparameters, and conduct a comprehensive set of experiments with over 3,000 embedding models trained and evaluated per dataset. We reveal that much of the performance gains are indeed due to optimal hyperparameter settings/design choices rather than the sophistication of embedding models; appropriate hyperparameter settings for typical embedding techniques can outperform a sizeable collection of 18 state-of-the-art graph embedding techniques by 0.30-35.41% across different tasks. Moreover, we find that there is no one-size-fits-all hyperparameter setting across tasks, but we can indeed provide a list of task-specific practical recommendations for these hyperparameter settings/design choices, which we believe can serve as important guidelines for future research on embedding based graph analyses.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
天璇完成签到,获得积分10
1秒前
1秒前
Pretrial完成签到 ,获得积分10
2秒前
灿灿陈发布了新的文献求助10
2秒前
路人丨安完成签到,获得积分10
2秒前
3秒前
汉堡包应助机灵的芒果采纳,获得10
3秒前
从容的巧曼完成签到,获得积分10
3秒前
wwwww发布了新的文献求助10
3秒前
DTiverson完成签到,获得积分10
4秒前
4秒前
tivyg'lk发布了新的文献求助10
5秒前
困倦南瓜发布了新的文献求助10
6秒前
6秒前
阿木完成签到,获得积分10
7秒前
糊涂生活糊涂过完成签到 ,获得积分10
9秒前
莫小北发布了新的文献求助10
9秒前
9秒前
哈哈恬完成签到,获得积分10
10秒前
10秒前
Rlx完成签到,获得积分10
12秒前
tivyg'lk完成签到,获得积分20
12秒前
wwwww完成签到,获得积分20
13秒前
调皮的蓝天完成签到,获得积分10
13秒前
AU完成签到 ,获得积分10
13秒前
科研狗完成签到,获得积分10
13秒前
自由饼干完成签到,获得积分10
14秒前
14秒前
14秒前
xly完成签到,获得积分10
14秒前
菠萝蜜完成签到,获得积分10
15秒前
vantine发布了新的文献求助10
15秒前
15秒前
16秒前
dy1994完成签到,获得积分10
17秒前
纪外绣完成签到,获得积分10
17秒前
等待的代容完成签到,获得积分10
18秒前
李健应助max采纳,获得10
18秒前
黄猿完成签到,获得积分10
19秒前
儒雅雅琴完成签到,获得积分10
19秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 500
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 500
Chinese-English Translation Lexicon Version 3.0 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
薩提亞模式團體方案對青年情侶輔導效果之研究 400
3X3 Basketball: Everything You Need to Know 310
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2387692
求助须知:如何正确求助?哪些是违规求助? 2094085
关于积分的说明 5270719
捐赠科研通 1820837
什么是DOI,文献DOI怎么找? 908306
版权声明 559289
科研通“疑难数据库(出版商)”最低求助积分说明 485217