计算机科学
理论计算机科学
散列函数
页面排名
嵌入
非负矩阵分解
图形
量化(信号处理)
矩阵分解
算法
人工智能
计算机安全
量子力学
物理
特征向量
作者
Defu Lian,Zhihao Zhu,Kai Zheng,Yong Ge,Xing Xie,Enhong Chen
标识
DOI:10.1109/tkde.2022.3151474
摘要
Information network embedding is an important way to enable efficient graph analytics. However, it still faces with computational challenges in problems such as link prediction and node recommendation, particularly with the increasing scale of networks. Both hashing and quantization are promising approaches for accelerating these problems by orders of magnitude. In the preliminary work, we have proposed to learn binary codes for information networks, but graph analytics may suffer from large accuracy degradation. To reduce information loss while achieving memory and search efficiency, we further propose to learn quantized codes for information networks. In particular, each node is represented by compositing multiple latent vectors, each of which is optimally selected from a distinct set. Since (generalized) matrix factorization unifies several well-known embedding methods with high-order proximity preserved, we propose a \underline{N}etwork \underline{R}epresentation \underline{L}ightening framework based on \underline{M}atrix \underline{F}actorization (NRL-MF) to learn binary and quantized codes. We also propose an alternating optimization algorithm for efficient parameter learning, even for the generalized matrix factorization case. We finally evaluate NRL-MF on four real-world information network datasets with respect to the tasks of node classification and node recommendation. The results show that NRL-MF significantly outperforms competing baselines in both tasks, and that quantized representations indeed incur much smaller information loss than binarized codes.
科研通智能强力驱动
Strongly Powered by AbleSci AI