High-order Proximity Preserving Information Network Hashing

Authors:
Defu Lian University of Science and Technology of China
Kai Zheng University of Science and Technology of China
Vincent W. Zheng Advanced Digital Sciences Center
Yong Ge University of Arizona
Longbing Cao University of Technology, Sydney
Ivor W. Tsang University of Technology, Sydney
Xing Xie Microsoft

Introduction:

This paper studies Information network embedding. The authors propose a MF-based underlineI nformation underlineN etwork underlineH ashing (INH-MF) algorithm, to learn binary codes which can preserve high-order proximity.

Abstract:

Information network embedding is an effective way for efficient graph analytics. However, it still faces with computational challenges in problems such as link prediction and node recommendation, particularly with increasing scale of networks. Hashing is a promising approach for accelerating these problems by orders of magnitude. However, no prior studies have been focused on seeking binary codes for information networks to preserve high-order proximity. Since matrix factorization (MF) unifies and outperforms several well-known embedding methods with high-order proximity preserved, we propose a MF-based \underlineI nformation \underlineN etwork \underlineH ashing (INH-MF) algorithm, to learn binary codes which can preserve high-order proximity. We also suggest Hamming subspace learning, which only updates partial binary codes each time, to scale up INH-MF. We finally evaluate INH-MF on four real-world information network datasets with respect to the tasks of node classification and node recommendation. The results demonstrate that INH-MF can perform significantly better than competing learning to hash baselines in both tasks, and surprisingly outperforms network embedding methods, including DeepWalk, LINE and NetMF, in the task of node recommendation. The source code of INH-MF is available online\footnote\urlhttps://github.com/DefuLian/network .

You may want to know: