Link Prediction on Evolving Data using Matrix and Tensor Factorizations

被引:98
作者
Acar, Evrim [1 ]
Dunlavy, Daniel M. [2 ]
Kolda, Tamara G. [1 ]
机构
[1] Sandia Natl Labs, Livermore, CA 94551 USA
[2] Sandia Natl Labs, Dept Comp & Informat Sci, Albuquerque, NM 87123 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009) | 2009年
基金
美国能源部;
关键词
link mining; link prediction; evolution; tensor factorization; CANDECOMP; PARAFAC;
D O I
10.1109/ICDMW.2009.54
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The data in many disciplines such as social networks, web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In this paper, we consider the problem of temporal link prediction: Given link data for time periods l through T, can we predict the links in time period T+1? Specifically, we look at bipartite graphs changing over time and consider matrix- and tensor-based methods for predicting links. We present a weight-based method for collapsing multi-year data into a single matrix. We show how the well-known Katz method for link prediction can be extended to bipartite graphs and, moreover, approximated in a scalable way using a truncated singular value decomposition. Using a CANDECOMP/PARAFAC tensor decomposition of the data, we illustrate the usefulness of exploiting the natural three-dimensional structure of temporal link data. Through several numerical experiments, we demonstrate that both matrix- and tensor-based techniques are effective for temporal link prediction despite the inherent difficulty of the problem.
引用
收藏
页码:262 / +
页数:2
相关论文
共 30 条
[1]   Unsupervised Multiway Data Analysis: A Literature Survey [J].
Acar, Evrim ;
Yener, Buelent .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (01) :6-20
[2]  
Acar E, 2006, LECT NOTES COMPUT SC, V3975, P213
[3]  
[Anonymous], 2006, P SIAM DAT MIN WORKS
[4]  
[Anonymous], 1992, Numerical Methods for Large Eigenvalue Problems
[5]  
[Anonymous], 2005, SIGKDD Explor. Newsl
[6]  
[Anonymous], 2007, ACM SIGKDD Explorations Newsletter, DOI DOI 10.1145/1345448.1345462
[7]  
[Anonymous], P AI STATS 07
[8]  
[Anonymous], 2006, ICDCSW
[9]  
[Anonymous], 2005, ACM SIGKDD EXPLOR NE
[10]   Efficient MATLAB computations with sparse and factored tensors [J].
Bader, Brett W. ;
Kolda, Tamara G. .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2007, 30 (01) :205-231