Time Series Join on Subsequence Correlation

被引:41
作者
Mueen, Abdullah [1 ]
Hamooni, Hossein [1 ]
Estrada, Trilce [1 ]
机构
[1] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2014年
关键词
D O I
10.1109/ICDM.2014.52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of joining two long time series based on their most correlated segments. Two time series can be joined at any locations and for arbitrary length. Such join locations and length provide useful knowledge about the synchrony of the two time series and have applications in many domains including environmental monitoring, patient monitoring and power monitoring. However, join on correlation is a computationally expensive task, specially when the time series are large. The naive algorithm requires O(n(4)) computation where n is the length of the time series. We propose an algorithm, named Jocor, that uses two algorithmic techniques to tackle the complexity. First, the algorithm reuses the computation by caching sufficient statistics and second, the algorithm prunes unnecessary correlation computation by admissible heuristics. The algorithm runs orders of magnitude faster than the naive algorithm and enables us to join long time series as well as many small time series. We propose a variant of Jocor for fast approximation and an extension to a GPU-based parallel method to bring down the running-time to interactive level for analytics applications. We show three independent uses of time series join on correlation which are made possible by our algorithm.
引用
收藏
页码:450 / 459
页数:10
相关论文
共 20 条
[1]  
[Anonymous], 2013, P 26 IEEE CAN C EL C
[2]  
[Anonymous], 2003, Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD'03, DOI DOI 10.1145/872757.872765
[3]  
[Anonymous], 2010, SIGMOD, DOI 10.1145/1807167.1807188
[4]  
[Anonymous], 2009, P 2009 SIAM INT C DA
[5]  
Bugenhagen Scott M, 2010, Physiol Genomics, V42, P23, DOI 10.1152/physiolgenomics.00027.2010
[6]  
Chen YG, 2009, PROC INT CONF DATA, P1048, DOI 10.1109/ICDE.2009.20
[7]   Discovering Longest-lasting Correlation in Sequence Databases [J].
Li, Yuhong ;
Hou, Leong U. ;
Yiu, Man Lung ;
Gong, Zhiguo .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14) :1666-1677
[8]  
Lin Y., 2010, IAENG INT J COMPUTER, V37
[9]  
Lin Y, 2010, LECT NOTES ARTIF INT, V6118, P238
[10]  
Lines Jason, 2012, SIGKDD, P289