Prediction of Human Activity by Discovering Temporal Sequence Patterns

被引:149
作者
Li, Kang [1 ]
Fu, Yun [1 ,2 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Coll Engn, Boston, MA 02115 USA
[2] Northeastern Univ, Coll Comp & Informat Sci, Dana Res Ctr 403, Boston, MA 02115 USA
基金
美国国家科学基金会;
关键词
Activity prediction; causality; context-cue; predictability; ACTION RECOGNITION;
D O I
10.1109/TPAMI.2013.2297321
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Early prediction of ongoing human activity has become more valuable in a large variety of time-critical applications. To build an effective representation for prediction, human activities can be characterized by a complex temporal composition of constituent simple actions and interacting objects. Different from early detection on short-duration simple actions, we propose a novel framework for long-duration complex activity prediction by discovering three key aspects of activity: Causality, Context-cue, and Predictability. The major contributions of our work include: (1) a general framework is proposed to systematically address the problem of complex activity prediction by mining temporal sequence patterns; (2) probabilistic suffix tree (PST) is introduced to model causal relationships between constituent actions, where both large and small order Markov dependencies between action units are captured; (3) the context-cue, especially interactive objects information, is modeled through sequential pattern mining (SPM), where a series of action and object co-occurrence are encoded as a complex symbolic sequence; (4) we also present a predictive accumulative function (PAF) to depict the predictability of each kind of activity. The effectiveness of our approach is evaluated on two experimental scenarios with two data sets for each: action-only prediction and context-aware prediction. Our method achieves superior performance for predicting global activity classes and local action units.
引用
收藏
页码:1644 / 1657
页数:14
相关论文
共 57 条
[1]  
Abbeel P., 2004, P 21 INT C MACH LEAR
[2]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[3]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[4]   Action understanding as inverse planning [J].
Baker, Chris L. ;
Saxe, Rebecca ;
Tenenbaum, Joshua B. .
COGNITION, 2009, 113 (03) :329-349
[5]   On prediction using variable order Markov models [J].
Begleiter, R ;
El-Yaniv, R ;
Yona, G .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :385-421
[6]  
Brendel W, 2011, IEEE I CONF COMP VIS, P778, DOI 10.1109/ICCV.2011.6126316
[7]  
Brendel W, 2011, PROC CVPR IEEE, P1273, DOI 10.1109/CVPR.2011.5995395
[8]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[9]  
Collins R. T., 2005, P IEEE INT WORKSH PE
[10]   Minimal-latency human action recognition using reliable-inference [J].
Davis, James W. ;
Tyagi, Ambrish .
IMAGE AND VISION COMPUTING, 2006, 24 (05) :455-472