Temporal Fusion Transformers for interpretable multi-horizon time series forecasting

被引:1044
作者
Lim, Bryan [1 ]
Arik, Sercan O. [2 ]
Loeff, Nicolas [2 ]
Pfister, Tomas [2 ]
机构
[1] Univ Oxford, Oxford, England
[2] Google Cloud AI, Sunnyvale, CA USA
关键词
Deep learning; Interpretability; Time series; Multi-horizon forecasting; Attention mechanisms; Explainable AI;
D O I
10.1016/j.ijforecast.2021.03.012
中图分类号
F [经济];
学科分类号
02 ;
摘要
Multi-horizon forecasting often contains a complex mix of inputs - including static (i.e. time-invariant) covariates, known future inputs, and other exogenous time series that are only observed in the past - without any prior information on how they interact with the target. Several deep learning methods have been proposed, but they are typically `black-box' models that do not shed light on how they use the full range of inputs present in practical scenarios. In this paper, we introduce the Temporal Fusion Transformer (TFT) - a novel attention-based architecture that combines high-performance multi-horizon forecasting with interpretable insights into temporal dynamics. To learn temporal relationships at different scales, TFT uses recurrent layers for local processing and interpretable self-attention layers for long-term dependencies. TFT utilizes specialized components to select relevant features and a series of gating layers to suppress unnecessary components, enabling high performance in a wide range of scenarios. On a variety of real-world datasets, we demonstrate significant performance improvements over existing benchmarks, and highlight three practical interpretability use cases of TFT. (C) 2021 The Authors. Published by Elsevier B.V. on behalf of International Institute of Forecasters.
引用
收藏
页码:1748 / 1764
页数:17
相关论文
共 43 条
[1]  
Alaa A., 2019, Advances in Neural Information Processing Systems
[2]   Regime Changes and Financial Markets [J].
Ang, Andrew ;
Timmermann, Allan .
ANNUAL REVIEW OF FINANCIAL ECONOMICS, VOL 4, 2012, 4 :313-337
[3]  
[Anonymous], 2014, INT J EC SCI APPL RE
[4]  
Arik SO, 2019, ARXIV190807442
[5]  
Ba J. L., 2016, LAYER NORMALIZATION
[6]  
Baker S. R., 2020, UNPRECEDENTED STOCK, V26945
[7]  
Baltagi B.H., 2008, ECONOMETRICS, P129, DOI [10.1007/978-3-540-76516-5_6, DOI 10.1007/978, 10.1007/978]
[8]   Multiple-output modeling for multi-step-ahead time series forecasting [J].
Ben Taieb, Souhaib ;
Sorjamaa, Antti ;
Bontempi, Gianluca .
NEUROCOMPUTING, 2010, 73 (10-12) :1950-1957
[9]  
Böse JH, 2017, PROC VLDB ENDOW, V10, P1694
[10]   Multi-horizon inflation forecasts using disaggregated data [J].
Capistran, Carlos ;
Constandse, Christian ;
Ramos-Francia, Manuel .
ECONOMIC MODELLING, 2010, 27 (03) :666-677