Affect analysis of web forums and blogs using correlation ensembles

被引:79
作者
Abbasi, Ahmed [1 ]
Chen, Hsinchun [1 ]
Thoms, Sven [1 ]
Fu, Tianjun [1 ]
机构
[1] Univ Arizona, Dept Management Informat Syst, Artificial Intelligence Lab, Tucson, AZ 85721 USA
关键词
affective computing; discourse; emotion recognition; linguistic processing; machine learning; text mining;
D O I
10.1109/TKDE.2008.51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis of affective intensities in computer-mediated communication is important in order to allow a better understanding of online users' emotions and preferences. Despite considerable research on textual affect classification, it is unclear which features and techniques are most effective. In this study, we compared several feature representations for affect analysis, including learned n-grams and various automatically and manually crafted affect lexicons. We also proposed the support vector regression correlation ensemble (SVRCE) method for enhanced classification of affect intensities. SVRCE uses an ensemble of classifiers each trained using a feature subset tailored toward classifying a single affect class. The ensemble is combined with affect correlation information to enable better prediction of emotive intensities. Experiments were conducted on four test beds encompassing web forums, blogs, and online stories. The results revealed that learned n-grams were more effective than lexicon-based affect representations. The findings also indicated that SVRCE outperformed comparison techniques, including Pace regression, semantic orientation, and WordNet models. Ablation testing showed that the improved performance of SVRCE was attributable to its use of feature ensembles as well as affect correlation information. A brief case study was conducted to illustrate the utility of the features and techniques for affect analysis of large archives of online discourse.
引用
收藏
页码:1168 / 1180
页数:13
相关论文
共 38 条
[1]   Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums [J].
Abbasi, Ahmed ;
Chen, Hsinchun ;
Salem, Arab .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)
[2]  
[Anonymous], 2005, P ACM SIGIR 2005 WOR
[3]  
[Anonymous], P 8 INT C INT US INT
[4]  
[Anonymous], P AAAI SPRING S COMP
[5]  
[Anonymous], 1997, Proceedings of the fourteenth international conference on machine learning, DOI DOI 10.1016/J.ESWA.2008.05.026
[6]  
[Anonymous], 2004, THESIS
[7]  
[Anonymous], 1998, WORDNET ELECT LEXICA
[8]   Stylistic text classification using functional lexical features [J].
Argamon, Shlomo ;
Whitelaw, Casey ;
Chase, Paul ;
Hota, Sobhan Raj ;
Garg, Navendu ;
Levitan, Shlomo .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (06) :802-822
[9]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[10]  
Cherkauer K. J., 1996, AAAI WORKSH INT MULT, P15