Uncovering the message from the mess of big data

被引:33
作者
Bendle, Neil T. [1 ]
Wang, Xin [1 ]
机构
[1] Univ Western Ontario, Ivey Business Sch, 1255 Western Rd, London, ON N6G 0N1, Canada
关键词
Big data; User-Generated content; Latent Dirichlet Allocation; Topic modeling; Market research; Qualitative data;
D O I
10.1016/j.bushor.2015.10.001
中图分类号
F [经济];
学科分类号
02 ;
摘要
User-generated content, such as online product reviews, is a valuable source of consumer insight. Such unstructured big data is generated in real-time, is easily accessed, and contains messages consumers want managers to hear. Analyzing such data has potential to revolutionize market research and competitive analysis, but how can the messages be extracted? How can the vast amount of data be condensed into insights to help steer businesses' strategy? We describe a nonproprietary technique that can be applied by anyone with statistical training. Latent Dirichlet Allocation (LDA) can analyze huge amounts of text and describe the content as focusing on unseen attributes in a specific weighting. For example, a review of a graphic novel might be analyzed to focus 70% on the storyline and 30% on the graphics. Aggregating the content from numerous consumers allows us to understand what is, collectively, on consumers' minds, and from this we can infer what consumers care about. We can even highlight which attributes are seen positively or negatively. The value of this technique extends well beyond the CMO's office as LDA can map the relative strategic positions of competitors where they matter most: in the minds of consumers. (C) 2015 Kelley School of Business, Indiana University. Published by Elsevier Inc. All rights reserved.
引用
收藏
页码:115 / 124
页数:10
相关论文
共 11 条
[1]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[2]  
Blei David M., 2009, Text Mining: Classification, Clustering, and Applications, P71
[3]   Finding scientific topics [J].
Griffiths, TL ;
Steyvers, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 :5228-5235
[4]   Unsupervised learning by probabilistic latent semantic analysis [J].
Hofmann, T .
MACHINE LEARNING, 2001, 42 (1-2) :177-196
[5]  
Lavalle S, 2011, MIT SLOAN MANAGE REV, V52, P21
[6]  
Reichheld F.F., 2011, The Ultimate Question 2.0: How Net Promoter Companies Thrive in a Customer-driven World
[7]   Big data and talent management: Using hard data to make the soft stuff easy [J].
Russell, Chuck ;
Bennett, Nathan .
BUSINESS HORIZONS, 2015, 58 (03) :237-242
[8]  
Simonson Itamar., 2014, ABSOLUTE VALUE WHAT
[9]  
Thelen S., 2004, Business Horizons, V47, P25, DOI 10.1016/j.bushor.2004.09.005
[10]   Mining Marketing Meaning from Online Chatter: Strategic Brand Analysis of Big Data Using Latent Dirichlet Allocation [J].
Tirunillai, Seshadri ;
Tellis, Gerard J. .
JOURNAL OF MARKETING RESEARCH, 2014, 51 (04) :463-479