Semantic search for public opinions on urban affairs: A probabilistic topic modeling-based approach

被引:42
作者
Ma, Baojun [1 ]
Zhang, Nan [2 ]
Liu, Guannan [3 ]
Li, Liangqiang [4 ]
Yuan, Hua [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing 100876, Peoples R China
[2] Tsinghua Univ, Sch Publ Policy & Management, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Management & Econ, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Probabilistic topic modeling; Public opinions; Big data analysis; Semantic search; Latent Dirichlet allocation (LDA); FRAUD DETECTION; ONTOLOGY; WEB; RETRIEVAL; GOVERNMENT; IDENTIFICATION; FRAMEWORK; RANKING;
D O I
10.1016/j.ipm.2015.10.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The explosion of online user-generated content (UGC) and the development of big data analysis provide a new opportunity and challenge to understand and respond to public opinions in the G2C e-government context. To better understand semantic searching of public comments on an online platform for citizens' opinions about urban affairs issues, this paper proposed an approach based on the latent Dirichlet allocation (LDA), a probabilistic topic modeling method, and designed a practical system to provide users municipal administrators of B-city with satisfying searching results and the longitudinal changing curves of related topics. The system is developed to respond to actual demand from B-city's local government, and the user evaluation experiment results show that a system based on the LDA method could provide information that is more helpful to relevant staff members. Municipal administrators could better understand citizens' online comments based on the proposed semantic search approach and could improve their decision-making process by considering public opinions. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:430 / 445
页数:16
相关论文
共 85 条
[21]  
Ding L., 2007, ONTOLOGIES, P79, DOI [DOI 10.1007/978-0-387-37022-4_4, DOI 10.1007/978-0-387-37022-4]
[22]  
Dong H, 2008, 2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, P1404
[23]  
Du Weifu., 2010, P ACM INT C WEB SEAR, P111
[24]   Semantically enhanced Information Retrieval: An ontology-based approach [J].
Fernandez, Miriam ;
Cantador, Ivan ;
Lopez, Vanesa ;
Vallet, David ;
Castells, Pablo ;
Motta, Enrico .
JOURNAL OF WEB SEMANTICS, 2011, 9 (04) :434-452
[25]   Implementing and evaluating phrasal query suggestions for proximity search [J].
Feuer, Alan ;
Savev, Stefan ;
Aslam, Javed A. .
INFORMATION SYSTEMS, 2009, 34 (08) :711-723
[26]   The hierarchical hidden Markov model: Analysis and applications [J].
Fine, S ;
Singer, Y ;
Tishby, N .
MACHINE LEARNING, 1998, 32 (01) :41-62
[27]  
GAO R, 2013, P 7 ACM C REC SYST, P355, DOI DOI 10.1145/2507157.2507219
[28]   Finding scientific topics [J].
Griffiths, TL ;
Steyvers, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 :5228-5235
[29]  
Hastie T., 2009, ELEMENTS STAT LEARNI, V2, DOI [10.1007/978-0-387-84858-7, DOI 10.1007/978-0-387-84858-7]
[30]   Probabilistic latent semantic indexing [J].
Hofmann, T .
SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, :50-57