Web Query Reformulation via Joint Modeling of Latent Topic Dependency and Term Context

被引:19
作者
Bing, Lidong [1 ]
Lam, Wai [1 ]
Wong, Tak-Lam [2 ]
Jameel, Shoaib [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Inst Educ, Dept Math & Informat Technol, Hong Kong, Hong Kong, Peoples R China
关键词
Algorithms; Experimentation; Web query reformulation; graphical model; social tagging; query log; PROBABILISTIC FUNCTIONS; RETRIEVAL; FEEDBACK;
D O I
10.1145/2699666
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An important way to improve users' satisfaction in Web search is to assist them by issuing more effective queries. One such approach is query reformulation, which generates new queries according to the current query issued by users. A common procedure for conducting reformulation is to generate some candidate queries first, then a scoring method is employed to assess these candidates. Currently, most of the existing methods are context based. They rely heavily on the context relation of terms in the history queries and cannot detect and maintain the semantic consistency of queries. In this article, we propose a graphical model to score queries. The proposed model exploits a latent topic space, which is automatically derived from the query log, to detect semantic dependency of terms in a query and dependency among topics. Meanwhile, the graphical model also captures the term context in the history query by skip-bigram and n-gram language models. In addition, our model can be easily extended to consider users' history search interests when we conduct query reformulation for different users. In the task of candidate query generation, we investigate a social tagging data resource-Delicious bookmark-to generate addition and substitution patterns that are employed as supplements to the patterns generated from query log data.
引用
收藏
页数:38
相关论文
共 92 条
[1]  
Agichtein E., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P19, DOI 10.1145/1148170.1148177
[2]  
Alfonseca E., 2009, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3, EMNLP '09, V3, P1046
[3]  
Alfonseca Enrique, 2009, P HUM LANG TECHN 200, P29
[4]  
Anagnostopoulos Aris., 2010, Proceedings of the Third International Conference on Web Search and Web Data Mining, WSDM 2010, New York, NY, USA, February 4-6, 2010, P161
[5]  
[Anonymous], 2008, P 17 ACM C INF KNOWL
[6]  
[Anonymous], 2009, P 2009 WORKSH WEB SE, DOI DOI 10.1145/1507509.1507518
[7]  
[Anonymous], 2002, P ACM SIGKDD KDD 200, DOI 10.1145/775047.775067
[8]  
[Anonymous], 2010, Proceedings of the 19th international conference on World wide web, WWW '10
[9]  
[Anonymous], 2002, WWW
[10]  
[Anonymous], 2011, P 20 INT C WORLD WID, DOI DOI 10.1145/1963405.1963424