New Information Distance Measure and Its Application in Question Answering System

被引:3
作者
张显 [1 ]
郝宇 [1 ]
朱小燕 [1 ]
李明 [2 ]
机构
[1] Department of Computer Science and Technology,Tsinghua University
[2] David RCheriton School of Computer Science,University of Waterloo
关键词
information distance; normalized information distance; question answering system;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
081203 ; 0835 ;
摘要
<正>In a question answering (QA) system,the fundamental problem is how to measure tile distance between a question and an answer,hence ranking different answers.We demonstrate that such a distance can he precisely and mathematically defined.Not only such a definition is possible,it is actually provably better than any other feasible definitions. Not only such an ultimate definition is possible,but also it can be conveniently and fruitfully applied to construct a QA system.We have built such a system——QUANTA.Extensive experiments are conducted to justify the new theory.
引用
收藏
页码:557 / 572
页数:16
相关论文
共 8 条
[1]   The context-tree kernel for strings [J].
Cuturi, M ;
Vert, JP .
NEURAL NETWORKS, 2005, 18 (08) :1111-1123
[2]  
Philipp Cimiano,Steffen Staab.Learning by googling[J].ACM SIGKDD Explorations Newsletter,2004
[3]   Algorithmic clustering of music based on string compression [J].
Cilibrasi, R ;
Vitányi, P ;
de Wolf, R .
COMPUTER MUSIC JOURNAL, 2004, 28 (04) :49-67
[4]  
Andrej A. Muchnik.Conditional complexity and codes[J].Theoretical Computer Science,2002(1)
[5]  
Nikolai K. Vereshchagin,Michael V. Vyugin.Independent minimum length programs to translate between given strings[J].Theoretical Computer Science,2002(1)
[6]  
Alexei Chernov,Andrej Muchnik,Andrei Romashchenko,Alexander Shen,Nikolai Vereshchagin.Upper semi-lattice of binary strings with the relation “ x is simple conditional to y ”[J].Theoretical Computer Science,2002(1)
[7]  
Ming Li,Jonathan H. Badger,Xin Chen,Sam Kwong,Paul Kearney.An information-based sequence distance and its application to whole mitochondrial genome phylogeny[J].Bioinformatics,2001
[8]   Relaxing the Triangle Inequality in Pattern Matching [J].
Ronald Fagin ;
Larry Stockmeyer .
International Journal of Computer Vision, 1998, 30 :219-231