A probabilistic relational algebra for the integration of information retrieval and database systems

被引:177
作者
Fuhr, N
Rolleke, T
机构
[1] Universitaet Dortmund, 44221 Dortmund
关键词
hypertext retrieval; imprecise data; logical retrieval model; probabilistic retrieval; relational data model; uncertain data; vague predicates;
D O I
10.1145/239041.239045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a probabilistic relational algebra (PRA) which is a generalization of standard relational algebra. In PRA, tuples are assigned probabilistic weights giving the probability that a tuple belongs to a relation. Based on intensional semantics, the tuple weights of the result of a PRA expression always conform to the underlying probabilistic model. We also show for which expressions extensional semantics yields the same results. Furthermore, we discuss complexity issues and indicate possibilities for optimization. With regard to databases, the approach allows for representing imprecise attribute values, whereas for information retrieval, probabilistic document indexing and probabilistic search term weighting can be modeled. We introduce the concept of vague predicates which yield probabilistic weights instead of Boolean values, thus allowing for queries with vague selection conditions. With these features, PRA implements uncertainty and vagueness in combination with the relational model.
引用
收藏
页码:32 / 66
页数:35
相关论文
共 49 条
[1]  
[Anonymous], PROBABILITY MEASURE
[2]   THE MANAGEMENT OF PROBABILISTIC DATA [J].
BARBARA, D ;
GARCIAMOLINA, H ;
PORTER, D .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1992, 4 (05) :487-502
[3]   AN EXTENDED RELATIONAL DOCUMENT-RETRIEVAL MODEL [J].
BLAIR, DC .
INFORMATION PROCESSING & MANAGEMENT, 1988, 24 (03) :349-371
[4]  
Buckley Chris, 1985, 85686 CORN U DEP COM
[5]  
CAVALLO R, 1987, 13TH P INT C VER LAR, P71
[6]  
CODD EF, 1986, ACM SIGMOD RECORD, V15, P53
[7]  
CROFT WB, 1991, P 14 ANN INT ACM SIG, P32, DOI DOI 10.1145/122860.122864
[8]   NON-1ST NORMAL-FORM UNIVERSAL RELATIONS - AN APPLICATION TO INFORMATION-RETRIEVAL SYSTEMS [J].
DESAI, BC ;
GOYAL, P ;
SADRI, F .
INFORMATION SYSTEMS, 1987, 12 (01) :49-55
[9]   QUERY BY IMAGE AND VIDEO CONTENT - THE QBIC SYSTEM [J].
FLICKNER, M ;
SAWHNEY, H ;
NIBLACK, W ;
ASHLEY, J ;
HUANG, Q ;
DOM, B ;
GORKANI, M ;
HAFNER, J ;
LEE, D ;
PETKOVIC, D ;
STEELE, D ;
YANKER, P .
COMPUTER, 1995, 28 (09) :23-32
[10]   A PROBABILISTIC LEARNING APPROACH FOR DOCUMENT INDEXING [J].
FUHR, N ;
BUCKLEY, C .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1991, 9 (03) :223-248