Evaluating collaborative filtering recommender systems

Cited by: 3552
Authors
Herlocker, JL
Konstan, JA
Terveen, LG
Riedl, JT
Affiliations
[1] Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
Keywords
experimentation; measurement; performance; collaborative filtering; recommender systems; metrics; evaluation
DOI
10.1145/963770.963772
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recommender systems have been evaluated in many, often incomparable, ways. In this article, we review the key decisions in evaluating collaborative filtering recommender systems: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole. In addition to reviewing the evaluation strategies used by prior researchers, we present empirical results from the analysis of various accuracy metrics on one content domain where all the tested metrics collapsed roughly into three equivalence classes. Metrics within each equivalency class were strongly correlated, while metrics from different equivalency classes were uncorrelated.
Pages: 5-53
Number of pages: 49