WisColl: Collective wisdom based blog clustering

被引:25
作者
Agarwal, Nitin [1 ]
Galan, Magdiel [2 ]
Liu, Huan [2 ]
Subramanya, Shankar [2 ]
机构
[1] Univ Arkansas, Dept Informat Sci, Little Rock, AR 72204 USA
[2] Arizona State Univ, Tempe, AZ 85287 USA
关键词
Blog; Cluster; Collective wisdom; Blogosphere; Social networks; Web; 2.0; Wisdom of crowds;
D O I
10.1016/j.ins.2009.07.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Blogosphere is expanding in an unprecedented speed. A better understanding of the blogosphere can greatly facilitate the development of the Social Web to serve the needs of users, service providers, and advertisers. One important task in this process is clustering blog sites. Although a good number of traditional clustering methods exists, they are not designed to take into account the blogosphere unique characteristics. Clustering blog sites presents new challenges. A prominent feature of the Social Web is that many enthusiastic bloggers voluntarily write, tag, and catalog their posts in order to reach the widest possible audience who will share their thoughts and appreciate their ideas. In the process a new kind of collective wisdom is generated. We propose WisColl by tapping into this collective wisdom when clustering blog sites. In this paper. we study how clustering with collective wisdom can be achieved and compare it with a representative traditional clustering method. We present statistical and visual results, report findings and suggest future work extending to many real-world applications. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:39 / 61
页数:23
相关论文
共 31 条
[1]  
AGARWAL N, 2008, TR08004 AR STAT U
[2]  
AGARWAL N, 2009, P 3 INT ASS ADV ART
[3]  
Anderson C., 2006, LONG TAIL WHY FUTURE
[4]  
[Anonymous], P AAAI 06
[5]  
[Anonymous], NSF S NEXT GEN DAT M
[6]  
[Anonymous], 2004, P 17 INT C NEUR INF
[7]  
Bansal N., 2007, VLDB, P806
[8]  
Brooks ChristopherH., 2006, WWW '06, P625, DOI DOI 10.1145/1135777.1135869
[9]  
Chin Alvin., 2006, Proceedings of the seventeenth conference on Hypertext and hypermedia, P11
[10]  
CUTTING DR, 1992, SIGIR 92 : PROCEEDINGS OF THE FIFTEENTH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P318