Competitor mining with the web

Cited by: 52
Authors
Bao, Shenghua [1]
Li, Rui [1]
Yu, Yong [1]
Cao, Yunbo [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Microsoft Res Asia, Beijing 100190, Peoples R China
Keywords
information search and retrieval; content analysis and indexing; performance evaluation
DOI
10.1109/TKDE.2008.98
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper is concerned with the problem of mining competitors from the Web automatically. Fierce market competition requires every company to know not only which companies are its primary competitors but also in which domains its rivals compete with it and how strong those competitors are in a specific competitive domain. The task of competitor mining that we address in the paper includes mining all of this information: competitors, competing domains, and competitors' strength. A novel algorithm called CoMiner is proposed, which attempts Web-scale mining in a domain-independent manner. The CoMiner algorithm consists of three parts: 1) given an input entity, extracting a set of comparative candidates and then ranking them according to comparability, 2) extracting the domains in which the given entity and its competitors play against each other, and 3) identifying and summarizing the competitive evidence that details the competitors' strength. For evaluation, a prototype system implementing the CoMiner algorithm is presented. An evaluation data set consisting of 70 entities is constructed. A total of 728 competitors and 3,640 competitive domains with 6,381 pieces of competitive evidence are discovered with the prototype. The experimental results show that the proposed algorithm is highly effective.
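As a rough illustration of part 1 of the algorithm described in the abstract (extracting comparative candidates for an input entity and ranking them), here is a minimal Python sketch. It is not the CoMiner implementation. The comparative patterns, the function names extract_candidates and rank_candidates, and the use of raw match frequency as a stand-in for "comparability" are all assumptions made for this example; the abstract does not specify any of them. Parts 2 and 3 (competitive domains and evidence) are not covered.

```python
import re
from collections import Counter

# Hypothetical comparative patterns; "{e}" is replaced by the input entity.
# The abstract does not list the patterns CoMiner actually uses.
PATTERNS = [
    r"{e} (?:vs\.?|versus) (\w+)",
    r"(\w+) (?:vs\.?|versus) {e}",
    r"compare {e} (?:with|to) (\w+)",
]


def extract_candidates(entity, snippets):
    """Count how often each candidate co-occurs with `entity` in a pattern."""
    counts = Counter()
    for pattern in PATTERNS:
        regex = re.compile(pattern.format(e=re.escape(entity)), re.IGNORECASE)
        for snippet in snippets:
            for match in regex.findall(snippet):
                name = match.lower()
                if name != entity.lower():
                    counts[name] += 1
    return counts


def rank_candidates(counts):
    """Rank by match frequency (an assumed proxy for comparability),
    breaking ties alphabetically."""
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ordered]
```

In practice the snippets would come from Web search results for the entity; here they are simply a list of strings passed in by the caller.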
Pages: 1297-1310
Page count: 14