LOF: Identifying density-based local outliers

被引:5016
作者
Breunig, MM
Kriegel, HP
Ng, RT
Sander, J
机构
[1] Univ Munich, Inst Comp Sci, D-80538 Munich, Germany
[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
关键词
outlier detection; database mining;
D O I
10.1145/335191.335388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For many KDD applications, such as detecting criminal activities in E-commerce, finding the rare instances or the outliers, can. be more interesting than finding the common patterns. Existing work in outlier detection regards being an outlier as a binary property. In this paper, we contend that for many scenarios, it is more meaningful to assign to each object a degree of being an outlier. This degree is called the local outlier factor (LOF) of an object. It is local in that the degree depends on how isolated the object is with respect to the surrounding neighborhood. We give a detailed formal analysis showing that LOF enjoys many desirable properties. Using real-world datasets, we demonstrate that LOF can be used to find outliers which appear to be meaningful, but can otherwise not be identified with existing approaches. Finally, a careful performance evaluation of our algorithm confirms we show that our approach of finding local outliers can be practical.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 23 条
[1]  
[Anonymous], P ACM SIDMOD INT C M
[2]  
[Anonymous], 1999, P ACM SIGMOD INT C M
[3]  
Arning A., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P164
[4]  
Barnett V., 1984, Outliers in Statistical Data, V2nd
[5]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[6]  
DuMouchel W., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P189
[7]  
Ester M, 1996, 2 INT C KNOWL DISCOV, P226, DOI DOI 10.5555/3001460.3001507
[8]   Adaptive fraud detection [J].
Fawcett, T ;
Provost, F .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (03) :291-316
[9]  
Fayyad U., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P82
[10]  
Hawkins D.M, 1980, IDENTIFICATION OUTLI, V11, DOI [10.1007/978-94-015-3994-4, DOI 10.1007/978-94-015-3994-4]