Algorithm5: A technique for fuzzy similarity clustering of chemical inventories

被引:31
作者
Doman, TN
Cibulskis, JM
Cibulskis, MJ
McCray, PD
Spangler, DP
机构
[1] G. D. Searle, Skokie, IL 60077
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1996年 / 36卷 / 06期
关键词
D O I
10.1021/ci960361r
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Clustering of chemical inventories on the basis of structural similarity has been shown to be useful in a number of applications related to the utilization and enhancement of those inventories. However, the widely-used Jarvis-Patrick clustering algorithm displays a number of weaknesses which make it difficult to cluster large databases in a consistent, satisfactory, and timely manner. Jarvis-Patrick clusters tend to be either too large and heterogeneous (i.e., ''chained'') or too small and exclusive (i.e., under-clustered), and the algorithm requires time-consuming manual tuning. This paper describes a computer algorithm which is nondirective, in that it performs the clustering without manual tuning yet generates useful clustering results.
引用
收藏
页码:1195 / 1204
页数:10
相关论文
共 9 条
[1]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[2]  
Dubes R., 1980, ADV COMPUT, V19, P113
[3]  
Hansch C., 1995, Exploring QSAR-Fundamentals and Applications in Chemistry and Biology
[4]   CLUSTERING USING A SIMILARITY MEASURE BASED ON SHARED NEAR NEIGHBORS [J].
JARVIS, RA ;
PATRICK, EA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (11) :1025-1034
[5]   CAS AIMS TO KEEP CUSTOMERS BY IMPROVED SERVICE, PRICING, MANAGEMENT [J].
ROSS, LR .
CHEMICAL & ENGINEERING NEWS, 1994, 72 (44) :19-23
[6]  
ROSS LR, 1994, CHEM ENG NEWS, V72, P44
[7]  
WEININGER D, 1995, DAYLIGHT USERS MANUA
[8]  
Willett P., 1987, Similarity and clustering in chemical information systems
[9]   FUZZY SETS [J].
ZADEH, LA .
INFORMATION AND CONTROL, 1965, 8 (03) :338-&