Clustering gene expression profile data by selective shrinkage

被引：7

作者：

Ishwaran, Hemant ^{[1
]}

Rao, J. Sunil

机构：

[1] Cleveland Clin, Cleveland, OH 44106 USA

[2] Case Western Reserve Univ, Cleveland, OH 44106 USA

来源：

STATISTICS & PROBABILITY LETTERS | 2008年 / 78卷 / 12期

关键词：

D O I：

10.1016/j.spl.2008.01.003

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Clustering of gene expression profiles is a widely used approach for finding macroscopic data structure. A complication in such analyses is that not all genes are informative for forming clusters and different clusters might have different transcription regulation. Driven by these considerations, we present a novel two-stage clustering approach. The first stage identifies informative genes by adaptive variable selection using pseudo-samples modeled by a high dimensional multigroup ANOVA model. Variables are selected using a rescaled spike and slab Bayesian hierarchical model having a special selective shrinkage property. The second stage Uses Output from the first stage for clustering. We demonstrate why selective shrinkage occurs, and by extension, why it is useful for the clustering paradigm. We analyze a human gene atlas expression dataset where the question of interest is to look for tissue-specific transcription regulation and investigate whether tissues can be grouped together due to similar genomic control. (C) 2008 Elsevier B.V. All rights reserved.

引用

页码：1490 / 1497

页数：8

共 16 条

[1] Incorporation of biological knowledge into distance for clustering genes [J].