Dynamic models of gene expression and classification

被引:28
作者
Dewey T.G. [1 ]
Galas D.J. [1 ]
机构
[1] Keck Grad. Inst. Applied Life Sci., Claremont, CA 91711
关键词
Gene control; Gene expression profiles; Microarray data; Time-series analysis;
D O I
10.1007/s101420000035
中图分类号
学科分类号
摘要
Powerful new methods, like expression profiles using cDNA arrays, have been used to monitor changes in gene expression levels as a result of a variety of metabolic, xenobiotic or pathogenic challenges. This potentially vast quantity of data enables, in principle, the dissection of the complex genetic networks that control the patterns and rhythms of gene expression in the cell. Here we present a general approach to developing dynamic models for analyzing time series of whole genome expression. In this approach, a self-consistent calculation is performed that involves both linear and non-linear response terms for interrelating gene expression levels. This calculation uses singular value decomposition (SVD) not as a statistical tool but as a means of inverting noisy and near-singular matrices. The linear transition matrix that is determined from this calculation can be used to calculate the underlying network reflected in the data. This suggests a direct method of classifying genes according to their place in the resulting network. In addition to providing a means to model such a large multi-variate system this approach can be used to reduce the dimensionality of the problem in a rational and consistent way, and suppress the strong noise amplification effects often encountered with expression profile data. Non-linear and higher-order Markov behavior of the network are also determined in this self-consistent method. In data sets from yeast, we calculate the Markov matrix and the gene classes based on the linear-Markov network. These results compare favorably with previously used methods like cluster analysis. Our dynamic method appears to give a broad and general framework for data analysis and modeling of gene expression arrays.
引用
收藏
页码:269 / 278
页数:9
相关论文
共 21 条
[1]  
Alter O., Brown P.O., Botstein D., Singular value decomposition for genome-wide expression data processing and modeling, Proc Natl Acad Sci USA, 18, pp. 10101-10106, (2000)
[2]  
Arkin A., Ross J., Statistical construction of chemical reaction mechanism from measured time-series, J Phys Chem, 99, pp. 970-980, (1995)
[3]  
Berne B.J., Statistical mechanics, part B: Time-dependent processes, (1977)
[4]  
Brown M.P.S., Grundy W.N., Lin D., Cristianini N., Sugnet C., Furey T.S., Ares M. Jr., Haussler D., Knowledge-based analysis of microarray gene expression data using support vector machines, Proc Natl Acad Sci USA, 97, pp. 262-267, (2000)
[5]  
Chen T., He H.L., Church G.M., Modeling gene expression with differential equations, Pac Symp Biocomput, 4, pp. 29-40, (1999)
[6]  
Chu S., DeRisi J., Eisen M., Mulholland J., Botstein D., Brown P.O., Herskowitz I., The transcriptional program of sporulation in budding yeast, Science, 282, pp. 699-705, (1998)
[7]  
Craig J.C., Eberwine J.H., Calvin J.A., Wlodarczyk B., Bennett G.D., Finnell R.H., Developmental expression of morphoregulatory genes in the mouse embryo: An analytical approach using novel technology, Biochem Mol Med, 60, pp. 81-91, (1997)
[8]  
Davidson E.H., A view from the genome: Spatial control of transcription in sea urchin development, Curr Opin Genet Dev, 9, pp. 530-541, (1999)
[9]  
DeRisi J.L., Iyer V.R., Brown P.O., Exploring the metabolic and genetic control of gene expression on a genomic scale, Science, 278, pp. 680-686, (1997)
[10]  
D'haeseleer P., Wen X., Fuhrman S., Somogyi R., Linear modeling of mRNA expression levels during CNS development and injury, Pac Symp Biocomput, 3, pp. 41-52, (1999)