CART classification of human 5′ UTR sequences

被引:143
作者
Davuluri, RV
Suzuki, Y
Sugano, S
Zhang, MQ
机构
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[2] Univ Tokyo, Inst Med Sci, Dept Virol, Tokyo 1088639, Japan
关键词
D O I
10.1101/gr.GR-1460R
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A nonredundant database of 2312 full-length human 5'-untranslated regions (UTRs) was carefully prepared using state-of-the-art experimental and computational technologies. A comprehensive computational analysis of this data was conducted for characterizing the 5' UTR Features. Classification and regression tree (CART) analysis was used to classify the data into three distinct classes. Class I consists of mRNAs that are believed to be poorly translated with long 5' UTRs filled with potential inhibitory features. Class II consists of terminal oligopyrimidine tract (TOP) mRNAs that are regulated in a growth-dependent manner, and class III consists of mRNAs with Favorable 5' UTR features that may help efficient translation. The most accurate tree we found has 92.5% classification accuracy as estimated by cross validation. The classification model included the presence of TOP, a secondary structure, 5' UTR length, and the presence of upstream AUGs (uAUGs) as the most relevant variables. The present classification and characterization of the 5' UTRs provide precious information for better understanding the translational regulation of human mRNAs. Furthermore, this database and classification can help people build better computational models for predicting the 5'-terminal exon and separating the 5' UTR from the coding region.
引用
收藏
页码:1807 / 1816
页数:10
相关论文
共 34 条
[1]   ENHANCED TRANSLATIONAL EFFICIENCY OF A NOVEL TRANSFORMING GROWTH FACTOR-BETA-3 MESSENGER-RNA IN HUMAN BREAST-CANCER CELLS [J].
ARRICK, BA ;
GRENDELL, RL ;
GRIFFIN, LA .
MOLECULAR AND CELLULAR BIOLOGY, 1994, 14 (01) :619-628
[2]   The 5' terminal oligopyrimidine-tract confers translational control on TOP mRNAs in a cell-type and sequence context-dependent manner [J].
Avni, D ;
Biberman, Y ;
Meyuhas, O .
NUCLEIC ACIDS RESEARCH, 1997, 25 (05) :995-1001
[3]   POSITION-+5 AND POSITION-+6 CAN BE MAJOR DETERMINANTS OF THE EFFICIENCY OF NON-AUG INITIATION CODONS FOR PROTEIN-SYNTHESIS [J].
BOECK, R ;
KOLAKOFSKY, D .
EMBO JOURNAL, 1994, 13 (15) :3608-3617
[4]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[5]   The increased level of β1,4-galactosyltransferase required for lactose biosynthesis is achieved in part by translational control [J].
Charron, M ;
Shaper, JH ;
Shaper, NL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14805-14810
[6]   Translational control: the cancer connection [J].
Clemens, MJ ;
Bommer, UA .
INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 1999, 31 (01) :1-23
[7]   Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, Arabidopsis [J].
Duret, L ;
Mouchiroud, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (08) :4482-4487
[8]   Translational control:: a general mechanism for gene regulation during T cell activation [J].
Garcia-Sanz, JA ;
Mikulits, W ;
Livingstone, A ;
Lefkovits, I ;
Müllner, EW .
FASEB JOURNAL, 1998, 12 (03) :299-306
[9]   Control of translation initiation in animals [J].
Gray, NK ;
Wickens, M .
ANNUAL REVIEW OF CELL AND DEVELOPMENTAL BIOLOGY, 1998, 14 :399-458
[10]   What drives codon choices in human genes? [J].
Karlin, S ;
Mrazek, J .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (04) :459-472