NUMERICAL CLASSIFICATION OF CODING SEQUENCES

被引:4
作者
COLLINS, DW [1 ]
LIU, CC [1 ]
JUKES, TH [1 ]
机构
[1] UNIV CALIF BERKELEY,DEPT INTEGRAT BIOL,BERKELEY,CA 94720
关键词
D O I
10.1093/nar/20.6.1405
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9...(TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
引用
收藏
页码:1405 / 1410
页数:6
相关论文
共 23 条
[1]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[2]   THE GENBANK GENETIC SEQUENCE DATA-BANK [J].
BILOFSKY, HS ;
BURKS, C .
NUCLEIC ACIDS RESEARCH, 1988, 16 (05) :1861-1863
[3]   HOW RELIABLY DO AMINO-ACID COMPOSITION COMPARISONS PREDICT SEQUENCE SIMILARITIES BETWEEN PROTEINS [J].
CORNISHBOWDEN, A .
JOURNAL OF THEORETICAL BIOLOGY, 1979, 76 (04) :369-386
[4]   ASSESSMENT OF PROTEIN SEQUENCE IDENTITY FROM AMINO-ACID COMPOSITION DATA [J].
CORNISHBOWDEN, A .
JOURNAL OF THEORETICAL BIOLOGY, 1977, 65 (04) :735-742
[5]  
DAYHOFF MO, 1978, ATLAS PROTEIN SEQU S, V5, P3
[6]  
DEJONG WW, 1988, P NATL ACAD SCI USA, V85, P7114
[7]  
DOOLITTLE RF, 1990, SFI S SCI C, V7, P21
[8]  
FISCHER R, 1988, J BIOL CHEM, V263, P17055
[9]   PRIMARY STRUCTURE OF HUMAN PLACENTAL ANTICOAGULANT PROTEIN [J].
FUNAKOSHI, T ;
HENDRICKSON, LE ;
MCMULLEN, BA ;
FUJIKAWA, K .
BIOCHEMISTRY, 1987, 26 (25) :8087-8092
[10]  
GRUNDMANN U, 1988, P NATL ACAD SCI USA, V85, P3709