Database development in toxicogenomics: Issues and efforts

被引:80
作者
Mattes, WB
Pettit, SD
Sansone, SA
Bushel, PR
Waters, MD
机构
[1] Pfizer Inc, Groton, CT 06340 USA
[2] ILSI Hlth & Environm Sci Inst, Washington, DC USA
[3] European Mol Biol Lab, European Bioinformat Inst, Hinxton, England
[4] NIEHS, Natl Ctr Toxicogenom, Dept Hlth & Human Serv, NIH, Res Triangle Pk, NC 27709 USA
关键词
ArrayExpress; bioinformatics; CEBS; database; EBI; HESI; MIAME; NCT; toxicogenomics;
D O I
10.1289/ehp.6697
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The marriage of toxicology and genomics has created not only opportunities but also novel informatics challenges. As with the larger field of gene expression analysis, toxicogenomics faces the problems of probe annotation and data comparison across different array platforms. Toxicogenomics studies are generally built on standard toxicology studies generating biological end point data, and as such, one goal of toxicogenomics is to detect relationships between changes in gene expression and in those biological parameters. These challenges are best addressed through data collection into a well-designed toxicogenomics database. A successful publicly accessible toxicogenomics database will serve as a repository for data sharing and as a resource for analysis, data mining, and discussion. It will offer a vehicle for harmonizing nomenclature and analytical approaches and serve as a reference for regulatory organizations to evaluate toxicogenomics data submitted as part of registrations. Such a database would capture the experimental context of in vivo studies with great fidelity such that the dynamics of the dose response could be probed statistically with confidence. This review presents the collaborative efforts between the European Molecular Biology Laboratory-European Bioinformatics Institute ArrayExpress, the International Life Sciences Institute Health and Environmental Science Institute, and the National Institute of Environmental Health Sciences National Center for Toxigenomics Chemical Effects in Biological Systems knowledge base. The goal of this collaboration is to establish public infrastructure on an international scale and examine other developments aimed at establishing toxicogenomics databases. In this review we discuss several issues common to such databases: the requirement for identifying minimal descriptors to represent the experiment, the demand for standardizing data storage and exchange formats, the challenge of creating standardized nomenclature and ontologies to describe biological data, the technical problems involved in data upload, the necessity of defining parameters that assess and record data quality, and the development of standardized analytical approaches.
引用
收藏
页码:495 / 505
页数:11
相关论文
共 38 条
[1]  
Ambrose StephenE., 1996, Undaunted Courage
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Ball CA, 2002, SCIENCE, V298, P539
[4]   Gene expression informatics - it's all in your mine [J].
Bassett, DE ;
Eisen, MB ;
Boguski, MS .
NATURE GENETICS, 1999, 21 (Suppl 1) :51-55
[5]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[6]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[7]   CGO: utilizing and integrating gene expression of microarray data in clinical research and data management [J].
Bumm, K ;
Zheng, MZ ;
Bailey, C ;
Zhan, FH ;
Chiriva-Internati, M ;
Eddlemon, P ;
Terry, J ;
Barlogie, B ;
Shaughnessy, JD .
BIOINFORMATICS, 2002, 18 (02) :327-328
[8]   THE UDP GLUCURONOSYLTRANSFERASE GENE SUPERFAMILY - SUGGESTED NOMENCLATURE BASED ON EVOLUTIONARY DIVERGENCE [J].
BURCHELL, B ;
NEBERT, DW ;
NELSON, DR ;
BOCK, KW ;
IYANAGI, T ;
JANSEN, PLM ;
LANCET, D ;
MULDER, GJ ;
CHOWDHURY, JR ;
SIEST, G ;
TEPHLY, TR ;
MACKENZIE, PI .
DNA AND CELL BIOLOGY, 1991, 10 (07) :487-494
[9]   Computational selection of distinct class- and subclass-specific gene expression signatures [J].
Bushel, PR ;
Hamadeh, HK ;
Bennett, L ;
Green, J ;
Ableson, A ;
Misener, S ;
Afshari, CA ;
Paules, RS .
JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (03) :160-170
[10]   MAPS: a microarray project system for gene expression experiment information and data validation [J].
Bushel, PR ;
Hamadeh, H ;
Bennett, L ;
Sieber, S ;
Martin, K ;
Nuwaysir, EF ;
Johnson, K ;
Reynolds, K ;
Paules, RS ;
Afshari, CA .
BIOINFORMATICS, 2001, 17 (06) :564-565