The bioperl toolkit:: Perl modules for the life sciences

被引:1209
作者
Stajich, JE
Block, D
Boulez, K
Brenner, SE
Chervitz, SA
Dagdigian, C
Fuellen, G
Gilbert, JGR
Korf, I
Lapp, H
Lehväslaiho, H
Matsalla, C
Mungall, CJ
Osborne, BI
Pocock, MR
Schattner, P
Senger, M
Stein, LD
Stupka, E
Wilkinson, MD
Birney, E
机构
[1] Duke Univ, Genet Program, Durham, NC 27710 USA
[2] Natl Res Council Canada, Inst Plant Biotechnol, Saskatoon, SK S7N 0W9, Canada
[3] AlgoNom, B-9052 Ghent, Belgium
[4] Univ Calif Berkeley, Dept Plant & Mol Biol, Berkeley, CA 94720 USA
[5] Affymetrix Inc, Emeryville, CA 94608 USA
[6] Open Bioinformat Fdn, Somerville, MA 02144 USA
[7] Univ Hosp Munster, IZKF, D-48149 Munster, Germany
[8] Welcome Trust Sanger Inst, Cambridge CB10 1SA, England
[9] Washington Univ, Dept Comp Sci, St Louis, MO 63130 USA
[10] Novartis Res Fdn, GNF, Genom Inst, San Diego, CA 92121 USA
[11] European Bioinformat Inst, Cambridge CB10 1SD, England
[12] Agr & Agri Food Canada, Saskatoon Res Ctr, Saskatoon, SK S7N 0X2, Canada
[13] Univ Calif Berkeley, Berkeley Drosophil Genome Project, Berkeley, CA 94720 USA
[14] Cogina, New York, NY 10022 USA
[15] Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Santa Cruz, CA 95064 USA
[16] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[17] Inst Mol & Cell Biol, Singapore 117609, Singapore
关键词
D O I
10.1101/gr.361602
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Bioperl project is an international open-source collaboration of biologists, bioinformaticians, and computer scientists that has evolved over the past 7 yr into the most comprehensive library of Perl modules available for managing and manipulating life-science information. Bioperl provides an easy-to-use, stable, and consistent programming interface for bioinformatics application programmers. The Bioperl modules have been successfully and repeatedly used to reduce otherwise complex tasks to only a few lines of code. The Bioperl object model has been proven to be flexible enough to support enterprise-level applications such as EnsEMBL, while maintaining an easy learning curve for novice Perl programmers. Bioperl is capable of executing analyses and processing results from programs such as BLAST, ClustalW, or the EMBOSS suite. Interoperation with modules written in Python and lava is supported through the evolving BioCORBA bridge. Bioperl provides access to data stores such as GenBank and SwissProt via a flexible series of sequence input/output modules, and to the emerging common sequence data storage format of the Open Bioinformatics Database Access project. This study describes the overall architecture of the toolkit, the problem domains that it addresses, and gives specific examples of how the toolkit can be used to solve common life-sciences problems. We conclude with a discussion of how the open-source nature of the project has contributed to the development effort.
引用
收藏
页码:1611 / 1618
页数:8
相关论文
共 23 条
[1]   XML, bioinformatics and data integration [J].
Achard, F ;
Vaysseix, G ;
Barillot, E .
BIOINFORMATICS, 2001, 17 (02) :115-125
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Beck K., 1999, EXTREME PROGRAMMING
[4]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[5]  
CHERVITZ SA, 1998, BIOPERL STANDARD PER
[6]   The Distributed Annotation System [J].
Dowell, Robin D. ;
Jokerst, Rodney M. ;
Day, Allen ;
Eddy, Sean R. ;
Stein, Lincoln .
BMC BIOINFORMATICS, 2001, 2 (1)
[7]  
Eddy S, 2001, HMMER PROFILE HIDDEN
[8]  
FELSENSTEIN J, 1983, PHYLIP PHYLOGENY INF
[9]  
Gamma E., 1995, Design Patterns: Elements of Reusable Object-Oriented Software, V206, P395, DOI 10.1093/carcin/bgs084
[10]   The Ensembl genome database project [J].
Hubbard, T ;
Barker, D ;
Birney, E ;
Cameron, G ;
Chen, Y ;
Clark, L ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Down, T ;
Durbin, R ;
Eyras, E ;
Gilbert, J ;
Hammond, M ;
Huminiecki, L ;
Kasprzyk, A ;
Lehvaslaiho, H ;
Lijnzaad, P ;
Melsopp, C ;
Mongin, E ;
Pettett, R ;
Pocock, M ;
Potter, S ;
Rust, A ;
Schmidt, E ;
Searle, S ;
Slater, G ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Stupka, E ;
Ureta-Vidal, A ;
Vastrik, I ;
Clamp, M .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :38-41