Fast and efficient searching of biological data resources-using EB-eye

被引:30
作者
Valentin, Franck
Squizzato, Silvano
Goujon, Mickael
McWilliam, Hamish
Paern, Juri
Lopez, Rodrigo
机构
[1] External Service Group, EMBL-EBI
基金
美国国家卫生研究院; 英国惠康基金;
关键词
text search; biological databases; integration; interoperability; web services; Apache Lucene; RETRIEVAL-SYSTEM;
D O I
10.1093/bib/bbp065
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The EB-eye is a fast and efficient search engine that provides easy and uniform access to the biological data resources hosted at the EMBL-EBI. Currently, users can access information from more than 62 distinct datasets covering some 400 million entries. The data resources represented in the EB-eye include: nucleotide and protein sequences at both the genomic and proteomic levels, structures ranging from chemicals to macro-molecular complexes, gene-expression experiments, binary level molecular interactions as well as reaction maps and pathway models, functional classifications, biological ontologies, and comprehensive literature libraries covering the biomedical sciences and related intellectual property. The EB-eye can be accessed over the web or programmatically using a SOAP Web Services interface. This allows its search and retrieval capabilities to be exploited in workflows and analytical pipe-lines. The EB-eye is a novel alternative to existing biological search and retrieval engines. In this article we describe in detail how to exploit its powerful capabilities.
引用
收藏
页码:375 / 384
页数:10
相关论文
共 21 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
[Anonymous], AG DYN LANG JAV PLAT
[3]  
[Anonymous], NUCL ACIDS RES
[4]   The Universal Protein Resource (UniProt) 2009 [J].
Bairoch, Amos ;
Consortium, UniProt ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Argoud-Puy, Ghislaine ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bolleman, Jerven ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Ciapina, Luciane ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Delbard, Gwennaelle ;
Dornevil, Dolnide ;
Roggli, Paula Duek ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
James, Janet ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Kappler, Thomas ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D169-D174
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   Petabyte-scale innovations at the European Nucleotide Archive [J].
Cochrane, Guy ;
Akhtar, Ruth ;
Bonfield, James ;
Bower, Lawrence ;
Demiralp, Fehmi ;
Faruque, Nadeem ;
Gibson, Richard ;
Hoad, Gemma ;
Hubbard, Tim ;
Hunter, Christopher ;
Jang, Mikyung ;
Juhos, Szilveszter ;
Leinonen, Rasko ;
Leonard, Steven ;
Lin, Quan ;
Lopez, Rodrigo ;
Lorenc, Dariusz ;
McWilliam, Hamish ;
Mukherjee, Gaurab ;
Plaister, Sheila ;
Radhakrishnan, Rajesh ;
Robinson, Stephen ;
Sobhany, Siamak ;
Hoopen, Petra Ten ;
Vaughan, Robert ;
Zalunin, Vadim ;
Birney, Ewan .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D19-D25
[7]   ChEBI:: a database and ontology for chemical entities of biological interest [J].
Degtyarenko, Kirill ;
de Matos, Paula ;
Ennis, Marcus ;
Hastings, Janna ;
Zbinden, Martin ;
McNaught, Alan ;
Alcantara, Rafael ;
Darsow, Michael ;
Guedj, Mickael ;
Ashburner, Michael .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D344-D350
[8]  
Etzold T, 1996, METHOD ENZYMOL, V266, P114
[9]   The RESID database of protein modifications as a resource and annotation tool [J].
Garavelli, JS .
PROTEOMICS, 2004, 4 (06) :1527-1533
[10]   MRS: a fast and compact retrieval system for biological data [J].
Hekkelman, ML ;
Vriend, G .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W766-W769