OR Demo Repository >
Open Repository >
Content Examples >
eHive: an artificial intelligence workflow system for genomic analysis.
this identifier to cite or link
to this item:
|Title: ||eHive: an artificial intelligence workflow system for genomic analysis.|
|Affiliation: ||European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK.|
|Citation: ||eHive: an artificial intelligence workflow system for genomic analysis. 2010, 11:240 BMC Bioinformatics|
|Journal: ||BMC bioinformatics|
|Issue Date: ||2010 |
|PubMed ID: ||20459813|
|Abstract: ||BACKGROUND: The Ensembl project produces updates to its comparative genomics resources with each of its several releases per year. During each release cycle approximately two weeks are allocated to generate all the genomic alignments and the protein homology predictions. The number of calculations required for this task grows approximately quadratically with the number of species. We currently support 50 species in Ensembl and we expect the number to continue to grow in the future. RESULTS: We present eHive, a new fault tolerant distributed processing system initially designed to support comparative genomic analysis, based on blackboard systems, network distributed autonomous agents, dataflow graphs and block-branch diagrams. In the eHive system a MySQL database serves as the central blackboard and the autonomous agent, a Perl script, queries the system and runs jobs as required. The system allows us to define dataflow and branching rules to suit all our production pipelines. We describe the implementation of three pipelines: (1) pairwise whole genome alignments, (2) multiple whole genome alignments and (3) gene trees with protein homology inference. Finally, we show the efficiency of the system in real case scenarios. CONCLUSIONS: eHive allows us to produce computationally demanding results in a reliable and efficient way with minimal supervision and high throughput. Further documentation is available at: http://www.ensembl.org/info/docs/eHive/.|
|Appears in Collections: ||Content Examples|
|Files in This Item:|
|what is an IR.doc||29Kb||Microsoft Word|
|what is an IR.pdf||61Kb||Adobe PDF|
|Related articles on PubMed|
Ensembl variation resources.
Chen Y, Cunningham F, Rios D, McLaren WM, Smith J, Pritchard B, Spudich GM, Brent S, Kulesha E, Marin-Garcia P, Smedley D, Birney E, Flicek P
2010 May 11
Flicek P, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Eyre T, Fitzgerald S, Fernandez-Banet J, Gräf S, Haider S, Hammond M, Holland R, Howe KL, Howe K, Johnson N, Jenkinson A, Kähäri A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Prlic A, Rice S, Rios D, Schuster M, Sealy I, Slater G, Smedley D, Spudich G, Trevanion S, Vilella AJ, Vogel J, White S, Wood M, Birney E, Cox T, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Hubbard TJ, Kasprzyk A, Proctor G, Smith J, Ureta-Vidal A, Searle S
Ergatis: a web interface and scalable software system for bioinformatics workflows.
Orvis J, Crabtree J, Galens K, Gussman A, Inman JM, Lee E, Nampally S, Riley D, Sundaram JP, Felix V, Whitty B, Mahurkar A, Wortman J, White O, Angiuoli SV
2010 Jun 15
|See all 67 articles|
All Items in Open Repository are protected by copyright, with all rights reserved, unless otherwise indicated.