JOURNAL

Evolutionary Bioinformatics

Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup

Publication Date: 22 Dec 2010

Type: Short Report

Journal: Evolutionary Bioinformatics

Citation: Evolutionary Bioinformatics 2010:6 197-203

doi: 10.4137/EBO.S6259

Abstract

Background: Comparative genomics resources, such as ortholog detection tools and repositories, are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource, Roundup, using cloud computing; describe the operating principles required to achieve computational efficiency on the cloud; and detail procedures for improving cost-effectiveness so that computation is maximized at minimal cost.

Methods: Using the comparative genomics tool Roundup as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon’s Elastic Compute Cloud. To manage the ortholog processes, we designed a strategy to deploy the web service Elastic MapReduce and to maximize use of the cloud while minimizing costs. Specifically, we created a model that estimates cloud runtime from the size and complexity of the genomes being compared and uses those estimates to determine, in advance, the optimal order in which to submit jobs.
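The idea of ordering jobs by estimated runtime can be illustrated with a minimal Python sketch. It is not the authors' model: the runtime estimate (proportional to the product of the two genomes' gene counts), the coefficient, and all function names are assumptions made for illustration. The sketch assigns each job to the least-loaded worker and compares an arbitrary submission order with a longest-first order.

```python
import heapq


def estimate_runtime(n_genes_a, n_genes_b, coeff=1e-6):
    """Hypothetical runtime model (hours): proportional to the product of
    the two genomes' gene counts. The coefficient is an assumed placeholder."""
    return coeff * n_genes_a * n_genes_b


def makespan(runtimes, n_workers):
    """Assign jobs, in the given order, to whichever worker is least loaded.
    Returns the time at which the last worker finishes."""
    loads = [0.0] * n_workers
    heapq.heapify(loads)
    for r in runtimes:
        lightest = heapq.heappop(loads)
        heapq.heappush(loads, lightest + r)
    return max(loads)


def longest_first(runtimes):
    """Order jobs so the longest estimated runtimes are submitted first."""
    return sorted(runtimes, reverse=True)


if __name__ == "__main__":
    jobs = [1, 1, 1, 1, 4]  # estimated hours, in an arbitrary submission order
    print(makespan(jobs, 2))                 # long job arrives last
    print(makespan(longest_first(jobs), 2))  # long job scheduled first
```

In this toy case, submitting the longest job last leaves one worker idle while the other finishes it, whereas submitting it first lets the short jobs fill the remaining capacity.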

Results: We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon’s computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD (reciprocal smallest distance, the ortholog detection algorithm underlying Roundup) to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable to other comparative genomics tools and may be of significant benefit to labs seeking to use the cloud as an alternative to local computing infrastructure.
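How cluster size interacts with cost can be sketched as follows. This is an illustrative toy, not the model from the paper. It assumes that every node is billed per started hour until the last job finishes, that jobs are scheduled longest-first, and that the run must meet a deadline; the price, the deadline, and the function names are hypothetical.

```python
import heapq
import math


def lpt_makespan(runtimes, n_nodes):
    """Longest-first greedy schedule: hours until the last node finishes."""
    loads = [0.0] * n_nodes
    heapq.heapify(loads)
    for r in sorted(runtimes, reverse=True):
        heapq.heappush(loads, heapq.heappop(loads) + r)
    return max(loads)


def cheapest_cluster(runtimes, candidate_sizes, price_per_node_hour, deadline_hours):
    """Return (n_nodes, cost) for the cheapest cluster that meets the deadline,
    or None if no candidate does. Assumes all nodes stay up, and are billed
    per started hour, until the whole run completes."""
    best = None
    for n in candidate_sizes:
        span = lpt_makespan(runtimes, n)
        if span > deadline_hours:
            continue  # too slow: this cluster size misses the deadline
        cost = n * math.ceil(span) * price_per_node_hour
        if best is None or cost < best[1]:
            best = (n, cost)
    return best


if __name__ == "__main__":
    jobs = [4, 3, 3, 2]  # estimated hours per comparison (made-up values)
    print(cheapest_cluster(jobs, range(1, 5), 1.0, deadline_hours=6))
```

Under these assumptions, adding nodes beyond the point where the longest jobs dominate the schedule only adds idle, billed node-hours, which is the kind of waste a cluster-size model aims to avoid.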

