Close
Help
Need Help?





JOURNAL

Evolutionary Bioinformatics

1,181,932 Journal Article Views | Journal Analytics

Maximum Likelihood Analyses of 3,490 rbcL Sequences: Scalability of Comprehensive Inference Versus Group-Specific Taxon Sampling

Submit a Paper



Publication Date: 24 May 2010

Type: Original Research

Journal: Evolutionary Bioinformatics

Citation: Evolutionary Bioinformatics 2010:6 73-90

doi: 10.4137/EBO.S4528

Abstract

The constant accumulation of sequence data poses new computational and methodological challenges for phylogenetic inference, since multiple sequence alignments grow both in the horizontal (number of base pairs, phylogenomic alignments) as well as vertical (number of taxa) dimension. Put aside the ongoing controversial discussion about appropriate models, partitioning schemes, and assembly methods for phylogenomic alignments, coupled with the high computational cost to infer these, for many organismic groups, a sufficient number of taxa is often exclusively available from one or just a few genes (e.g., rbcL, matK, rDNA). In this paper we address scalability of Maximum-Likelihood-based phylogeny reconstruction with respect to the number of taxa by example of several large nested single-gene rbcL alignments comprising 400 up to 3,491 taxa. In order to test the effect of taxon sampling, we employ an appropriately adapted taxon jackknifing approach. In contrast to standard jackknifing, this taxon subsampling procedure is not conducted entirely at random, but based on drawing subsamples from empirical taxon-groups which can either be user-defined or determined by using taxonomic information from databases. Our results indicate that, despite an unfavorable number of sequences to number of base pairs ratio, i.e., many relatively short sequences, Maximum Likelihood tree searches and bootstrap analyses scale well on single-gene rbcL alignments with a dense taxon sampling up to several thousand sequences. Moreover, the newly implemented taxon subsampling procedure can be beneficial for inferring higher level relationships and interpreting bootstrap support from comprehensive analysis.



Downloads

PDF  (2.08 MB PDF FORMAT)

RIS citation   (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)

BibTex citation   (BIBDESK, LATEX)

XML






What Your Colleagues Say About Evolutionary Bioinformatics
I found the submission management system for Evolutionary Bioinformatics to be one of the most user-friendly around. The peer review was very rigorous and constructive. Support staff were polite and furnished accurate information almost instantly. I strongly recommend other scientists to consider this journal.
Dr Madhav P. Nepal (South Dakota State University, Brookings, SD, USA)
More Testimonials

Quick Links




Follow Us We make it easy to find new research papers.




SUBJECT HUBS
Author Survey Results
author_survey_results
All authors are surveyed after their articles are published. Authors are asked to rate their experience in a variety of areas, and their responses help us to monitor our performance. Presented here are their responses in some key areas. No 'poor' or 'very poor' responses were received; these are represented in the 'other' category.
See Our Results