Close
Help
Need Help?



Classifying Coding DNA with Nucleotide Statistics

Submit a Paper


Libertas Analytics


1758 Article Views

Publication Date: 28 Oct 2009

Journal: Bioinformatics and Biology Insights 2009:3 141-154

BBI
journal

102,425 Article Views

3,056,811 Libertas Article Views

More Statistics

Abstract In this report, we compared the success rate of classification of coding sequences (CDS) vs. introns by Codon Structure Factor (CSF) and by a method that we called Universal Feature Method (UFM). UFM is based on the scoring of purine bias (Rrr) and stop codon frequency. We show that the success rate of CDS/intron classification by UFM is higher than by CSF. UFM classifies ORFs as coding or non-coding through a score based on (i) the stop codon distribution, (ii) the product of purine probabilities in the three positions of nucleotide triplets, (iii) the product of Cytosine (C), Guanine (G), and Adenine (A) probabilities in the 1st, 2nd, and 3rd positions of triplets, respectively, (iv) the probabilities of G in 1st and 2nd position of triplets and (v) the distance of their GC3 vs. GC2 levels to the regression line of the universal correlation. More than 80% of CDSs (true positives) of Homo sapiens (>250 bp), Drosophila melanogaster (>250 bp) and Arabidopsis thaliana (>200 bp) are successfully classified with a false positive rate lower or equal to 5%. The method releases coding sequences in their coding strand and coding frame, which allows their automatic translation into protein sequences with 95% confidence. The method is a natural consequence of the compositional bias of nucleotides in coding sequences.


Post a Comment

x close

Discussion Add A Comment
No comments yet...Be the first to comment.


share on

Our Service Promise

  • Prompt Processing (Average 3 Weeks)
  • Fair & Constructive Peer Review
  • Professional Author Service
  • High Visibility
  • High Readership
  • What Our Authors Say

Quick Links

Follow Us We make it easy to find new research papers. RSS Feeds Email Alerts Twitter

BROWSE CATEGORIES
Our Testimonials
I found publishing in Liberas Academica a friendly process from submission, review, editing and publication.  Everything was handled to a high calibre and proficiently.  The quality of the reviews were as good as any I have experienced in publishing in scientific journals.
Professor Abdullah M Asiri (King Abdul Aziz University, Jeddah, Saudi Arabia) What our authors say