[Biopython-dev] [Bug 2778] Efficiency improvement in function	Bio.SeqUtils.GC()
    bugzilla-daemon at portal.open-bio.org 
    bugzilla-daemon at portal.open-bio.org
       
    Thu Mar  5 05:56:39 EST 2009
    
    
  
http://bugzilla.open-bio.org/show_bug.cgi?id=2778
biopython-bugzilla at maubp.freeserve.co.uk changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED
------- Comment #2 from biopython-bugzilla at maubp.freeserve.co.uk  2009-03-05 05:56 EST -------
I've checked that in, but with the existing code to catch a zero length
sequence and return 0 instead of raising a ZeroDivisionError.
def GC(seq):
    """Calculates G+C content, ..."""
    gc=sum(map(seq.count,['G','C','g','c','S','s']))
    if gc == 0: return 0
    return gc*100.0/len(seq)
The old code had been modified several times - it originally calculated the GC%
as the CG count divided by the ATCG count, thus it had to count all the bases. 
You are right, this is much cleaner.
Thanks.
-- 
Configure bugmail: http://bugzilla.open-bio.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
    
    
More information about the Biopython-dev
mailing list