[Biopython-dev] Unigene flat file parser

Sean Davis sdavis2 at mail.nih.gov
Thu Oct 26 14:56:25 UTC 2006

I have put together a parser for the Unigene flat file format described here:


under the Hs.data section.  The actual .data files are included in the various 
organism-specific directories.  

Is there any interest in including this in biopython?  If so, I would 
appreciate some input on the code and details of contributions, etc.  The 
current code is available here:


Use like so and note that the ugrecord has much more information (in fact, all 
information is captured) in it that given in its __repr__.  

import Unigene

fh = file('Hs.data')  #downloaded previously from ftp, or whatever
ugparser = Unigene.Iterator(fh,Unigene.RecordParser())
for ugrecord in ugparser:
    print ugrecord

More information about the Biopython-dev mailing list