[Biopython-dev] [Bug 2382] Generic FASTA parser

bugzilla-daemon at portal.open-bio.org bugzilla-daemon at portal.open-bio.org
Tue Oct 16 17:02:03 EDT 2007


http://bugzilla.open-bio.org/show_bug.cgi?id=2382





------- Comment #2 from biopython-bugzilla at maubp.freeserve.co.uk  2007-10-16 17:02 EST -------
Are there any other "FASTA like" formats you know of, in addition to
traditional sequence data and the 454 GSFlex quality score files?

We could do this using the old Scanner/Consumer model (see the pre-Martel
parse, CVS revision 1.8 of Bio/Fasta/__init__.py for example).

http://cvs.biopython.org/cgi-bin/viewcvs/viewcvs.cgi/biopython/Bio/Fasta/__init__.py?rev=1.8&cvsroot=biopython&content-type=text/vnd.viewcvs-markup

The scanner would be the same for all formats, and would pass the data with
whitespace (spaces, new lines etc) as is.  We could then have one consumer for
each supported FASTA variant:

_Scanner           Scans a FASTA-format stream.
_RecordConsumer    Consumes FASTA data to a Record object.
_SequenceConsumer  Consumes FASTA data to a Sequence object.
_QualityConsumer   (new) could build a list of integers for each record?


-- 
Configure bugmail: http://bugzilla.open-bio.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.


More information about the Biopython-dev mailing list