[BioPython] Sequence from Fasta

Peter biopython at maubp.freeserve.co.uk
Tue Jul 1 16:06:08 UTC 2008


Giovanni wrote:
> yes, I think it will be useful to implement.
> I know of people who have written a customized fasta2tab script and
> use it quite frequently, so it would be good to support such a task.
> As you said before this format is commonly used in combination with
> grep/gawk scripts.

I've gone for the simple option about how to parse the first field, its used
as the record identifer (.id) and name only (nothing clever).  Here is my
suggested code, which you are welcome to download and try out.

Bug 2533 - Support for simple "tab" format in Bio.SeqIO
http://bugzilla.open-bio.org/show_bug.cgi?id=2533

If you want to try this yourself you'll need to download the new file
TabIO.py into the Bio/SeqIO folder and update Bio/SeqIO/__init__.py to
tell it about the new format (two new lines, see patch).

Peter



More information about the Biopython mailing list