[Biopython-dev] [Bug 2762] New: GFF capability in SeqIO
bugzilla-daemon at portal.open-bio.org
bugzilla-daemon at portal.open-bio.org
Tue Feb 17 08:22:57 EST 2009
http://bugzilla.open-bio.org/show_bug.cgi?id=2762
Summary: GFF capability in SeqIO
Product: Biopython
Version: 1.49b
Platform: All
OS/Version: All
Status: NEW
Severity: enhancement
Priority: P2
Component: Main Distribution
AssignedTo: biopython-dev at biopython.org
ReportedBy: lpritc at scri.sari.ac.uk
I'm increasingly coming across GFF format files, and SeqIO currently can't
handle them. It might be useful if at some point in the future, it could.
Also, the Bio.GFF module handles access to a database, and doesn't provide a
mechanism for importing or writing GFF format files. I'm not sure that there
is currently any facility to handle this format in Biopython.
There are at least two variants of the GFF format that I've seen in use...
GFF2 is the one I'm working with at the moment, and its specification is here:
http://www.sanger.ac.uk/Software/formats/GFF/GFF_Spec.shtml
I've come across GFF3 in other contexts, and it is defined here:
http://www.sequenceontology.org/gff3.shtml
Note that GFF3 is similar to GenBank files in that it may explicitly describe
both sequence features, and the sequence itself (potentially for multiple
sequences). GFF2 has the potential for this in the specification for the
Comments section, which includes a recommended syntax for defining sequences to
which the features refer, although that spec makes the reasonable assumption
that you would be able to obtain the sequence from elsewhere, knowing the
sequence ID from the GFF file.
--
Configure bugmail: http://bugzilla.open-bio.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
More information about the Biopython-dev
mailing list