[Biopython-dev] SeqIO
Thomas Sicheritz-Ponten
thomas at cbs.dtu.dk
Fri Sep 28 00:51:45 EDT 2001
Brad Chapman <chapmanb at arches.uga.edu> writes:
thx for the comments !
> One big question I have is, how many of the features do you want to
> try and retain in the conversion? So, for GenBank format, do you
> want me to just write out the basic information (sequence, type,
> etc) and ignore the feature table, or do we want to somehow map the
> features from format to format (ie. EMBL <-> GenBank).
All of them. I think each GenBank feature has an exact equivalence in EMBL
and SwissProt (GenPept). So that leaves us just with the definition of the
corresponding feature names.
>
> If we want to think about feature conversion, this'll be tougher and
> we'll need to think about converters between "similar" formats like
> EMBL and GenBank.
GenBank, EMBL and SwissProt ... where EMBL and SwissProt are almost
identical (I think...)
> => Why are you duplicating SeqRecord in the SeqIO stuff instead of
> just reusing it? I don't think I understand what you are talking
> about with stripping newlines...
I copied everything so that I cóuld play around without breaking e.g. your
code. Now I think the changes are actually backward compatible - so we
could move it back.
>
> => Is there a way to plug in a specialized converter for similar
> formats, like I was talking about above with EMBL/GenBank? I think
> Jeff suggested this earlier, and it seems like a good idea to me. I
> guess right now you could subclass ReadSeq and define your own
> Convert function, but maybe there is another way to do it.
I don't know if I understood this question...
A colleague and I, are thinking about converting SWISSPROT into a SQL
database for local use ... which actually gets close to a former
discussion where Andrew and I dreamed about a python variant of SRS !
My question: does anybody know about an already existing SQL tables for
SWISSPROT ? The step after that is actually creating an python interface for
generic queries, which would beat SRS ... at least on SWISSPROT.
cheers
-thomas
P.S. is anybody going to the Atlanta meeting in November ?
--
Sicheritz-Ponten Thomas, Ph.D CBS, Department of Biotechnology
thomas at biopython.org The Technical University of Denmark
CBS: +45 45 252489 Building 208, DK-2800 Lyngby
Fax +45 45 931585 http://www.cbs.dtu.dk/thomas
De Chelonian Mobile ... The Turtle Moves ...
More information about the Biopython-dev
mailing list