[Biopython-dev] Output sequence files
Iddo Friedberg
idoerg at cc.huji.ac.il
Sat May 26 23:44:57 EDT 2001
Hi,
Iddo:
: > So maybe we just need a writer for each {database}.Record types, and a
: > to_fasta converter and writer in Tools.
Jeff:
:
: Isn't this what the SeqIO directory is for?I had always hoped to get SeqIO
: functionality similar to bioperl's.
DAMN! I thought I was missing out on something...
Did I also miss the existence of writers for {database}.Record types?
Sorry about that. I'll look into SeqIO, and bioperl's one, see if I can
learn something. Thanks for clearing this up.
Iddo:
: > The problem arises from annotation. Do you think it's feasable to perform a
: > good GenPept (that's the GenBank translation database) <--> SwissProt
: > converter that will preserve everything?
: >
Jeff:
: The gold standard for preserving information, is if you can convert A to B
: back to A, and have it come out exactly the same.That'll probably be
: possible for a lot of records, but many of them will not work.For example,
: GenBank locations are much richer than SwissProt ones, so complex location
: semantics that SwissProt doesn't handle will be lost.
:
Actually, GenBank <--> SwissProt is probably the least convertible of the
kind. Many GenBank records hold the annotation to several CDS's, and
generally a GenBank sequence holds also untranslated regions, etc.
GenPept, the GenBank translation, is not much better: it holds coding
information and all sorts of stuff which is SwissProt irrelevant. And
vice-versa.
Iddo
--
Iddo Friedberg | Tel: +972-2-6758647
Dept. of Molecular Genetics and Biotechnology | Fax: +972-2-6757308
The Hebrew University - Hadassah Medical School | email: idoerg at cc.huji.ac.il
POB 12272, Jerusalem 91120 |
Israel |
http://bioinfo.md.huji.ac.il/marg/people-home/iddo/
More information about the Biopython-dev
mailing list