[Bioperl-l] Behavior of Bio::Species object

Michael Muratet mam@torchconcepts.com
Sun, 20 Oct 2002 16:36:32 -0500


Greetings

In processing GenBank files which I then write out in EMBL format, the
OS tag in the EMBL file carries information that was in the SOURCE tag
of the GenBank file. For example, using mouse.gbff, the SOURCE line
lists "house mouse" and the ORGANISM line below it gives Mus musculus
followed by the rest of the classification. If I initialize the species
field in a Bio::Seq object with the species object obtained from a
GenBank file and then write it out in EMBL format, the OS tag line will
say "Mus musculus (house mouse)". Wouldn't it be better just to have the
binomial species name? Should I always expect information after the
binomial name and parse off the first two words?

Thanks.

Mike