[Bioperl-l] Re: SeqIO::swiss.pm + "gsequence"

hilmar.lapp@pharma.Novartis.com hilmar.lapp@pharma.Novartis.com
Fri, 15 Sep 2000 14:55:44 +0100


Hmm. What do others think / have encountered?

Could you volunteer to perform a couple of tests for various programs
reading FastA format files to see what they do upon the empty seq and
whether they choke? I'm not sure whether it's a good idea to produce files
that look sensible but make other programs choke because they violate
format specs.

     Hilmar





Kris Boulez <krbou@pgsgent.be> on 15.09.2000 13:47:01

To:   HILMAR LAPP/PH/Novartis@PH
cc:   bioperl-l@bioperl.org, Lorenz Pollak <lorenz@ist.org>
Subject:  Re: [Bioperl-l] Re: SeqIO::swiss.pm + "gsequence"


Quoting hilmar.lapp@pharma.Novartis.com (hilmar.lapp@pharma.Novartis.com):
>
>
> Sorry that I didn't respond yet but I think the point of empty seqs is
> valid, and I wanted to change the code to allow for this unless someone
> vetoes. The problem still remains how to do IO for empty seqs (e.g. in
> FASTA you cannot have just the ID line, followed by the ID line of the
next
> seq). So, IO probably remains forbidden for empty seqs unless someone
comes
> up with a sensible solution.
>

For FastA files I would expect them to be (and have already encountered
them).

>1 description of nr 1
GCC
>2 description of empty sequence

>3 description for nr3
GCG



I would vote for an empty line where the sequence should be.


Kris,